David A. Grant, Associate Editor
University of Wisconsin 


Norman H. Anderson, University of California, Los Angeles
E. James Archer, University of Wisconsin
Fred Attneave, III, University of Oregon
Cletus J. Burke, Indiana University
James Deese, Johns Hopkins University
Paul M. Fitts, University of Michigan
Frank Geldard, University of Virginia
James J. Gibson, Cornell University
Harold W. Hake, University of Illinois
Arthur L. Irion, Tulane University


Helen Orr
Managing Editor 


Rura P. Jacorrnzer 
Editorial Assistant 


Journal of
Experimental Psychology


Arthur W. Melton, Editor


Department of Psychology, University of Michigan 
Ann Arbor, Michigan 


William K. Estes, Associate Editor
Stanford University 


Consulting Editors 


Howard H. Kendler, New York University
Herschel W. Leibowitz, International Business Machines Corporation, Bethesda, Maryland
Donald B. Lindsley, University of California, Los Angeles
Kenneth MacCorquodale, University of Minnesota
Quinn McNemar, Stanford University
Leo Postman, University of California, Berkeley
L. Starling Reid, University of Virginia
Kenneth W. Spence, State University of Iowa
Delos D. Wickens, Ohio State University


Elizabeth S. Reed
Advertising Manager 


Volume 64, 1962


PUBLISHED MONTHLY BY THE 


AMERICAN PSYCHOLOGICAL ASSOCIATION, INC 


PRINCE AND LEMON STS., LANCASTER, PA.
1333 SIXTEENTH ST. N. W., WASHINGTON 6, D.C.
Second-class postage paid at Lancaster, Pa.


CONTENTS OF VOLUME 64 


ADAMS, J. A. Test of the Hypothesis of Psychological Refractory Period
ADAMS, J. A., AND BOULTER, L. R. An Evaluation of the Activationist Hypothesis of Human
Vigilance


AXELROD, S. See BONEAU, C. A. 
BAKER, R. A. See SIPOWICZ, R. R.
BARTZ, A. E. Eye-Movement Latency, Duration, and Response Time as a Function of Angular
Displacement


Bevan, W. See Turner, E. D. 

BIEDERMAN, I. See FEHRER, E.

BONEAU, C. A., AND AXELROD, S. Work Decrement and Reminiscence in Pigeon Operant
Responding .... 352

BOULTER, L. R. See ADAMS, J. A.

BOUSFIELD, W. A. See KINCAID, W. D., JR.

BRACKBILL, Y., AND BRAVOS, A. Supplementary Report: The Utility of Correctly Predicting
Infrequent Events .... 648
BRALEY, L. S. Some Conditions Influencing the Acquisition and Utilization of Cues .... 62
BRAUNSTEIN, M. L. Depth Perception in Rotating Dot Patterns: Effects of Numerosity and
Perspective .... 415


Bravos, A. See BRACKBILL, Y. 

BRIGGS, G. E. See WILLIAMS, A. C.

BRIGGS, G. E., AND NAYLOR, J. C. The Relative Efficiency of Several Training Methods as a
Function of Transfer Task Complexity

BROGDEN, W. J. Contiguous Conditioning

BROGDEN, W. J. See ERNST, R. L.

BROGDEN, W. J. See WYNNE, J. D.

BUCHWALD, A. M. See HARROW, M.

CAPALDI, E. J., AND HART, D. Influence of a Small Number of Partial Reinforcement Training
Trials on Resistance to Extinction


505 


Cowan, P. A. See MANDLER, G. 
CURRAN, C. R., AND LANE, H. L. On the Relations among Some Factors That Contribute to
Estimates of Verticality

DIAMOND, A. L. Simultaneous Contrast as a Function of Test-Field Area


Di Vesta, F. J., AND Stover, D. O. The Semantic Mediation of Evaluative Meaning 


Duncan, C. P. See Isaacs, I. D. 
EGGER, M. D., AND MILLER, N. E. Secondary Reinforcement in Rats as a Function of
Information Value and Reliability of the Stimulus .... 97

ERIKSEN, C. W. See CHATTERJEE, B. B.
ERIKSEN, C. W. See PAUL, G. L.

ERNST, R. L., THOMPSON, C. P., AND BROGDEN, W. J. Effect of Pattern and Pleonasm Location
in Serial Lists upon Acquisition and Serial Position Errors .... 151

FAWCETT, J. T. See PROKASY, W. F.

FEHRER, E. See RAAB, D.

FEHRER, E., AND BIEDERMAN, I. A Comparison of Reaction Time and Verbal Report in the
Detection of Masked Stimuli .... 126

FELDMAN, R. S. See SALZINGER, K.

FLEISHMAN, E. A., AND PARKER, J. F., JR. Factors in the Retention and Relearning of
Perceptual-Motor Skill .... 215

FLESHLER, M. See HOFFMAN, H. S.

FORRIN, B. See MORIN, R. E.

GANZ, L. Hue Generalization and Hue Discriminability in Macaca Mulatta .... 142

GARNER, W. R. See WHITMAN, J. R.

GLANZER, M., AND PETERS, S. C. Re-examination of the Serial Position Effect .... 258

GLEITMAN, H., AND HERMAN, M. M. Replication Report: Latent Learning in a T Maze after
Shock in One End Box .... 646

GORMEZANO, I., MOORE, J. W., AND DEAUX, E. Supplementary Report: Yoked Comparisons
of Classical and Avoidance Eyelid Conditioning under Three UCS Intensities .... 551

GRANT, D. A. See HARTMAN, T. F.

GREENO, J. G. Effects of Nonreinforced Trials in Two-Choice Learning with Noncontingent
Reinforcement .... 373

HALL, J. F. See PROKASY, W. F.

HALPERN, S. See MEDNICK, S. A.

HAM, M. See UNDERWOOD, B. J.

HARRIS, J. R. See STEVENS, S. S.

HARRIS, S. J., SMITH, M. G., AND WEINSTOCK, S. Effects of Nonreinforcement on Subsequent
Reinforced Running Behavior .... 388

HARROW, M., AND BUCHWALD, A. M. Reversal and Nonreversal Shifts in Concept Formation
Using Consistent and Inconsistent Responses .... 476

HART, D. See CAPALDI, E. J.

HARTMAN, T. F., AND GRANT, D. A. Differential Eyelid Conditioning as a Function of the
CS-UCS Interval .... 131

HELLYER, S. Supplementary Report: Frequency of Stimulus Presentation and Short-Term
Decrement in Recall .... 650

HELSON, H., AND STEGER, J. A. On the Inhibitory Effects of a Second Stimulus Following the
Primary Stimulus to React .... 201

HERMAN, M. M. See GLEITMAN, H.

HETHERINGTON, M. See PICK, H. L., JR.

HILL, W. F., COTTON, J. W., AND CLAYTON, K. N. Effect of Reward Magnitude, Percentage of
Reinforcement, and Training Method on Acquisition and Reversal in a T Maze .... 81

HILL, W. F., AND SPEAR, N. E. Resistance to Extinction as a Joint Function of Reward
Magnitude and the Spacing of Extinction Trials .... 636

HILL, W. F., SPEAR, N. E., AND CLAYTON, K. N. T Maze Reversal Learning after Several
Different Overtraining Procedures .... 533

HILLNER, K. See PETERSON, L. R.

HOFFMAN, H. S., AND FLESHLER, M. The Course of Emotionality in the Development of
Avoidance .... 288

HONIG, W. K. Prediction of Preference, Transposition, and Transposition-Reversal from
the Generalization Gradient .... 239

227

HULSE, S. H. Partial Reinforcement, Continuous Reinforcement, and Reinforcement Shift
Effects .... 451

HUMPHREYS, L. G. See PAUL, G. L.

ISAACS, I. D., AND DUNCAN, C. P. Reversal and Nonreversal Shifts within and between
Dimensions in Concept Formation .... 580

ISON, J. R. Experimental Extinction as a Function of Number of Reinforcements .... 314

JAGODA, H. See D'AMATO, M. R.

JAKOBOVITS, L. A., AND LAMBERT, W. E. Mediated Satiation in Verbal Transfer .... 346

JENKINS, H. M. Resistance to Extinction When Partial Reinforcement Is Followed by Regular
Reinforcement .... 441

JENSEN, A. R. The von Restorff Isolation Effect with Minimal Response Learning .... 123

JOHANNSEN, W. J. Concept Identification under Misinformative and Subsequent Informative
Feedback Conditions .... 631

JONES, C. G. See THOMAS, D. R.

JONES, F. N., SINGER, D., AND TWELKER, P. A. Interaction among the Somesthetic Senses in
Judgments of Subjective Magnitude .... 105




KANUNGO, R. N., LAMBERT, W. E., AND MAUER, S. M. Semantic Satiation and Paired-
Associate Learning

KARLIN, L., AND MORTIMER, R. G. Effects of Visual and Verbal Cues on Learning a Motor
Skill

Kass, N. Resistance to Extinction as a 

KASWAN, J. W. See NAKAMURA, C. Y.

Katz, L. Monetary Incentive and Range of Payoffs as Determiners of Risk Taking......... 541 

KAUSLER, D. H. See CASSEM, N.

KENDLER, H. H. See KENDLER, T. S. 

KENDLER, T. S., AND KENDLER, H. H. Inferential Behavior in Children as a Function of Age
and Subgoal Constancy


KINCAID, W. D., JR., BOUSFIELD, W. A., AND WHITMARSH, G. A. The Parasitic
Reinforcement of Verbal Associative Responses .... 572

LAMBERT, W. E. See JaKoBOVITS, L. A. 

LAMBERT, W. E. See KANUNGO, R. N.

Lane, H. L. See Curran, C. R. 

LINDLEY, R. H. See MOYER, K. E.

LOGAN, F. A., AND WAGNER, A. R. Supplementary Report: Direction of Change in CS in
Eyelid Conditioning .... 325


as Factors in the Measurement of Acquired Fear .... 110
McCRACKEN, J., OSTERHOUT, C., AND VOSS, J. F. Effects of Instructions in Probability Learning .... 267
MANDLER, G., AND COWAN, P. A. Learning of Simple Structures .... 177

MAUER, S. M. See KANUNGO, R. N.
MECHANIC, A. Effects of Orienting Task, Practice, and Incentive on Simultaneous Incidental
and Intentional Learning .... 393
MEDNICK, S. A., AND HALPERN, S. Ease of Concept Attainment as a Function of Associative
Rank .... 628


MELTON, A. W. Editorial 553 


MILLER, N. E. See EGGER, M. D. 

Moore, J. W. See GORMEZANO, I. 

MORIN, R. E., AND FORRIN, B. Mixing of Two Types of S-R Associations in a Choice Reaction
Time Task .... 137

MORTIMER, R. G. See KARLIN, L.
MOYER, K. E., AND LINDLEY, R. H. Supplementary Report: Effects of Instructions on
Extinction and Recovery of a Conditioned Avoidance Response
MURDOCK, B. B., JR. The Serial Position Effect of Free Recall

MYERS, J. L. See MYERS, N. A.
MYERS, N. A., AND MYERS, J. L. Effects of Secondary Reinforcement Schedules in Extinction
on Children's Responding .... 586


Information on Spatial Stimulus Generalization 67 
NAYLOR, J. C. See BRIGGS, G. E.
NEISSER, U., AND WEENE, P. Hierarchies in Concept Attainment .... 640
NIES, R. C. Effects of Probable Outcome Information on Two-Choice Learning .... 430
OSTERHOUT, C. See McCRACKEN, J.
PALERMO, D. S. Mediated Association in a Paired-Associate Transfer Task .... 234

PARKER, J. F., JR. See FLEISHMAN, E. A.
PAUL, G. L., ERIKSEN, C. W., AND HUMPHREYS, L. G. Use of Temperature Stress with Cool
Air Reinforcement for Human Operant Conditioning .... 329
PENNYPACKER, H. S. See Krug, H. D.
PETERS, S. C. See GLANZER, M.


550 


623 


POSTMAN, L. The Effects of Language Habits on the Acquisition and Retention of Verbal
Associations .... 7


Postman, L. Retention of First-List Associations as a Function of the Conditions of Transfer 380 
PROKASY, W. F., FAWCETT, J. T., AND HALL, J. F. Recruitment, Latency, Magnitude, and
Amplitude of the GSR as a Function of Interstimulus Interval .... 513
RAAB, D., AND FEHRER, E. Supplementary Report: The Effect of Stimulus Duration and
Luminance on Visual Reaction Time .... 326

RICHARDSON, D. H. See ROSEN, H.

ROSEN, H., RICHARDSON, D. H., AND SALTZ, E. Supplementary Report: Meaningfulness as a
Differentiation Variable in the von Restorff Effect

SAFREN, M. A. Associations, Sets, and the Solution of Word Problems .... 40

SALTZ, E. See ROSEN, H.

SALZINGER, K., PORTNOY, S., AND FELDMAN, R. S. The Effect of Order of Approximation to the 
Statistical Structure of English on the Emission of Verbal Responses.....------+++++--> 52 

SALTZMAN, D. See PETERSON, L. R.

SCHIFF, D. See D'AMATO, M. R.

SCHOEFFLER, M. S. Prediction of Some Stochastic Events: A Regret Equalization Model .... 615

SCHULZ, R. W. See UNDERWOOD, B. J.

SCHULZ, R. W., AND TUCKER, I. F. Supplementary Report: Stimulus Familiarization in
Paired-Associate Learning

SHAFFER, J. P. Discrimination and Mediated Generalization in Probability Learning

SINGER, D. See JONES, F. N.

SIPOWICZ, R. R., WARE, J. R., AND BAKER, R. A. The Effects of Reward and Knowledge of
Results on the Performance of a Simple Vigilance Task .... 58
SMITH, M. G. See HARRIS, S. J.
SMITH, S. L. Color Coding and Visual Search .... 434

SPEAR, N. E. See HILL, W. F.

Stannis, C. D., AND CHAMPION, R. A. Spatial S-R Contiguity in Human Discrimination 
PORTING gales E E a T T 545 

STEGER, J. A. See HELSON, H. 

Stevens, S. S., AND Harris, J. R. The Scaling of Subjective Roughness and Smoothness... 489 

STOVER, D. O. See DI VESTA, F. J.

SUMMERS, S. A. The Learning of Responses to Multiple Weighted Cues

THEIOS, J. The Partial Reinforcement Effect Sustained Through Blocks of Continuous
Reinforcement .... 1

THOMAS, D. R. The Effects of Drive and Discrimination Training on Stimulus Generalization .... 24

THOMAS, D. R., AND JONES, C. G. Stimulus Generalization as a Function of the Frame of
Reference .... 77

THOMPSON, C. P. See ERNST, R. L.
TUCKER, I. F. See SCHULZ, R. W.
TURNER, E. D., AND BEVAN, W. Simultaneous Induction of Multiple Anchor Effects in the
Judgment of Form .... 589
TWELKER, P. A. See JONES, F. N.
UNDERWOOD, B. J. See KEPPEL, G.
UNDERWOOD, B. J., HAM, M., AND EKSTRAND, B. Cue Selection in Paired-Associate Learning .... 405
UNDERWOOD, B. J., KEPPEL, G., AND SCHULZ, R. W. Studies of Distributed Practice: XXII.
Some Conditions Which Enhance Retention .... 355
Aee PSs MECRA a a leith | aae OOO 
WAGNER, A. R. See LOGAN, F. A.
WARE, J. R. See SIPOWICZ, R. R.
Wess, W. B. See Wirr, J. L. 
WEENE, P. See NEISSER, U. 
WEINSTOCK, S. See HARRIS, S. J.
WHITMAN, J. R., AND GARNER, W. R. Free Recall Learning of Visual Figures as a Function
of Form of Internal Structure .... 558
WHITMARSH, G. A. See KINCAID, W. D., JR.

WILKINSON, R. T. Muscle Tension during Mental Work under Sleep Deprivation .... 565
WILLIAMS, A. C., AND BRIGGS, G. E. On-Target versus Off-Target Information and the
Acquisition of Tracking Skill
woun, J ae A Teat et fe Aranona Hypothesis for Verbal Learning 158 
r J. La BB, W. B. Supplementary Report: P; i i 
the Mastic of G y Repo roactive Inhibition as a Function of i 
te ail Supplementary Report: The Weinstock Partial Reinforcement Effect and Habit 
ite SR ee oe eC hi ee er ee ee ie a ee ares 647 
WOHLWILL, J. F. The Perspective Illusion: Perceived Size and Distance in Fields Varying in
Suggested Depth, in Children and Adults .... 300
WYNNE, J. D., AND BROGDEN, W. J. Supplementary Report: Effect upon Sensory Precondi-
tioning of Backward, Forward, and Trace Preconditioning Training .... 422


ZAJONC, R. B. Response Suppression in Perceptual Defense .... 206


Journal of 


Experimental Psychology 


Vol. 64, No. 1


July 1962


THE PARTIAL REINFORCEMENT EFFECT SUSTAINED 
THROUGH BLOCKS OF CONTINUOUS 
REINFORCEMENT¹

JOHN THEIOS²

Stanford University 


The discrimination interpretation 
of partial reinforcement (Bitterman, 
Fedderson, & Tyler, 1953; Mowrer & 
Jones, 1945) holds that resistance to 
extinction is a function of the degree 
of similarity between the training 
reinforcement schedule and the con- 
tinuous nonreinforcement of extinc- 
tion. According to this point of view, 
it is relatively easy for continuously 
reinforced Ss to discriminate when 
extinction starts because of the sharp 
contrast in the reinforcing events at 
the onset of extinction. Partially 
reinforced Ss have difficulty in dis- 
criminating the transition from train- 
ing to extinction because they have
experienced runs of nonreinforced
trials, and the first few extinction
trials provide a stimulus situation
which is similar to that experienced
during the nonreinforced runs in
training.


¹ This study is based on a dissertation sub-
mitted to the faculty of Stanford University
in partial fulfillment of the requirements for
the PhD degree. The writer wishes to express
his gratitude to the chairman of his disserta-
tion committee, Douglas H. Lawrence, and
to Gordon H. Bower for their cogent advice
and generous aid during all stages of the
development of this study. Thanks are also
due to Ernest R. Hilgard and Patrick Suppes
for their valuable comments during the later
stages. The research was conducted while
the writer was a National Science Foundation
Cooperative Graduate Fellow.

² Now at the University of Texas.

In stimulus generalization terms, 
continuously reinforced Ss experience 
a large stimulus change during extinc- 
tion which results in a large decre- 
ment in response strength. Partially 
reinforced Ss experience only a small 
stimulus change, and their response 
strength is decreased only slightly. 
The difference in resistance to extinc- 
tion is attributable to the differential 
stimulus generalization decrements. 

A possible test of the discrimination 
hypothesis would consist of varying 
the number of continuously rein- 
forced trials interpolated between 
an initial partial reinforcement series 
and an extinction series. The con- 
tinuously reinforced trials would serve 
to isolate the partial reinforcement 
training from extinction by permitting 
the partially reinforced Ss to have a 
sharp contrast in the reinforcing 
events at the onset of extinction 
similar to that experienced by con-
tinuously reinforced Ss. This stimu-
lus change experienced by Ss that 
have received some continuous rein- 
forcement following partial reinforce- 
ment should be larger than the change 
experienced by Ss that received only 
partial reinforcement, and according 
to the discrimination hypothesis, the 
former Ss should extinguish faster 
than the latter Ss. In general, the 
longer the block of continuous rein- 
forcement given after partial rein- 
forcement, the less similarity there 
should be between training and ex- 
tinction. Therefore, resistance to 
extinction is expected to decrease 
with the number of continuously 
reinforced trials interpolated between 
partial reinforcement and extinction. 
As the number of interpolated trials 
becomes large, resistance to extinction 
should approach that of continuously 
reinforced Ss. The present experi- 
ment was designed to test these pre- 
dictions from the discrimination theory 
of partial reinforcement. 


METHOD 


Apparatus.—The apparatus was a black
wooden runway 5 in. wide, 5 in. high, and 
99 in. long. The first 12 in. comprised the 
start box and the last 15 in. the goal box. 
The start and goal boxes were separated from 
the alley by metal guillotine doors. A 
1.8 X 2.5 in. food cup was concealed behind 
a 1.6-in. metal shield at the end of the goal 
box. The presence of the shield required S 
to put his head into the food cup to see its 
contents.

Response time was measured in .01 sec.
from the opening of the start door until S
interrupted a photobeam 5 in. inside the goal
box. On rewarded trials a food pellet was
automatically delivered by a solenoid feeder
when S interrupted a second photobeam
positioned across the food cup. The reward
consisted of one “Frostyos” pellet, a sugar- 
coated breakfast cereal produced by General 
Mills, Incorporated. On nonrewarded trials 
the solenoid operated, but no food was de- 
livered. The Ss required about 15 sec. to 
consume the pellet, and were removed to the
home cage 15 sec. after the interruption of
the first photobeam.

Subjects.—The Ss were 60 naive Slonaker
albino rats, 28 males and 32 females, selected
from the colony maintained by the Stanford
University Psychology Department. They 
were approximately 90 days old at the be- 
ginning of the experiment and had been 
tamed for 3 days prior to the experiment. 
The Ss were housed in individual cages with 
ad lib. water and were maintained on 22 hr. 
food deprivation throughout the experiment. 
The Ss were permitted to eat lab chow for 
2 hr. after each daily experimental session.

Procedure.—After taming, S was placed
into the goal box for 5 trials. Reward pellets 
were scattered throughout the goal box on the 
first 2 trials, but only the food cup contained 
pellets on the last 3 trials. When S ate one 
pellet he was removed to his home cage. 
Preliminary runway training followed on the 
next 6 days at 5 trials a day. The 30 pre- 
liminary runway trials were continuously 
rewarded for all Ss. The Ss were run in
rotation, and the intertrial interval decreased 
from approximately 60 to 30 min. during 
this preliminary training. 

After preliminary training, Ss were divided 
into five groups, equated on performance
during the preliminary 30 trials. During
the remaining part of the experiment Ss
received 5 runway trials each day. Three
groups of 14 Ss each received 40% partial
reinforcement (P) for 70 trials. The sched-
ule was random except for the restrictions
that runs of longer than 6 rewarded or non-
rewarded trials were excluded and that the
series ended with a rewarded trial. One of the
P groups was reduced to 13 Ss when a rat
died during the experiment. The remaining
two groups, each consisting of 9 Ss, received
continuous reinforcement (C) for 70 trials.
All Ss were left in the goal box for 15 sec.
on each trial during this phase, irrespective
of whether the trial was rewarded or not.
The intertrial intervals averaged about
10 min.
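
The restrictions on the partial schedule can be made concrete with a short sketch. The Python below is only an illustration under stated assumptions: the article does not say how the schedules were actually drawn, whether every S received the same order, or that exactly 28 of the 70 trials were rewarded (that count is simply 40% of 70). The names `make_schedule` and `longest_run` are hypothetical.

```python
import random


def longest_run(seq):
    """Length of the longest run of identical consecutive items in a non-empty sequence."""
    longest = current = 1
    for prev, cur in zip(seq, seq[1:]):
        current = current + 1 if cur == prev else 1
        longest = max(longest, current)
    return longest


def make_schedule(n_trials=70, p_reward=0.40, max_run=6, seed=None):
    """Return a list of booleans (True = rewarded trial).

    Assumed constraints, taken from the Procedure text:
      * 40% of the trials are rewarded (assumed here to mean exactly 28 of 70),
      * no run of rewarded or nonrewarded trials is longer than max_run,
      * the series ends with a rewarded trial.
    Orders are drawn by simple rejection sampling, which is an assumption.
    """
    rng = random.Random(seed)
    n_rewarded = round(n_trials * p_reward)
    trials = [True] * n_rewarded + [False] * (n_trials - n_rewarded)
    while True:
        rng.shuffle(trials)
        if trials[-1] and longest_run(trials) <= max_run:
            return list(trials)


schedule = make_schedule(seed=1)
print("".join("R" if rewarded else "N" for rewarded in schedule))
```

Rejection sampling is used only because it keeps every admissible order equally likely; any other procedure that honors the same restrictions would serve the illustration equally well.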

Groups P-0 and C-0 were extinguished
following the last of the 70 training trials.
Group P-25, however, received 25 additional
100% rewarded trials, and then was extin-
guished. Groups P-70 and C-70 were run
for 14 additional days on 100% reward,
receiving 70 additional continuously rewarded
trials before they were extinguished.

During extinction S received 5 trials each
day until he accumulated 5 trials, not neces-
sarily consecutive, during which he did not
enter the goal box within 60 sec. In order to
obtain extinction curves based on all Ss,
each S was run for at least 40 trials. If S did
not enter the goal box within 60 sec., he was
removed to his home cage and given a score
of 60 sec. If S entered the goal box, he was
detained in the box for 15 sec. The empty
feeder operated if S looked into the food cup.
The intertrial interval averaged about 10 min.
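
The extinction rule just described can be sketched as a small scoring routine. This is a hedged illustration, not the scoring procedure actually used: the function name `trials_to_criterion` is hypothetical, and the sketch assumes that running stopped as soon as both conditions were met (the criterion reached and at least 40 trials given), which the text does not state.

```python
def trials_to_criterion(latencies, limit=60.0, failures_needed=5, min_trials=40):
    """Score one S's extinction record.

    latencies: response times in sec., one per extinction trial; a trial on
    which S did not enter the goal box within `limit` is recorded as `limit`.
    Returns (trial on which the criterion was met, or None; trials actually run).
    """
    failures = 0
    criterion_trial = None
    for trial, latency in enumerate(latencies, start=1):
        if latency >= limit:
            failures += 1  # failures need not be consecutive
            if failures == failures_needed and criterion_trial is None:
                criterion_trial = trial
        # Assumption: stop once the criterion is met and min_trials have been run.
        if criterion_trial is not None and trial >= min_trials:
            return criterion_trial, trial
    return criterion_trial, len(latencies)


# Hypothetical record: criterion met early, but S is still run through Trial 40.
early = [2.0] * 10 + [60.0] * 5 + [3.0] * 30
print(trials_to_criterion(early))
```

Keeping the criterion trial separate from the number of trials run mirrors the distinction in the text between the extinction criterion and the 40-trial minimum used to obtain complete extinction curves.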


RESULTS 


Training.—Curves of response speed 
during training are presented in Fig. 1. 
An S's median response speed for each 
block of 5 trials was computed. Each 
point on the curves represents a group 
mean of these median scores for a 
given block of trials. The Ss were 
divided into five groups just before 
Block 7. An analysis of variance on 
the median speed scores of Block 7
yielded an F < 1.0. Hence, the
division of Ss did result in comparable
groups.
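
The scoring behind each plotted point can be sketched as follows. The sketch assumes that response speed is the reciprocal of response time in seconds, which is how the ordinate of Fig. 1 appears to be labeled; the function names and the sample times are hypothetical.

```python
from statistics import mean, median


def block_median_speeds(times_sec, block_size=5):
    """Median response speed (assumed to be 1 / time in sec.) for each block of trials."""
    speeds = [1.0 / t for t in times_sec]
    return [median(speeds[i:i + block_size])
            for i in range(0, len(speeds), block_size)]


def group_curve(times_by_subject, block_size=5):
    """Group mean of the individual block medians, one value per block."""
    per_subject = [block_median_speeds(times, block_size)
                   for times in times_by_subject]
    return [mean(block) for block in zip(*per_subject)]


# Two hypothetical Ss, one block of 5 trials each.
times = [[2.0, 2.0, 4.0, 1.0, 2.0],
         [1.0, 1.0, 1.0, 1.0, 1.0]]
print(group_curve(times))
```

Taking the median within each S before averaging across Ss keeps a single very slow trial from dominating a block score.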


The curves show that between 
Blocks 7 and 20, the trials on which 
the P Ss received 40% reinforcement, 
the continuously reinforced Ss even- 
tually ran faster than the partially 
reinforced Ss. The means and SDs of 
Ss’ median speeds during Block 20 
are presented in Table 1. An analysis
of variance performed on these scores
indicated that there was no significant
difference between the two continu-
ously reinforced groups (F = .28,
df = 1/54), nor among the three
partial groups (F = 1.14, df = 2/54).
However, the partial groups ran
significantly slower than the 100%
groups (F = 7.16, df = 1/54, P < .01).


[Figure: response speed (1/sec.) plotted against daily blocks of 5 trials]

FIG. 1. Response speeds during training.


TABLE 1
MEANS AND SDs OF SPEED SCORES ON
BLOCK 20

Group   Mean   SD    N
P-0     .64    .07   13
P-25    .66    .07   14
P-70    .62    .08   14
C-0     .70    .09    9
C-70    .72    .11    9

After Block 20, when Groups P-25 
and P-70 were changed from 40% 
reinforcement to 100%, their response 
speeds increased essentially to that 
of the C-70 Ss. Since Group P-25 
was extinguished following Block 25, 
a comparison of the P groups and 
Group C-70 was made by an analysis 
of variance on the Block 25 median 
speed scores. There was no signifi- 
cant difference between the two P 
groups (F=1.13, df = 1/34) nor 
between the combined P groups and 
Group C-70 (F = 1.00, df = 1/34). 
Thus, while on 40% reinforcement, 
the P Ss ran significantly slower than 
the 100% Ss. When the P Ss were 
subsequently given 100% reinforce- 
ment, however, their response speeds 
significantly increased so that the 
initial difference between the 40% 
and 100% Ss disappeared in less than 
25 trials. 

Extinction.—Curves of response
time during extinction are given in 
Fig. 2. Each point on the curves 
represents a group mean of individual 
median time scores for a given block 
of 5 extinction trials. The striking 
feature of Fig. 2 is the difference be- 
tween the extinction curves of the P 
and C groups. The three P groups, all 
of which initially received 40% rein- 
forcement, have very similar extinc-
tion curves in spite of the fact that one
group had 0, the second group 25, 
and the third 70 trials of 100% rein- 
forcement between partial reinforce- 
ment and extinction. The curves of 
the two C groups are also very similar 
to each other. The difference be- 
tween the P and C groups is evident 
as early as the second block of 5 
extinction trials and indicates much 
faster extinction for the C groups. 
Thus, additional trials of 100% rein- 
forcement following 40% reinforce- 
ment had little effect on rate of 
extinction. 

The large difference in extinction 
rate between the P and C groups 
evidenced by the curves is also re- 
flected in the mean number of trials 
to reach the extinction criterion 
(Table 2). All three P groups took
at least 50 trials to reach the criterion, 
but the C groups took only about 32 
trials. It can be seen that the SDs 
for the C groups are much smaller 
than those for the P groups, making 
inadvisable the use of an overall 
analysis of variance on the responses 
to the criterion scores. For Group 
C-0 and Group C-70, a t of .78 was
obtained, indicating that, for 100%
reinforced Ss, a difference between
100 and 170 training trials had no
effect on resistance to extinction.


[Figure: response time (sec.) plotted against daily blocks of 5 extinction trials]

FIG. 2. Response times during extinction.


TABLE 2
MEANS AND SDs OF RESPONSES TO THE
EXTINCTION CRITERION

Group   Mean   SD   N
P-0     61     14   13
P-25    66     11   14
P-70    50     10   14
C-0     33      5    9
C-70    31      6    9


Homogeneity of variance among the 
responses to the criterion for the three 
P groups permitted a 1 X 3 analysis 
of variance. The obtained F of 6.26 
(df = 2/38) is significant at the .01 
level, and indicates that individual 
t tests are appropriate. Using the 
overall within-groups variance as the 
variance estimate for the error term 
of the t tests, Groups P-0 and P-25 are
not significantly different from each
other (t = .88). The t between
Groups P-0 and P-70 of 2.46, however,
is significant (df = 25, P < .05). Simi-
larly, the t between Groups P-25 and
P-70 of 3.40 is significant (df = 26,
P < .01).
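
The arithmetic of this analysis can be retraced from the summary statistics in Table 2. The sketch below is an illustration only: it works from the rounded table entries rather than the raw scores, so it cannot reproduce the reported values exactly (it gives an F of about 6.8 and t values of about 1.1, 2.4, and 3.6, against the reported 6.26, .88, 2.46, and 3.40). The function names are hypothetical.

```python
from math import sqrt

# Rounded entries from Table 2 for the three P groups: group -> (mean, SD, N)
P_GROUPS = {"P-0": (61, 14, 13), "P-25": (66, 11, 14), "P-70": (50, 10, 14)}


def anova_from_summary(groups):
    """One-way analysis of variance computed from group means, SDs, and Ns."""
    n_total = sum(n for _, _, n in groups.values())
    grand_mean = sum(m * n for m, _, n in groups.values()) / n_total
    ss_between = sum(n * (m - grand_mean) ** 2 for m, _, n in groups.values())
    ss_within = sum((n - 1) * sd ** 2 for _, sd, n in groups.values())
    df_between = len(groups) - 1
    df_within = n_total - len(groups)
    ms_within = ss_within / df_within
    f_ratio = (ss_between / df_between) / ms_within
    return f_ratio, df_between, df_within, ms_within


def pooled_t(groups, a, b, ms_within):
    """t between groups a and b, using the overall within-groups variance as error."""
    (mean_a, _, n_a), (mean_b, _, n_b) = groups[a], groups[b]
    return (mean_a - mean_b) / sqrt(ms_within * (1 / n_a + 1 / n_b))


f_ratio, df_b, df_w, ms_w = anova_from_summary(P_GROUPS)
print(f"F = {f_ratio:.2f}, df = {df_b}/{df_w}")
for a, b in [("P-25", "P-0"), ("P-0", "P-70"), ("P-25", "P-70")]:
    print(f"t({a} vs. {b}) = {pooled_t(P_GROUPS, a, b, ms_w):.2f}")
```

The degrees of freedom (2/38) follow directly from the 41 Ss in the three P groups, which agrees with the text.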

In order to demonstrate that all 
the P groups are significantly more 
resistant to extinction than the C 
groups, a median test was run between 
Group P-70, the least resistant of the 
P groups, and the combination of the 
two C groups. The obtained χ² of
24.89 is highly significant (P < .001, 
df = 1). There is almost no overlap 
in the two distributions. This large 
difference was obtained in spite of the 
fact that Group P-70 had 70 con- 
tinuously rewarded trials just prior 
to extinction. 
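
A median test of this kind reduces to a chi square on a 2 X 2 table of scores above and below the combined median. The cell counts were not reported, so the split used below is an assumption: it is simply one arrangement (all 14 Group P-70 Ss above the combined median, 16 of the 18 C Ss below it, no continuity correction) that yields the reported value. The function name is hypothetical.

```python
def median_test_chi2(above_a, below_a, above_b, below_b):
    """Chi square (df = 1, no continuity correction) for a 2 x 2 median-test table."""
    n = above_a + below_a + above_b + below_b
    numerator = n * (above_a * below_b - below_a * above_b) ** 2
    denominator = ((above_a + below_a) * (above_b + below_b)
                   * (above_a + above_b) * (below_a + below_b))
    return numerator / denominator


# Assumed split: P-70 (N = 14) all above the median; combined C groups (N = 18), 2 above.
chi2 = median_test_chi2(above_a=14, below_a=0, above_b=2, below_b=16)
print(round(chi2, 2))  # 24.89
```

A split this extreme is consistent with the statement that there is almost no overlap in the two distributions.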


DISCUSSION


The present results indicate that the 
partial reinforcement effect (PRE) of 
increased resistance to extinction is rela- 
tively unaffected by continuous rein-
forcement following the partial reinforce-
ment. After the present experiment was 
completed, a study by Quartermain and 
Vaughan (1961) was published which 
bears on the same issue. Using lever 
pressing in a Skinner box, they com- 
pared resistance to extinction of rats 
given acquisition training with either 
800 responses of which a random 10% 
were reinforced, or 400 responses at 10% 
reinforcement followed by 400 continu- 
ously reinforced responses. Their results 
were entirely consistent with those re- 
ported here. These findings demonstrate 
that discriminability of the transition 
from training to extinction cannot be 
taken as explaining much of the partial 
reinforcement effect. 

Although it could be assumed that 
discrimination has some small effect since 
in the present study Group P-70 ex- 
tinguished slightly faster than Group 
P-0, the fact that there was no difference 
in resistance to extinction between
Groups P-0 and P-25 presents a
problem for the discrimination hy-
pothesis. It might be argued that 25
trials were not sufficient for Ss to have
noticed a change in the reinforcement 
schedule, and, hence, a difference be- 
tween Groups P-0 and P-25 should not 
have been expected. However, the 
schedule change clearly influenced run- 
ning performance. The response speeds 
of Group P-25 and P-70 Ss increased 
markedly during the 25 continuously 
reinforced trials, eventually equaling the 
speeds of the Group C-70 Ss. This 
increase in performance can be taken 
as evidence that the Ss "noticed" the
change. 

Another proposal might be that Group 
P-25’s resistance to extinction could 
have been increased by the fact that it 
had 25 more trials than Group P-0. 
However, Group C-70 had 70 more trials 
than did Group C-0, and these C groups 
did not differ in resistance to extinction. 
Thus, with the degree of training given 
in the present experiment, additional 
trials had no effect on resistance to 
extinction. 

The present experiment can be added
to a growing amount of empirical data
indicating that the discrimination hy- 
pothesis cannot adequately account for 
the PRE. Marx (1958) has shown that 
if certain significant features of the goal 
situation (i.e., the food cup) are omitted 
on half of the extinction trials, rate of 
extinction is decreased rather than 
increased as would be predicted from a 
discrimination analysis. The discrimina- 
tion theory has been most successful in 
accounting for the fact that rate of ex- 
tinction usually varies directly with the 
percentage of reward when number of 
trials or number of reinforcements are 
held constant. However, using a design 
which pitted percentage of reinforcement 
against number of nonreinforcements, 
Lawrence, Festinger, and Theios (sum- 
marized by Festinger, 1961) found that 
resistance to extinction was independent 
of the reinforcement percentage, but was 
determined primarily by the number 
of nonreinforced trials Ss experienced 
during training. 

The present data as well as these other 
studies suggest that the discrimination 
hypothesis is inadequate as an interpre- 
tation of partial reinforcement. The 
present data indicate that an adequate 
theory of partial reinforcement must 
have constructs representing relatively 
permanent effects of nonreinforcement 
which can be sustained through blocks 
of continuous reinforcement. At least 
four existing theories of partial rein- 
forcement employ constructs of this 
character. These theories include Amsel 
(1958), Estes (1959), Logan (1960), and 
Festinger (1961). The present results
do not enable one to discriminate among 
these theories. 


SUMMARY 


Three groups of 14 rats each received 70 
trials of random 40% reinforcement in a run- 
way. Then different numbers of continuously 
reinforced trials were interpolated between 
the 40% reinforcement and extinction. Group 
P-0 received none, Group P-25 received 25, 
and Group P-70 received 70 interpolated 
100% reinforced trials. Two control groups 
(N = 9 each) received only continuous rein- 


6 JOHN THEIOS 


forcement during training: Group C-0 was 
extinguished with partial Group P-0; Group 
C-70 was given 70 additional reinforced trials 
and extinguished with Group P-70. 

There was no significant difference in 
resistance to extinction between the two 
continuously reinforced groups. Group P-70 
was very significantly (P < .001) more resistant
to extinction than the two continuously
reinforced groups. There was no significant 
difference in resistance to extinction between 
Groups P-0 and P-25, but both of these groups 
were slightly more resistant than Group P-70. 
These results question the adequacy of the 
discrimination hypothesis that the partial 
reinforcement extinction effect results from 
difficulty in discriminating the transition 
from training to extinction. 


REFERENCES 


AMSEL, A. The role of frustrative nonreward
in noncontinuous reward situation. Psychol.
Bull., 1958, 55, 102-119.

BITTERMAN, M. E., FEDDERSON, W. E.,
& TYLER, D. W. Secondary reinforcement
and the discrimination hypothesis. Amer.
J. Psychol., 1953, 66, 456-464.

ESTES, W. K. The statistical approach to
learning theory. In S. Koch (Ed.),
Psychology: A study of a science. Vol. 2.
New York: McGraw-Hill, 1959.

FESTINGER, L. The psychological effects
of insufficient reward. Amer. Psychologist,
1961, 16, 1-11.

LOGAN, F. A. Incentive. New Haven: Yale
Univer. Press, 1960.

MARX, M. H. Resistance to extinction as a
function of continuous or intermittent
presentation of a training cue. J. exp.
Psychol., 1958, 56, 251-255.

MOWRER, O. H., & JONES, H. M. Habit
strength as a function of the pattern of
reinforcement. J. exp. Psychol., 1945,
43, 293-311.

QUARTERMAIN, D., & VAUGHAN, G. M.
Effect of interpolating continuous reinforcement
between partial training and
extinction. Psychol. Rep., 1961, 8, 235-237.


(Received June 6, 1961) 


Journal of Experimental Psychology
1962, Vol. 64, No. 1, 7-1 


THE EFFECTS OF LANGUAGE HABITS ON THE 
ACQUISITION AND RETENTION OF 
VERBAL ASSOCIATIONS¹


LEO POSTMAN 
University of California 


The experiments reported in this 
paper investigate the effects of unit- 
sequence habits on the acquisition 
and retention of verbal materials. 
By unit-sequence habits we refer to 
associative connections between ver- 
bal items established through linguis- 
tic usage (Underwood & Postman, 
1960). The transfer effects of such 
language habits may be either positive 
or negative, depending on the relationship
between S's pre-experimental
associations and those prescribed in 
the learning task. 


The amount of transfer from pre- 
experimental habits should increase with 
the frequency of usage of the verbal units 
in the language. The higher the fre- 
quency of usage, the larger on the 
average is the number of different con- 
texts in which an item is likely to appear 
and hence the larger the number of 
different associations which it acquires. 
Conventional indices of meaningfulness 
such as association value and m value 
do in fact correlate highly with frequency 
of usage (Postman, 1961; Underwood & 
Schulz, 1960). Both unit-sequence facili- 
tation and unit-sequence interference 
may then be expected to increase as a 
function of frequency of usage. When 
the sequence to be learned involves 
high-frequency items, at least some 
of the prescribed associations are likely 
to agree with pre-experimental habits 
or can be readily established through 
direct mediational links. However, the 
number of pre-experimental associations 
which can compete with the prescribed 
ones will also increase as a function
of frequency of usage. With low-frequency
materials, the prescribed sequences will
have little pre-experimental strength and
few direct mediators will be available, but
there will also be few pre-experimental
associations to produce interference.

¹ This research was supported by a grant
from the National Science Foundation.

The conditions of facilitation and 
interference should influence recall as 
well as acquisition. When there is unit- 
sequence facilitation, it is assumed that 
the strength of the pre-experimental 
habits has generalized to the prescribed 
associations within the list. The greater 
the generalized habit strength, the more 
resistant the items in the experimental 
list will be to interference. When there 
is unit-sequence interference, it is as- 
sumed that competing pre-experimental 
habits have been unlearned during 
acquisition and have recovered over time 
to become sources of interference at 
recall. 

An earlier study (Postman, 1961) 
investigated the acquisition and retention 
of serial lists of words of high and low 
frequency of usage (Lists HF and LF). 
List HF was learned faster than List LF. 
At the same time, however, the rate 
of misplaced responses was substantially 
greater for List HF. The positive rela- 
tionship between speed of learning and 
error rate points to covariation of unit- 
sequence facilitation and interference 
as a function of word frequency. There 
were no significant differences in the 
amount of retention for the two kinds 
of materials. However, misplaced re- 
sponses at recall recovered at a faster 
rate for List HF than for List LF. 
While there was no evidence for superior 
retention of materials of high meaning- 
fulness, the original expectation of an 
inverse relationship between word fre- 
quency and recall was not borne out. 
This expectation had been based on the 


8 LEO POSTMAN 


assumption that competition from extra- 
experimental associations would increase 
directly with word frequency. The dif- 
ference in recovery of errors is in accord 
with this assumption. However, the 
fact that the amounts recalled were 
comparable in spite of a substantial 
difference in the incidence of overt 
errors suggests that increases in unit- 
sequence interference may be offset in 
part by unit-sequence facilitation. 

The present experiments extend the 
analysis of the conditions of unit-se- 
quence facilitation and interference. A 
first step was to vary the word frequency 
of stimuli (Ss) and responses (Rs) in a 
study investigating the acquisition and 
retention of paired associates (Exp. I). 
There is considerable evidence that speed 
of learning varies directly with the 
meaningfulness of Ss and Rs, and that 
this relationship is considerably more 
pronounced and consistent for Rs than 
for Ss (Cieutat, Stockwell, & Noble, 
1958; Hunt, 1959; Kimble & Dufort, 
1955; L’Abate, 1959; Mandler & Camp- 
bell, 1957; Morikawa, 1959; Underwood 
& Schulz, 1960). In these experiments 
the units scaled for meaningfulness (e.g., 
trigrams or paralogs) were either non- 
sense items or, more usually, included 
both nonsense items and words. Letter-
sequence habits as well as unit-sequence
habits contribute to the differences in 
speed of learning obtained with such ma- 
terials (Underwood & Schulz, 1960). 
The lists used in Exp. I consisted entirely 
of words so that variations in speed of 
learning and amount of retention could be 
attributed primarily to the influence of 
unit-sequence habits. It was expected 
that both unit-sequence facilitation and 
unit-sequence interference would increase 
with word frequency, with the relative 
amounts of interference growing more 
rapidly as a function of S frequency than 
of R frequency. To the extent that 
paired items are practiced in the order 
of their presentation, competing associations
elicited by the Ss should be
more effective in delaying acquisition
than those evoked by the Rs. These
considerations suggest that speed of 
learning should rise to a maximum
and then decline as the word frequency
of Ss is increased. On the other hand, 
speed of learning should vary directly 
with the word frequency of Rs. 

No prior studies of the effects of S 
and R meaningfulness on the long-term 
retention of paired associates are avail- 
able. If the associative context of the 
Ss rather than the Rs is critical in deter- 
mining the amount of effective competi- 
tion at recall, differences in the amount 
of retention should be determined largely 
by the degrees of unit-sequence facilita- 
tion and interference falling on the S 
terms. Thus, the differences among S 
conditions in learning should predict 
recall better than those among the R 
conditions.

According to the present analysis, a 
sample of high-frequency Ss will activate 
strong pre-experimental habits whereas a 
sample of low-frequency Ss will not. It 
follows that manipulation of the condi- 
tions of facilitation and interference 
within a list should be more effective 
with low-frequency Ss than with high- 
frequency Ss. When pre-experimental 
associative probabilities are low, the 
introduction of otherwise unlikely sources 
of facilitation and interference based 
on language habits should produce major 
effects on learning and retention. On 
the other hand, when the associative 
probabilities are high, the introduction 
of known associates or competitors 
should add relatively little to the 
amounts of facilitation and interference 
from uncontrolled sources. This pre- 
diction was tested in Exp. II in which 
the pre-experimental probability of the 
prescribed associations was varied for 
Ss of high and low word frequency. 

Experiment III was designed to pro- 
vide a further test of the assumption 
that the higher the word frequency of a 
series of items, the greater is the amount 
of overlap among their associative con- 
texts. The method of verbal discrimina- 
tion learning was used. To the extent 
that discriminability depends on the 
availability of differential responses to 
the members of a pair, speed of discrimi- 
nation learning should increase with the 
meaningfulness of the paired items. 


ACQUISITION AND RETENTION OF ASSOCIATIONS 


TABLE 1
STIMULUS AND RESPONSE TERMS IN EXP. I

Sh         Rh         Sm         Rm         Sl         Rl
BUILDING   ANSWER     ARBOR      BASIN      BRAMBLE    ABBESS
COUNTRY    COLOR      BURLAP     DOGMA      CAUCUS     BUFFOON
DOCTOR     DINNER     CINDER     FETISH     DECOY      DOTAGE
GARDEN     FIGURE     DISCORD    HERMIT     FARTHING   HAREM
HUSBAND    MOMENT     GROCER     MINSTREL   GULLET     OBOE
LETTER     PROBLEM    LOTION     OMEN       LORRY      PREFIX
MORNING    REASON     MAGNATE    RELIC      OXIDE      RAMROD
PAPER      SHOULDER   OATMEAL    SUFFRAGE   PESTLE     STANZA
STORY      TABLE      TRAITOR    TEMPEST    SEQUEL     TENURE
WINDOW     WOMAN      WHISKER    WAFER      WICKET     WAMPUM


Such a relationship was found with pairs 
of nonsense syllables by Runquist and 
Freeman (1960). However, as the 
associates elicited by the items increase 
in number, they should begin to lose 
distinctiveness because of the increasing 
probability of overlap in associative 
context. Acquisition will then require 
the unlearning of differential responses 
which produce errors of generalization, 
but such responses may be expected 
to recover with the passage of time. 
Thus, speed of acquisition and amount 
of retention for the verbal discrimination 
task should first increase and then 
decrease as a function of word frequency. 


EXPERIMENT I 
Method 


Materials.—The learning materials were
two-syllable nouns of high, medium, and low 
frequency of usage. Each frequency range 
was represented by two lists of 10 words 
each, one of which served as the S list and the 
other as the R list (Sh, Sm, Sl; Rh, Rm, Rl).
These materials were used in the construction 
of nine paired-associate lists representing all 
the possible combinations of word frequencies 
of the S and R items. The lists of words are
shown in Table 1. 

The lists sample three frequency ranges in 
the “L” count of Thorndike and Lorge (1944). 
The numbers of occurrences in 4.5 million 
are 1000-3300, 10-33, and 1-3 for words of 
high, medium, and low frequency, respec- 
tively. The mean rated familiarity and m 
values, which were obtained in prior standardization
procedures, are correlated with frequency
of usage. The words were chosen
so as to minimize pre-experimental associations
between items in the S lists and items
in the R lists. Norms of free association 
obtained from 1000 students at the University 
of California were used for this purpose. 

There was no duplication of first letters 
in any of the S or R lists. For all nine combi- 
nations of word frequencies the S list and the 
R list had six first letters in common. Four 
different pairings of Ss and Rs were used 
equally often with each of the nine lists. 

Procedure.—The lists were presented on a 
Hull-type memory drum at a 2:2 rate (S 
alone for 2 sec. and S and R together for 2 
sec.), with an 8-sec. intertrial interval.
Learning was to a criterion of one perfect 
trial. Four different orders of presentation 
were used to minimize serial learning. Each 
of the four random orders was used as a start- 
ing order equally often. 

Retention was tested by relearning either 
30 sec. or 7 days after the end of original 
practice. Relearning was for five trials or to 
criterion, whichever took the longer. 

Subjects.—With nine types of lists and two 
retention intervals, there were 18 groups of 
16 Ss each. All Ss were undergraduate 
students who had English as their native 
language and were naive to rote-learning 
experiments. The Ss were assigned to conditions
in blocks of 18, with 1 S per block for
each combination of lists and retention inter- 
vals. The running order within each block 
was determined by a table of random num- 
bers, as was the assignment to different 
pairings and starting orders. No Ss were 
lost because of failure to learn. 
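
The blocked assignment described above can be sketched in code. This is only an illustration, assuming a seeded pseudorandom generator in place of the table of random numbers. The function name `assign_blocks` and the condition labels (Sh, Sm, Sl and Rh, Rm, Rl for high, medium, and low word frequency) are hypothetical shorthand, not part of the original procedure.

```python
import random
from itertools import product


def assign_blocks(n_blocks, seed=0):
    """Return one randomly ordered list of the 18 conditions per block."""
    rng = random.Random(seed)  # stand-in for a table of random numbers
    # 9 list types (S frequency x R frequency) crossed with 2 retention intervals
    conditions = [
        (s, r, interval)
        for s, r in product(("Sh", "Sm", "Sl"), ("Rh", "Rm", "Rl"))
        for interval in ("30-sec", "7-day")
    ]
    schedule = []
    for _ in range(n_blocks):
        block = conditions[:]   # every condition appears once per block
        rng.shuffle(block)      # random running order within the block
        schedule.append(block)
    return schedule


schedule = assign_blocks(16)
print(len(schedule), len(schedule[0]))
```

With 16 blocks of 18 Ss, each of the 18 conditions receives 16 Ss, which matches the group sizes given in the text.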


Results 


Original learning.—There were no
significant differences between the
30-sec. and 7-day groups learning a
given type of list. The results of the
two groups learning the same materials
have, therefore, been combined.
Table 2 shows the mean trials to
criterion for each of the nine kinds of
lists and also summarizes the average
trends produced by the word frequency
of the Ss and Rs. Speed of
learning first increases and then decreases
as a function of the word
frequency of Ss but varies directly
with the word frequency of Rs.
The over-all effects of the word frequency
of both Ss and Rs are significant
beyond the .01 level (F = 6.15
for Ss and 7.24 for Rs, df = 2/279
for both). The S X R interaction is
not significant (F = 1.54, df = 4/279).

TABLE 2
MEAN NUMBERS OF TRIALS TO CRITERION: EXP. I
[Table body illegible in the source. Legible headings: Stimulus Terms,
Over-All Mean. Legible entries: 15.7, 14.7.]

Errors during learning.—Table 3
shows the mean percentages of misplaced
Rs during learning. Such
responses account for the great majority
of overt errors. The percentages
are based on opportunities (total
number of presentations minus number
of correct responses) and thus
are independent of speed of learning.

For both Ss and Rs the mean percentages
of errors are clearly larger
when word frequency is high than
when it is medium or low. There
are only small differences between
the latter two conditions. Following
arc-sine transformation and with 2/279
df, the F ratio for Ss is 12.44 (P < .001)
and 3.11 for Rs (.02 < P < .05).
There is also a significant S X R
interaction (F = 2.62, df = 4/279,
.02 < P < .05). When the word
frequency of either the Ss or the Rs
is high, the percentage of errors remains
uniformly large, regardless of
the value of the other variable.

Table 3 also shows for each type of
list the product-moment correlation
between trials to criterion and percentage
of misplaced Rs during learning.
All the correlations are positive,
i.e., the higher the error rate, the more
slowly is criterion reached. This
relationship is especially pronounced
and statistically significant for Cond.
Sm and Rh.

TABLE 3
MEAN PERCENTAGES OF MISPLACED RESPONSES AND CORRELATION BETWEEN
PERCENTAGES AND TRIALS TO CRITERION: EXP. I
[Table body largely illegible in the source. Legible headings: R Terms,
Stimulus Terms, Over-All Mean. Legible entries: 23.0, 16.4, 14.8;
17.9, 10.1, 12.1. Legible correlations: .59*, .61*.]

TABLE 4
MEAN NUMBERS OF ITEMS RECALLED IN EXP. I

30-Sec. Recall
                        Stimulus Terms
R Terms        Sh           Sm           Sl        Over-All
             Mean   SD    Mean   SD    Mean   SD     Mean
Rh            9.4   .7     9.2   .7     9.2   .8      9.3
Rm            9.2   .7     8.8   .8     9.0  1.2      9.0
Rl            8.7   .9     9.1  1.2     8.6  1.0      8.8
Over-All
Mean          9.1          9.0          8.9

7-Day Recall
                        Stimulus Terms
R Terms        Sh           Sm           Sl        Over-All
             Mean   SD    Mean   SD    Mean   SD     Mean
Rh            5.3  1.7     5.4  2.1     5.2  1.9      5.3
Rm            4.9  2.0     5.1  1.9     3.8  2.1      4.6
Rl            4.7  2.3     5.2  1.9     3.5  2.4      4.5
Over-All
Mean          5.0          5.2          4.2

The inverse relationship between 
error rate and speed of learning found 
for individuals within groups clearly 
does not hold for materials. On the 
R side, both speed of learning and 
error rate are maximal for Cond. Rh;
on the S side, error rate is maximal,
and speed of learning is intermediate,
for Cond. Sh. Within a given group,
however, rate of acquisition varies
with Ss' susceptibility to interference,
especially when learning is fast as
in Cond. Sm and Rh. When correct
Rs are acquired rapidly, a low per- 
centage of misplaced Rs is favorable 
to attainment of the criterion, i.e., 
failures to respond are eliminated 
more readily than overt errors. When 
correct Rs increase relatively slowly, 
the criterion score is less sensitive 
to the type of error. 
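
The error measure used in this section, a percentage based on opportunities, can be illustrated with a short sketch. The text does not say which form of the arc-sine transformation was applied, so the common form 2·arcsin(√p) is an assumption here. The function names and the sample counts are hypothetical and do not come from the experiment.

```python
import math


def misplaced_percentage(misplaced, presentations, correct):
    """Percentage of misplaced Rs out of opportunities for error."""
    # Opportunities = total presentations minus correct responses
    opportunities = presentations - correct
    if opportunities <= 0:
        raise ValueError("no opportunities for error")
    return 100.0 * misplaced / opportunities


def arcsine_transform(percentage):
    """Assumed form of the arc-sine transformation: 2 * asin(sqrt(p))."""
    p = percentage / 100.0
    return 2.0 * math.asin(math.sqrt(p))


# Hypothetical S: 150 presentations, 90 correct, 12 misplaced responses
pct = misplaced_percentage(misplaced=12, presentations=150, correct=90)
print(pct, arcsine_transform(pct))
```

Because the denominator excludes correct responses, a fast learner and a slow learner with the same proportion of misplaced responses among their failures receive the same score.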

Recall.—Table 4 shows the mean
numbers of Rs recalled by the various
groups. The drops on the 30-sec.
test vary inversely with speed of
learning although the correlation is
not perfect. The amounts of retention
loss (mean 30-sec. recall − mean
7-day recall) as a function of the word
frequency of Ss and Rs are shown in Fig.
1. The losses during the 7-day interval
are smallest for Sm and greatest
for Sl, with Sh yielding an intermediate
amount of forgetting. The
average difference between Sh and
Sm is, however, clearly smaller than
that between Sl and Sm. The trends
for the three values of R are comparable.
There are no consistent
variations in retention loss as a function
of the word frequency of Rs.
The slight over-all advantage of
Rh is due entirely to the variations
under Cond. Sl and is not present
at the other two values of S. Following
a Freeman-Tukey square root
transformation, the retention scores
were subjected to an analysis of
variance. The significance of the
differences in the amount of forgetting
is assessed by the interactions of time
(T) with the values of S and R. Only
the interaction, S X T, is significant
(F = 3.08, df = 2/270, .02 < P < .05).
The F ratios for R X T and S X R X T
are .88 and 1.08, respectively. Thus,
the rank order of the S conditions
is the same for speed of original learning
and amount of long-term retention.
On the other hand, the differences
in rate of acquisition among the
R conditions are not reflected consistently
in the amounts of retention.

FIG. 1. Amounts of retention loss (mean
30-sec. recall − mean 7-day recall) as a function
of the word frequency of Ss and Rs
(Exp. I).
[Figure not reproduced. Ordinate: retention loss (30 sec. − 7 days);
abscissa: log frequency (Low, Medium, High); legible panel label: STIMULI.]

TABLE 5
NUMBERS OF MISPLACED RESPONSES AT RECALL: EXP. I
[Table body illegible in the source. Legible headings: 30-Sec. Recall,
7-Day Recall, Stimulus Terms, Total, N, Over-All.]

Note.—N = Number of Ss giving misplaced responses.
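
The retention-loss measure defined above (mean 30-sec. recall minus mean 7-day recall) can be worked through with the over-all means for the stimulus conditions in Table 4. The sketch below is illustrative only. The labels Sh, Sm, and Sl are shorthand for high, medium, and low stimulus frequency, and the 7-day value for the medium condition is read as 5.2, which agrees with the individual cell means in that column.

```python
# Over-all mean recall by stimulus condition, as read from Table 4 (Exp. I)
recall_30_sec = {"Sh": 9.1, "Sm": 9.0, "Sl": 8.9}
recall_7_day = {"Sh": 5.0, "Sm": 5.2, "Sl": 4.2}


def retention_loss(immediate, delayed):
    """Mean 30-sec. recall minus mean 7-day recall for each condition."""
    return {cond: round(immediate[cond] - delayed[cond], 1) for cond in immediate}


loss = retention_loss(recall_30_sec, recall_7_day)
ranked = sorted(loss, key=loss.get)  # smallest loss first

print(loss)
print(ranked)
```

On these means the loss is smallest for the medium-frequency stimuli and largest for the low-frequency stimuli, with the high-frequency stimuli in between, which is the ordering described in the text.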

Misplaced responses at recall.—The
numbers of misplaced Rs at recall 
and the numbers of Ss contributing 
them are shown in Table 5. Such 


errors account for only a small frac- 
tion of the total retention losses. 
A large proportion of Ss—99/144 in 
the 30-sec. groups and 88/144 in the 
7-day groups—failed to give mis- 
placed Rs. The observed frequencies 
indicate a higher rate of recovery 
of intralist intrusions for Sh and Sl
than for Sm, and also a higher rate 
for Ry than for Ra and Ry. In view 
of the high proportion of zero scores, 
the differences in temporal trends 
were not evaluated statistically and 
must be interpreted with caution. 
It is interesting to note, however, 
that the apparent differences in rate 
of recovery are related to the amount 
of forgetting for the S conditions but 
not the R conditions. The low rate 
of intralist intrusions suggests that 
Ss had considerable success in rejecting 
incorrect Rs.

Relearning.—The mean numbers
of trials to criterion in relearning are 
shown in Table 6. The relative 
amounts of retention loss correspond 
to those found at recall; the increase 
in trials to criterion is greater for Sh
and Sl than for Sm whereas the word
frequency of Rs has little effect. 
The interaction, S X T, is significant 
(F = 4.52, df = 2/270, .02 < P < .05).


Se 


ACQUISITION AND RETENTION OF ASSOCIATIONS 13 


TABLE 6 


The F ratios for RXT and SXRXT 


ot ie ge Fre fee Pest 
: of fornia. other the Rs were 
are ,01 and .85, respectively. jates of low probability (Ai), having 


of 
EXPERIMENT II ap tad eae rar aang types of 
pairs: i 


Method ia Sects and Sy-A; in one list, SrA, and 
Materials.—There were two lists of 10 SrAtin the other. All the Rs were nouns of 

. high frequency of usage falling within the 

paired associates, one with S terms of high AA ra of the G I Count Of 


TABLE 7
STIMULUS-RESPONSE PAIRS USED IN EXP. II

List Sh
Stimulus   Ah        Assoc. Prob.a    Stimulus   Al       Assoc. Prob.a
GARDEN     FLOWER    592              BUILDING   SIZE     2
MOMENT     TIME      [illegible]      COUNTRY    VALLEY   2
ORDER      COMMAND   115              LETTER     PERSON   2
TROUBLE    PROBLEM   132              REASON     NEED     2
WINDOW     GLASS     252              SHOULDER   BURDEN   2

a Number of Ss out of 1000 giving response on test of free association.




TABLE 8

MEAN NUMBERS OF TRIALS TO CRITERION OF 5/5 FOR EACH CLASS OF PAIRS AND
10/10 FOR TOTAL LISTS IN ORIGINAL LEARNING AND RELEARNING: EXP. II

                     30-Sec. Groups             7-Day Groups
                     Stimulus List              Stimulus List
Pairs              Sh           Sl            Sh           Sl
                 Mean   SD    Mean   SD     Mean   SD    Mean   SD

Original Learning
Ah                4.6  2.7     1.9   .9      5.4  3.7     1.6  [?]
Al                4.3  2.0     2.8  1.0      4.2  [?]     2.6  1.5
Total list        5.4  2.9     2.9  1.0      6.2  3.4     2.8  1.5

Relearning
Ah                1.4   .8     1.0    0      3.3  1.9     1.7   .6
Al                1.8  1.0     1.2   .6      2.6  1.3     2.2   .6
Total list        13 [sic] 1.2 1.2   .6      4.7  1.9     2.5   .8

[?] = entry illegible in the source.


lists the S and R terms had six first letters
in common. Within a list none of the Rs 
appeared in the association tables for any 
of the stimuli except the one with which 
it was paired. Table 7 presents the lists 
and for each pair shows the frequency of the 
R term in the association norms.

Procedure.—The lists were presented on a 
Hull-type memory drum at a 2:2 rate with an 
8-sec. intertrial interval. Learning was to 
one perfect trial. There were four different 
orders of presentation each of which was 
used as a starting order equally often. Re- 
tention was tested by relearning either 30 sec. 
or 7 days after the end of original learning. 
Relearning was for five trials or to criterion, 
whichever took the longer. 

Subjects—With two lists and two reten- 
tion intervals, the design comprised four 
groups of 16 Ss each. All Ss were under- 
graduate students who had English as their 
native language and were naive to rote-
learning experiments. The Ss were assigned 
to conditions in blocks of 4, with 1 S per 
block for each combination of lists and retention
intervals. Assignment to conditions
within a block was determined by a table 
of random numbers as was the assignment to 
different starting orders. No Ss were lost 
because of failure to learn. 


Results 


Original learning.—The mean numbers
of trials to a criterion of 5/5 for
each type of pair as well as to a
criterion of 10/10 for the entire list 
are shown in Table 8. There were 
no significant differences between the 
30-sec. and 7-day groups learning a 
given kind of list. Pairs Sl-Ah were
mastered extremely rapidly, with the
large majority of Ss reaching the
criterion of 5/5 in either one or two
trials. Pairs Sl-Al were learned more
slowly than Pairs Sl-Ah. However,
with half the pairs learned almost
immediately, acquisition of the entire
list was very rapid. By contrast,
Pairs Sh-Ah were not learned faster
than Pairs Sh-Al; in fact, there was a
trend in the opposite direction. The
over-all rate of learning was significantly
slower for List Sh than for
List Sl (t = 4.97, df = 62, P < .01).
Analysis of variance of the criterion
scores on the four kinds of pairs shows
the interaction, Pairs X Lists, to be
significant (F = 7.08, df = 1/62,
P < .01). Thus, the association norms
predict speed of learning for List Sl
but not for List Sh.

Errors during learning.—Misplaced
Rs were much more common during
the acquisition of List Sh than of List
Sl. Twenty-two of the Ss learning
List Sh made such errors, and only
6 of the Ss learning List Sl. When
the numbers of misplaced Rs are
expressed as percentages based on
opportunities, the mean percentage
for List Sh is 27.6 (SD = 20.0;
Median = 26.8). For List Sl the mean
of the extremely skewed distribution
is 6.4 (SD = 16.1; Median < 1.0).
The difference between the two distributions
is significant beyond the
.01 level by Wilcoxon's test for signed
ranks, with Ss in the same block
treated as paired replicates.

A total of 23 importations from
outside the list was contributed by
13 of the Ss learning List Sl; 3 of the
Ss learning List Sh contributed a total
of five such errors. The difference
between the proportions of Ss making
such errors is significant beyond the
.01 level (χ² = 6.75; df = 1). All
of the importations appeared in the
association tables for the appropriate
stimuli. Of 23 importations into List
Sl, 19 were substitutions of primary
associates for prescribed Al responses;
no such cases occurred during the
learning of List Sh.
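
The chi-square value reported above can be checked by hand. The text does not state which form of the test was used. The sketch below assumes a 2 x 2 table of Ss who did or did not make importations, with 32 Ss per list (two groups of 16), and applies Yates' continuity correction. Under those assumptions the computation reproduces the reported 6.75. The function name `chi_square_yates` is hypothetical.

```python
def chi_square_yates(a, b, c, d):
    """Chi-square for the 2 x 2 table [[a, b], [c, d]] with Yates' correction."""
    n = a + b + c + d
    numerator = n * (abs(a * d - b * c) - n / 2) ** 2
    denominator = (a + b) * (c + d) * (a + c) * (b + d)
    return numerator / denominator


# One list: 13 Ss with importations, 19 without.
# Other list: 3 Ss with importations, 29 without.
chi2 = chi_square_yates(13, 19, 3, 29)
print(round(chi2, 2))  # 6.75
```

Without the continuity correction the same table gives a value of about 8.33, so the agreement with 6.75 suggests, but does not prove, that the corrected form was the one reported.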

Recall.—The recall scores for the
four kinds of pairs are shown in Table
9. When we consider the two types
of pairs within each list, we find clearly
superior retention of the high-probability
pairs in List Sl but no difference
in List Sh. Comparison of the
corresponding pairs in the two lists
shows retention to be higher for Pairs
Sl-Ah than for Pairs Sh-Ah. By
contrast retention is poorer for Pairs
Sl-Al than for Pairs Sh-Al.

In view of the virtual absence of 
any variation in the recall scores 
on the 30-sec. test, the following 
procedure was used to evaluate the 
statistical significance of the differ- 
ences in the amount of forgetting. 


Within each block, the difference 
between the scores obtained by Ss in 
the 30-sec. and 7-day groups was 
determined for each type of pair, and 
the distribution of differences was 
subjected to an analysis of variance. 
The over-all difference between Lists 
is not significant (F < 1), whereas 
the interaction of Lists with Pairs is 
significant beyond the .01 level 
(F = 8.59, df = 1/30). The rela- 
tionship between the associative rank 
of the Rs and the rate of forgetting 
varies reliably with the word frequency 
of the Ss. 

Errors at recall.—The numbers of
misplaced Rs at recall were small and
did not increase during the retention
interval. The few intralist errors
that were made occurred almost
exclusively during the recall of List
Sh, at the rate of approximately .5
per S on both tests. By contrast,
importations from outside the list
showed a systematic temporal trend.
On the 30-sec. test there were only
2 Ss giving such Rs, both on Pairs
Sl-Al. The frequencies of importations
on the 7-day test and the numbers
of Ss (N) contributing them were
as follows: Sh-Ah—4 (N = 4); Sh-Al—
5 (N = 5); Sl-Ah—5 (N = 5); Sl-Al—
19 (N = 13). When the 30-sec. and
7-day Ss assigned to the same block
are compared, the number of cases
in which there is an increase in
importations during the retention
interval is significantly greater for
List Sl than for List Sh (χ² = 4.52,
df = 1, .02 < P < .05). It is clear
that this difference is primarily a
function of the number of imported
substitutions for low-probability associates.
In the recall of List Sh,
none of these substitutions were
primary associates of the appropriate
Ss; in the case of List Sl, 16/19
substitutions were primary associates.

TABLE 9
MEAN NUMBERS OF PAIRS RECALLED IN EXP. II

              30-Sec. Recall            7-Day Recall
              Stimulus List             Stimulus List
Pairs        Sh          Sl            Sh          Sl
           Mean  SD    Mean  SD     Mean  SD    Mean  SD
Ah          4.8  .4     5.0   0      3.2  1.2    4.1   .8
Al          4.5  .5     4.8  .4      3.0  1.3    2.2  1.2

FIG. 2. Mean numbers of correct responses
in relearning of verbal discrimination
(Exp. III).
[Figure not reproduced. Abscissa: trials; legible panel label: 30 sec.;
curves: Hi-F, Med-F, Lo-F.]

Relearning.—The mean numbers of 
trials to relearn to a criterion of 5/5 
on each type of pair and 10/10 for 
the entire list are shown in Table 8. 
The differences in the amount of 
forgetting as measured by speed of 
relearning correspond to those found 
at recall. The procedure used in 
analyzing the temporal trends in 
recall was also applied to the relearn- 
ing scores. The over-all difference 
between Lists was again not reliable 
(F = 1.53), whereas the interaction 
of Lists with Pairs is significant
(F = 5.11, df = 1/30, .02 < P < .05).


EXPERIMENT III 
Method 


Materials.—The learning materials were
three lists of 10 pairs of words each. One
list (List HF) was composed of words of high 
frequency of usage, one list (List MF) of 
words of medium frequency, and one list 
(List LF) of words of low frequency. The 
words used in the construction of the lists 
were the same as those in the S lists and R 
lists of Exp. I (see Table 1). Two different 
pairings of the words in each list were used 
equally often. For purposes of verbal dis- 
crimination learning, one of the items in each 
pair was designated as correct. Each item 
in a pair was correct for half the Ss. 

Procedure.—The members of a pair were
presented one above the other in the window 
of a Hull-type memory drum. The rate of 
presentation was 1.5 sec. for the pair alone, 
and 1.5 sec. for the pair and the correct R 
together. The intertrial interval was 3 sec. 
Four different orders of presentation were 
used to minimize serial learning. In each
order half of the correct words appeared in 
the upper position and half in the lower posi- 
tion. The position of the correct item in each 
pair varied from trial to trial. Each of the
four orders was used as a starting order 
equally often. The Ss began anticipating the 
correct R on the first trial. Learning was to 
a criterion of one perfect trial. 

Retention was tested by relearning either 
30 sec. or 7 days after the end of practice. 
Relearning was for five trials or to criterion, 
whichever took the longer. 

Subjects.—With three types of lists and 
two retention intervals, there were six groups 
of 16 Ss each. All Ss were undergraduates 
who had English as their native language; 
they were not necessarily naive to rote-learn- 
ing experiments but had had no prior experi- 
ence with verbal discrimination learning. 
The Ss were assigned to conditions in blocks 
of 6, with 1 S per block for each combination 
of lists and retention intervals. The running 
order within each block and the assignment 
to different starting orders were determined 
by tables of random numbers. No Ss were 
lost because of failure to learn. 


Results 


Original learning.—There were no 
significant differences between the 
30-sec. and 7-day groups learning a 
given kind of list. The mean numbers
of trials to criterion for the combined
groups were as follows: HF—6.4
(SD = 4.0); MF—4.5 (SD = 1.5); 
LF—5.3 (SD = 2.5). The effects of 
word frequency on acquisition parallel 
those obtained on the S side in paired- 
associate learning in Exp. I, i.e., List 
MF is learned faster than Lists HF and 
LF. The variation in criterion scores 
is significant (F = 3.63, df = 2/93, 
.02 < P < .05).

Recall and relearning.—Figure 2 
shows the mean numbers of correct 
Rs on the five trials of relearning 30 
sec. and 7 days after the end of prac- 
tice. The 30-sec. test reflects the 
rank order of the conditions in original 
learning. Immediate relearning pro- 
ceeds less steadily for List HF than 
for the other two lists. The disad- 
vantage of List HF becomes pro- 
nounced on the 7-day test. A note- 
worthy fact is that the difference 
between List HF and the other two 
lists increases between Trial 1 and 
Trial 2 and that this separation is 
maintained through Trial 4. The 
conditions which slowed up the acqui- 
sition of List HF rapidly come into 
play again in relearning. The separa- 
tion between Lists MF and LF is 
consistently small. 

The differences on the recall trial 
are not significant. For the five trials 
of relearning, the over-all variation 
among lists is significant (F = 7.52, 
df = 2/90, P < .01). With perform- 
ance on List HF poorest on both tests, 
the Time X Lists interaction falls 
short of significance (F = 2.12, 
df = 2/90). The pattern of differ- 
ences and the results of the statistical 
tests are the same when trials to 
relearn to criterion are considered. 
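
The degrees of freedom reported above follow from the design, as the short check below illustrates. It assumes that the acquisition analysis treated the three lists as a one-way classification of all 96 Ss and that the relearning analysis used the between-Ss error term of the 3 X 2 (Lists X Time) design; the variable names are illustrative only.

```python
# Design constants taken from the Subjects paragraph.
n_lists = 3        # HF, MF, LF
n_intervals = 2    # 30 sec. and 7 days
n_per_cell = 16    # Ss per group

n_total = n_lists * n_intervals * n_per_cell          # 96 Ss in all

# Acquisition: retention groups combined, lists as the only factor (assumed).
df_lists = n_lists - 1                                # numerator df
df_error_acquisition = n_total - n_lists              # denominator df

# Relearning: Lists X Time factorial with a within-cell error term (assumed).
df_interaction = (n_lists - 1) * (n_intervals - 1)
df_error_relearning = n_total - n_lists * n_intervals

print(f"acquisition: df = {df_lists}/{df_error_acquisition}")
print(f"relearning, lists: df = {df_lists}/{df_error_relearning}")
print(f"relearning, Time X Lists: df = {df_interaction}/{df_error_relearning}")
```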


DISCUSSION 


The results of the experiments are 
consistent with the assumption that unit- 
sequence interference and unit-sequence 
facilitation covary as a function of word
frequency, with the relative amount of
effective interference increasing more 
rapidly on the S than on the R side. 
Such systematic changes in the pattern 
of transfer effects are indicated by the 
findings on acquisition in Exp. I: (a) 
rate of overt errors is a positively ac- 
celerated function of the word frequency 
of both Ss and Rs; (b) speed of learning 
increases steadily with the word fre- 
quency of Rs; (c) speed of learning first 
increases and then decreases as a func- 
tion of the word frequency of Ss. 

Speed of learning is influenced by the 
word frequency of both Ss and Rs where- 
as the amount of retention varies signifi- 
cantly only with S conditions. In acqui- 
sition the availability of the R units, 
which is a function of frequency, is a 
major determinant of the speed of learn- 
ing (Underwood & Schulz, 1960). Once 
the Rs are available to the learner, the 
associates elicited by the Ss are of major 
importance in determining the speed at 
which the prescribed associative con- 
nections are established. Retention losses 
occur as incorrect associations which 
had been unlearned during acquisition 
recover in strength relative to the pre- 
scribed sequences. On the recall test, 
the competing Rs elicited by the S terms 
largely determine the amount of effec- 
tive interference. The rate at which 
such competing associations recover with 
the passage of time reflects the balance 
of positive and negative transfer effects 
in acquisition. There appears to be no 
differential decline in the availability 
of the Rs when well integrated units 
such as words are used. 

Experiment II shows that the pre-
experimental probabilities of specific
associates have a major influence on
learning and retention when the word
frequency of the Ss is low but have no
differential effects when the word fre-
quency of the Ss is high. In the acquisi-
tion of List S_h, strong associative con-
nections either existed or were readily
established by direct mediational links
between any pair of words in the list,
and the primary associates had no
advantage over associates of low rank.
In the case of List S_l, on the other hand,
the uncontrolled transfer effects of pre-
experimental habits were weak and did
not mask the difference between asso-
ciates of high and low probability.

The results of the retention tests in 
Exp. II clearly bring out the differences 
in unit-sequence interference and facili- 
tation falling on the pairs in the two 
lists. When facilitation by pre-experi- 
mental habits obtains for the pairs in 
both lists (S_h-A_h and S_l-A_h), retention
is inversely related to the word frequency 
of the Ss. Under these conditions the 
rate of forgetting directly reflects the 
assumed difference in unit-sequence in-
terference. In the case of the low-prob-
ability pairs (S_h-A_l and S_l-A_l) there is
interference from language habits in
both lists, as indicated by the intralist
errors in List S_h and the frequent impor-
tation of primary associates into List S_l.
For these pairs, retention varies directly 
with word frequency, now reflecting the 
assumed difference in unit-sequence fa- 
cilitation. The question arises, of course, 
whether the differences in recall, and 
for that matter in acquisition, are simply 
the result of a set to give primary 
associates. While such a set was un- 
doubtedly present, the critical finding is 
that it was effective only with Ss of low 
word frequency. With Ss of high word 
frequency, no one associate is clearly 
dominant, and Rs from outside the list 
cannot compete effectively with those 
within the list. 

The fact that speed of verbal discrimi- 
nation learning first increases and then 
decreases as a function of word frequency 
(Exp. III) provides additional support 
for the assumed covariation of unit- 
sequence facilitation and interference. 

With homogeneous pairs such as were
used in this study, differential effects 
of R availability are minimized (cf. 
Runquist & Freeman, 1960), and the 
relationship parallels that obtained for 
Ss in paired-associate learning. The 
availability of strong differential Rs is 
essential for the achievement of a stable 
discrimination, but once the number of 
such Rs exceeds an optimal value, gen- 
eralization across items counteracts the 
beneficial effects of R-produced differ-
entiation. It is likely that generalization
both within pairs and between pairs 
contributes to the total effect. While 
recall was too high to yield reliable dif- 
ferences, List HF is clearly inferior to the 
other two lists in relearning. 

In agreement with the results ob- 
tained earlier with serial lists (Postman, 
1961), the present experiments offer no 
support for the assumption that mean- 
ingfulness necessarily favors retention. 
By maximizing either the negative or 
positive transfer effects of pre-experi- 
mental habits, it is possible to obtain 
either an inverse or a direct relationship 
between meaningfulness and retention. 
For an average sample of verbal materi- 
als, the best prediction at present is 
that the differences in retention will 
be small when degree of original learning 
is held constant. 


SUMMARY 


Three experiments investigated the trans- 
fer effects of language habits on the acquisi- 
tion and retention of verbal associations. 
They tested the assumption that both positive 
and negative transfer effects (unit-sequence 
facilitation and unit-sequence interference) 
increase as a function of the frequency of usage 
of words. The balance of unit-sequence 
interference and facilitation determines the 
speed of acquisition and the rate at which 
interferences recover with the passage of time. 

In Exp. I lists of paired associates repre- 
senting all possible combinations of S and R 
terms of high, medium, and low word fre- 
quency were used. Speed of acquisition first 
increased and then decreased as a function 
of the word frequency of Ss but varied directly 
with the word frequency of Rs. Amount of 
retention did not vary significantly as a 
function of R conditions whereas the effect 
of S frequency paralleled that obtained in 
acquisition. While the availability of Rs and 
pre-experimental associative probabilities both 
influence speed of acquisition, amount of 
retention loss appears to be determined pri- 
marily by the recovery of competing associa- 
tions elicited by the S terms. 

In Exp. II associates of high and low pre- 
experimental probability, as determined by 
norms of free association, were learned to Ss
of high and low word frequency. Pre-experi-
mental associative probability has significant
effects on learning and retention when the
word frequency of the Ss is low but not when
it is high. Whether or not manipulation of 
the conditions of facilitation and interference 
is successful depends on the magnitude of the 
uncontrolled transfer effects of pre-experi- 
mental habits. 

The method of verbal discrimination learn- 
ing was used in Exp. III with pairs of words 
of either high, medium, or low word frequency. 
Speed of acquisition first increased and then 
decreased as a function of the word frequency 
of the paired items. Thus, the relationship 
paralleled that obtained for the word fre- 
quency of Ss in paired-associate learning. 
The differences in recall were unreliable, but 
the high-frequency lists were relearned more 
slowly than those of medium or low frequency. 


REFERENCES 


CIEUTAT, V. J., STOCKWELL, F. E., & NOBLE,
C. E. The interaction of ability and
amount of practice with stimulus and re-
sponse meaningfulness (m, m') in paired-
associate learning. J. exp. Psychol., 1958,
56, 193-202.

HUNT, R. G. Meaningfulness and articula-
tion of stimulus and response in paired-
associate learning and stimulus recall.
J. exp. Psychol., 1959, 57, 262-267.

KIMBLE, G. A., & DUFORT, R. H. Meaning-
fulness and isolation as factors in verbal
learning. J. exp. Psychol., 1955, 50, 361-

L'ABATE, L. Manifest anxiety and the learn-
ing of syllables with different associative
values. Amer. J. Psychol., 1959, 72, 107-
110.

MANDLER, G., & CAMPBELL, E. H. Effect
of variation in associative frequency of
stimulus and response members on paired-
associate learning. J. exp. Psychol., 1957,
54, 269-273.

MORIKAWA, Y. Functions of stimulus and
response in paired-associate verbal learning.
Psychologia, 1959, 43, 437-446.

POSTMAN, L. Extra-experimental inter-
ference and the retention of words. J. exp.
Psychol., 1961, 61, 97-110.

RUNQUIST, W. N., & FREEMAN, M. Roles
of association value and syllable familiar-
ization in verbal discrimination learning.
J. exp. Psychol., 1960, 59, 396-401.

THORNDIKE, E. L., & LORGE, I. The teacher's
wordbook of 30,000 words. New York:
Teachers College, Columbia University,
1944.

UNDERWOOD, B. J., & POSTMAN, L. Extra-
experimental sources of interference in
forgetting. Psychol. Rev., 1960, 67, 73-95.

UNDERWOOD, B. J., & SCHULZ, R. W. Mean-
ingfulness and verbal learning. Philadel-
phia: Lippincott, 1960.


(Received May 11, 1961) 


Journal of Experimental Psychology
1962, Vol. 64, No. 1, 20-23


CONDITIONED DIMINUTION OF THE UNCONDITIONED 
RESPONSE AS A FUNCTION OF THE NUMBER 
OF REINFORCEMENTS 1


H. D. KIMMEL
University of Florida

AND

H. S. PENNYPACKER
Duke University


Kimble (1961) has recently shown
that the amplitude of the uncondi-
tioned eyeblink is attenuated by the
presence of the CS during reinforced
CS-UCS conditioning trials. Assum-
ing that this diminution of the UCR
is a manifestation of a conditioned
inhibitory process under the control
of the CS, Kimble and Ost (1961)
varied the CS-UCS interval during
conditioning to determine whether
or not the degree of inhibition varied
as a function of the interstimulus
interval. They found that the strength
of the conditioned inhibitory process
was related to the interstimulus inter-
val in the same way as is the condi-
tioned excitatory process, i.e., the
greatest amount of recovery in UCR
amplitude occasioned by the omission
of the CS occurred when the interval
during conditioning had been 0.5 sec.

The purpose of the present study
was to extend these eyelid condition-
ing findings to another classically
conditionable human response, the
GSR, and to investigate the relation-
ship between this conditioned inhibi-
tory process and another parameter
of the conditioning process, the num-
ber of reinforcements. It was hy-
pothesized that attenuation of the
UCR during reinforcement and re-
covery of the UCR upon omission of
the CS would both vary positively
with variations in the number of
reinforcements.


1 The junior author's collaboration in this
study was supported by a predoctoral re-
search fellowship from the National Institute
of Mental Health (Mf 13,327).

This study was done while the senior
author was Visiting Assistant Professor at
Duke University on leave of absence from
the University of Florida.


METHOD 


Subjects.—Thirty students in introductory
psychology at Duke University were Ss. 
The Ss volunteered to meet a class require- 
ment. They were assigned randomly to 
three groups of 10 each, to receive different 
numbers of reinforcements, but were other- 
wise treated identically. 

Instructions.—The S read the instructions
from a typewritten card. They indicated 
that this was an experiment on the effect of 
environmental stimulation upon the GSR 
and that S’s task was to remain relaxed and 
motionless but to pay attention to the stimuli 
during the experiment. 

Apparatus.—The CS was a 1000-cps tone
produced by General Radio Company equip-
ment and delivered to S via Trimm Company
earphones. The intensity of the tone, rated
at the earphones, was 40 db. (re: .0002
dynes/cm²).

The UCS was an electric shock produced 
by a Psychological Instruments Company 
stimulator and delivered to the index and 
middle fingertips of S’s right hand. The 
shock intensity used during conditioning 
was 2 ma. at 50 v., for a hypothetical S with 
a resistance of 25,000 ohms.

The durations of the stimuli were con-
trolled electronically. The CS had a duration
of 1.0 sec. and the UCS a duration of 0.1 sec. 
On paired conditioning trials a delayed 
paradigm was used with a 0.9-sec. interval 
between onsets of the two stimuli. 

The GSR was measured as the maximum 
decrease in resistance which occurred within 
3 sec. after the offset of a stimulus. The 
response was picked up from the palm and 
back of S's left hand by 4-in. zinc electrodes, 
covered with a few drops of zinc sulphate 
solution, in lucite cups filled with saline elec-
trode jelly. It was amplified by a Hunter
GSR apparatus and recorded on an Esterline-
Angus ink-writing milliammeter with a paper
speed of 3 in/min. All responses were trans-
formed to the square root of conductance
change (√ΔC) for statistical purposes.
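
The transformation can be illustrated with a minimal sketch. It assumes that conductance is taken as the reciprocal of resistance and expressed in micromhos, and that a response showing no decrease in resistance is scored as zero; neither detail is stated in the report, and the function name is hypothetical.

```python
import math


def sqrt_conductance_change(r_before_ohms, r_minimum_ohms):
    """Square root of the conductance change for one GSR.

    r_before_ohms:  resistance before the response
    r_minimum_ohms: lowest resistance reached within the scoring window
    """
    # Assumption: conductance in micromhos = 1,000,000 / resistance in ohms.
    c_before = 1e6 / r_before_ohms
    c_peak = 1e6 / r_minimum_ohms
    delta_c = c_peak - c_before
    # Assumption: responses with no resistance decrease are scored as zero.
    return math.sqrt(max(delta_c, 0.0))
```

For a hypothetical S whose resistance falls from 25,000 to 20,000 ohms, conductance rises from 40 to 50 micromhos, and the transformed score is the square root of 10.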

Procedure.—Data were collected in a quiet 
dark room. An electric fan masked extrane-
ous sounds. The E and the apparatus were
located in an adjoining room. 

The S's left hand was cleaned with acetone 
before the GSR electrodes were attached. 
He was then seated in the experimental room 
and given the instructions to read before the 
pickup leads were attached to the GSR elec- 
trodes, the shock electrodes were taped to his 
fingertips, and the earphones were placed 
on his head. 

The S was first given 2 presentations of 
the CS alone followed by 3 presentations of 
the shock alone, in intensity increments up to 
the intensity used during conditioning. This 
was followed by 4 additional presentations 
of the CS alone. During the reinforcement 
period, which occurred next, the three groups 
of Ss received either 4, 8, or 16 paired presen- 
tations of the CS and UCS. After reinforce- 
ment 2 presentations of the shock alone were 
given and, finally, 4 presentations of the CS 
alone (i.e., extinction). All presentations
(i.e., trials) were separated by intervals of
30 to 60 sec., varied unsystematically by E.
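
The sequence of trials for one S may be laid out as follows. The sketch is illustrative only: the intertrial intervals, which E varied unsystematically, are here drawn from a uniform distribution between 30 and 60 sec., which is an assumption, and the function name `build_session` is hypothetical.

```python
import random


def build_session(n_reinforcements, seed=0):
    """Return the ordered trials for one S as (trial_type, interval_sec) pairs."""
    if n_reinforcements not in (4, 8, 16):
        raise ValueError("the three groups received 4, 8, or 16 reinforcements")

    # Phases in the order given in the Procedure paragraph.
    phases = [
        ("CS alone", 2),
        ("UCS alone", 3),                    # intensity raised in steps (not modeled)
        ("CS alone", 4),
        ("CS-UCS paired", n_reinforcements),
        ("UCS alone", 2),                    # test trials without the CS
        ("CS alone", 4),                     # extinction
    ]

    # Assumption: uniform sampling stands in for E's unsystematic variation.
    rng = random.Random(seed)
    trials = []
    for trial_type, count in phases:
        for _ in range(count):
            trials.append((trial_type, rng.uniform(30.0, 60.0)))
    return trials
```

Each session therefore contains 15 trials in addition to the reinforced trials, e.g., 23 trials in all for the group given 8 reinforcements.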


RESULTS 


Of primary significance in evaluat- 
ing the hypotheses was the way in 
which the amplitude of the UCR 
varied over trials during reinforcement 
and the degree to which an increment 
in amplitude of the UCR was pro- 
duced by the omission of the CS after 
reinforcement. Figure 1 shows the 
average amplitude of GSR for each 
group of Ss, as a function of reinforced 
trials. Also indicated in Fig. 1 are 
the average amplitudes of GSR on the 
two test trials of the UCS only (shown 
in the figure). Each data point repre- 
sents the mean of 10 Ss. 

Figure 1 indicates that the ampli-
tude of the UCR gradually reduced
during reinforcement, although the
groups differed in the extent to which
this reduction occurred. Statistical
analysis of the degree of diminution
of GSR amplitude was achieved by
comparing the average amplitude of
GSR on the first two reinforced trials
(combined) with the average ampli-
tude of GSR on the last two reinforced
trials (combined). Thus, for the
group which received four reinforce-
ments the two values used were the
average of the first two reinforced
trials and the average of the third and
fourth reinforced trials. The values
used for the group which received
eight reinforcements were obtained
from the first two reinforced trials and
the seventh and eighth reinforced
trials, etc. Analysis of variance of
these average measures of UCR ampli-
tude during reinforcement showed
that the overall reduction in UCR
amplitude was statistically significant
(P < .05, error variance = .2616) but
that it did not interact significantly
with the number of reinforcements as
had been expected.


FIG. 1. Average amplitude of the un-
conditioned GSR during reinforcement and
on test trials without the CS.

Figure 1 also shows the degree to 
which the omission of the CS pro- 
duced an increment in UCR ampli- 
tude. The exact quantitative nature 
of the relationship between the num- 
ber of reinforcements and the degree 
of increment produced by the omis- 
sion of the CS is shown in Fig. 2, 
which presents the average difference 
(i.e., increment) in GSR amplitude 
between the last two reinforced trials 
(combined) and the two test trials 
of UCS only (combined). A log-
arithmic scale has been used for the
abscissa in Fig. 2 to emphasize the 
linearity of the relationship when 
expressed in this way. 
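
The two difference scores used in these analyses, the diminution during reinforcement and the increment on the UCS-only test trials, can be written out as a short sketch. The function names are hypothetical, and the amplitudes are assumed to be already expressed in the transformed units.

```python
def mean(values):
    """Arithmetic mean of a non-empty sequence."""
    return sum(values) / len(values)


def diminution(reinforced_trials):
    """First two reinforced trials (combined) minus last two (combined)."""
    return mean(reinforced_trials[:2]) - mean(reinforced_trials[-2:])


def increment(reinforced_trials, ucs_only_trials):
    """Two UCS-only test trials (combined) minus last two reinforced trials."""
    return mean(ucs_only_trials) - mean(reinforced_trials[-2:])
```

For the group given four reinforcements, `reinforced_trials[:2]` and `reinforced_trials[-2:]` pick out Trials 1 and 2 and Trials 3 and 4, respectively, as in the text.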

It is clear in Fig. 2 that the degree 
to which the omission of the CS pro- 
duced an increment in UCR amplitude 
increased as a linear function of the 
logarithm of the number of reinforce- 
ments, i.e., as a negatively accelerated 
growth curve. The analysis of vari- 
ance of the averaged UCRs on the 
last two reinforced trials and the two 
UCS-only trials indicated that the 
overall increment in UCR amplitude 
produced by the omission of the CS 
was statistically significant (P < .025)
and that the interaction between this 
increment and the number of rein- 
forcements approached significance 
(P < .07). The linear component of 
the trend over log N was statistically
significant (P < .025). Error vari- 
ance for these tests = .1331. 
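
Because 4, 8, and 16 are equally spaced on a logarithmic scale, a linear trend over log N can be tested with the usual linear contrast for three equally spaced levels. The sketch below illustrates this point only; it is not a reconstruction of the analysis actually carried out, and the function names are hypothetical.

```python
import math

N_REINFORCEMENTS = [4, 8, 16]


def linear_coefficients_over_log(levels):
    """Center log2 of each level to obtain linear-trend contrast coefficients."""
    logs = [math.log2(n) for n in levels]
    center = sum(logs) / len(logs)
    return [x - center for x in logs]


def contrast_value(group_means, coefficients):
    """Weighted sum of group means for a given set of contrast coefficients."""
    return sum(m * c for m, c in zip(group_means, coefficients))
```

Here `linear_coefficients_over_log(N_REINFORCEMENTS)` gives -1, 0, and 1, so the linear component compares the group given 16 reinforcements with the group given 4.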

It was of interest also to examine
the relationship between the number
of reinforcements and GSR amplitude
during extinction. This evaluation
was contaminated in part by the fact
that all of the Ss had received two
presentations of the UCS alone be-
tween the end of reinforcement and
the beginning of extinction, which
probably tended to obscure any dif-
ferences in degree of conditioning
which might have occurred. For this
reason only the last three extinction
trials were used to evaluate the degree
of conditioning which occurred. When
the average amplitude of GSR on
these three trials was compared to
the average amplitude of the GSR on
the last three presentations of the CS
alone prior to conditioning it was
found that the average amplitude
of GSR increased greatly from before
to after the reinforcement period and
that this increase was largest in the
group which received eight reinforce-
ments. The overall difference from
before to after reinforcement was
highly significant, supporting the con-
clusion that conditioning occurred,
but the interaction between the
number of reinforcements and degree
of conditioning failed to reach
significance.


FIG. 2. The relationship between the
number of reinforcements and the amount
of increment produced by the omission of
the CS.


DISCUSSION 


The results of this experiment sup- 
port the notion that the CS becomes 
capable during conditioning of attenuat- 
ing the amplitude of the UCR. Not 
only did the amplitude of the UCR 
diminish during reinforcement, but, when 
the CS was omitted and the UCS pre- 
sented alone, the UCR recovered to 
approximately its original amplitude. 

It is of little significance to point out 
that S was surprised to receive the UCS 
without the CS and that this surprise, 
acting as a startle stimulus, may have 
been responsible for the increment in 
the GSR on shock-alone trials. The
fact that the presence of the CS serves as 
a warning to S that the shock is coming 
is, of course, one of the things that are 
“learned” during the conditioning proc- 
ess. Even when viewed in this way, 
the reduction-of-surprise function of
the CS is essentially an inhibitory func-
tion. That this inhibitory aspect of the
classical conditioning process is highly 
adaptive to the organism being condi- 
tioned is apparent, since a good deal of 
unnecessary emotional overresponding 
is brought under control or avoided by 
the association of the warning signal 
(i.e., the CS) with the noxious event
which is imminent. 

The relationship between the present 
findings and those of an earlier study 
of disinhibition in GSR conditioning 
(Kimmel & Fowler, 1961) is of some 
significance. In the earlier study it was 
shown that the addition of an extra, or 
disinhibiting, stimulus to the CS after 
reinforcement had the effect of producing 
an increment in the amplitude of the 
conditioned GSR and that this effect 
was positively related to the number of 
reinforcements given prior to the addi- 
tion of the extra stimulus. The inter- 
pretation of the results of the earlier 
study was based upon the assumption 
that the CS acquired inhibitory capa- 
bilities during reinforcement and that 
the extra stimulus inhibited this in- 
hibition, producing an increment in 
response amplitude, i.e., disinhibition. 
The present findings support this inter- 
pretation, since the omission of the CS 
had the same behavioral consequence 
as the inhibition of the inhibition con- 
trolled by the CS, i.e., in both cases 
the inhibition was removed or reduced. 
The observed positive relationship be- 
tween these effects and the number of 
reinforcements supports the contention 
that they result from an associative 
process as, of course, do the findings of 
Kimble and Ost (1961) that the con-
ventional interstimulus interval function
is involved.


SUMMARY 


This study tested the hypotheses that 
the amplitude of the unconditioned GSR is 
gradually attenuated by the CS during con- 
ditioning and that this reduction in UCR 
amplitude vanishes when the CS is omitted 
and the UCS presented alone. Three groups 
of 10 Ss received either 4, 8, or 16 reinforced 
presentations of a tone CS and electric shock 
UCS in a classical delayed paradigm. Fol- 
lowing reinforcement the CS was omitted and 
2 UCS alone trials were administered. After 
the tests on the shock alone, 4 extinction 
trials of the CS alone were given. 

It was found that the amplitude of the 
unconditioned GSR diminished during rein- 
forcement and that the UCR recovered to 
approximately its original amplitude when 
the CS was omitted. The amount of incre- 
ment produced by omitting the CS was a 
linear function of the logarithm of the number 
of reinforcements. These results were inter-
preted as supporting the notion that a condi-
tioned inhibitory process develops during
reinforcement, under the control of the CS, 
which attenuates the amplitude of the UCR 
in the presence of the CS. The relationship 
between these findings and similar findings 
in a study of the relationship between amount 
of disinhibition and number of reinforcements 
was thought to add further support to the 
interpretation offered. 


REFERENCES 


KIMBLE, G. A. Hilgard and Marquis' condi-
tioning and learning. New York: Appleton-
Century-Crofts, 1961.

KIMBLE, G. A., & Ost, J. W. P. A condi- 
tioned inhibitory process in eyelid condi- 
tioning. J. exp. Psychol., 1961, 61, 150-156. 

KIMMEL, H. D., & FOWLER, R. L. The
relationship between disinhibition resulting 
from compound stimulus presentation and 
the number of reinforcements. Off. Naval 
Res. tech. Rep., 1961, No. 3. (Contract 
NONR 580—09) 


(Received May 15, 1961) 




THE EFFECTS OF DRIVE AND DISCRIMINATION 
TRAINING ON STIMULUS GENERALIZATION 1


DAVID R. THOMAS 2


Duke University 


In 1959 Hanson reported in this 
journal a study on the influence of 
discrimination training on stimulus 
generalization. Hanson trained pi- 
geons to peck at a key illuminated by 
a monochromatic light of 550 mμ,
under a VI reinforcement schedule. 
After successive discrimination train- 
ing, Ss were subjected to a generaliza- 
tion test in extinction, following a 
procedure introduced by Guttman 
and Kalish (1956). 

Hanson found that time to the 
discrimination criterion varied as a 
negatively accelerated decreasing func-
tion of (S+, S—) difference. Post-
discrimination generalization gradi-
ents (PDGs) were higher and steeper
than a control gradient and in addi- 
tion, showed a shift in the peak of 
responding from the S+ in the direc- 
tion away from the S—. The extent 
of this shift varied also as a negatively 
accelerated decreasing function of 
(S+, S—) difference. 

The present study is an extension
of Hanson's work. The Ss were
tested on three different discrimina-
tion problems at three different levels
of food deprivation. The literature
is inconsistent with regard to the
influence of drive level on discrimina-
tion learning, and no study appears
to have been reported using the
operant free-responding technique.
In this study, Ss were tested for
generalization before discrimination
training was initiated, and in addition,
short generalization tests were inter-
spersed during the course of discrimin-
ation training so as to trace the
development of the changes in the
PDG.


1 This study was taken from a dissertation
submitted to the Psychology Department of
Duke University in partial fulfillment of the
requirements for the degree in psy-
chology. The writer is greatly indebted to
Norman Guttman under whose direction the
investigation was conducted. Thanks are
also due to Doris Homa Thomas for running
some of the Ss. At the time the research was
conducted the author was a Predoctoral
Research Fellow of the United States Public
Health Service.

2 Now at Kent State University.


METHOD 


Subjects.—The Ss were 54 experimentally
naive white carneau pigeons obtained from 
the Palmetto Pigeon Plant in Sumpter, 
South Carolina. 

Apparatus.—A bank of four identical
Skinner type key pecking apparatuses was
used. Each box had the following internal
dimensions: width, 15} in.; depth, 143 in.; 
height, 14}in. Walls and ceiling were painted 
flat black; floors were of unpainted Masonite. 
The S’s key of translucent plastic was exposed 
through a j-in. circular aperture placed 6} in. 
above the floor on one side of the box. Di- 
rectly below the key in the floor of each box 
was a 1-in. circular aperture through which
Ss had access to the food magazine. The 
magazine, which was operated by a motor- 
driven cam, allowed S approximately 4 sec. 
of access to food during each cycle. Between
cycles, the food was lowered beyond S's reach. 
A Plexiglas light fixture with a 15-w. bulb, 
placed directly above and in front of the floor 
opening illuminated the opening and the 
magazine for the duration of each cycle. 

Aside from the stimulus light and the 
magazine light during magazine cycles, the 
boxes were in darkness throughout the experi- 
ment. One box, Number 1, was set aside 
for generalization testing. The source of 
illumination for the key in this box was a 
Bausch and Lomb diffraction grating mono- 
chromator, Model 33-86-40, equipped with a 
108-w., 6-v., ac ribbon filament tungsten lamp.

Boxes 2, 3, and 4 were used for training
purposes only. The keys of these were
illuminated by 100-w., 120-v., ac projection
lamps. Positive and negative stimuli were
provided by Bausch and Lomb interference
filters. Each box was equipped with a “posi-
tive filter” (transmission peak 550 mμ) and a
“negative filter” (590 mμ for Box 2, 570 mμ
for Box 3, and 558 mμ for Box 4). The bright-
ness of the filter colors was matched to the


of a photomultiplier tube and its associated 
amplifier 


The three training boxes were set up to 
change stimuli automatically according to a 
prearranged schedule punched on a tape run 
on a Gerbrands programer. Throughout the 
entire experiment, masking noises were sup- 
plied to all boxes by a Grason-Stadler noise 
generator, Model 901. 

Procedure.—Upon arrival at the laboratory,
all Ss were weighed, individually caged, 
and allowed free access to food and water. 
Throughout the entire experiment free access 
to water was always available in the home 
cages. After 4 to 10 days of free feeding, a 
stable weight level was achieved by all Ss 
and food deprivation was begun. The Ss 
were randomly assigned to three body weight 
levels, 80% of ad lib. weight, 70%, and 60%, 
and to the three different discrimination 
problems, thus creating nine groups of Ss, 
one for each combination of weight level and 
discrimination problem. Deprivation ceased 
for each S when the appropriate weight was 
reached. At this point training was begun. 
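
The factorial assignment and the deprivation targets can be sketched as follows. The sketch assumes equal groups of 6 Ss (54 Ss in nine groups), uses a seeded pseudo-random generator for the random assignment, and labels the three discrimination problems by training box; the function names are hypothetical.

```python
import random
from itertools import product

WEIGHT_LEVELS = [80, 70, 60]             # per cent of ad lib. weight
PROBLEMS = ["Box 2", "Box 3", "Box 4"]   # one discrimination problem per training box


def assign_groups(subject_ids, seed=0):
    """Randomly split the Ss into equal groups, one per weight level X problem."""
    cells = list(product(WEIGHT_LEVELS, PROBLEMS))
    ids = list(subject_ids)
    if len(ids) % len(cells) != 0:
        raise ValueError("number of Ss must be a multiple of the number of groups")
    # Assumption: a seeded shuffle stands in for the random assignment.
    random.Random(seed).shuffle(ids)
    size = len(ids) // len(cells)
    return {cell: ids[i * size:(i + 1) * size] for i, cell in enumerate(cells)}


def target_weight(ad_lib_weight, level_percent):
    """Body weight at which deprivation ceases and training begins."""
    return ad_lib_weight * level_percent / 100
```

With 54 Ss, `assign_groups(range(54))` returns nine groups of 6; a bird with a hypothetical ad lib. weight of 500 g. assigned to the 80% level would begin training at 400 g.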

All Ss were adapted, magazine trained, 
and conditioned to key peck according to a 
set schedule covering 5 days. During this 
time, the box was in complete darkness with 
the exception of the key light and the maga- 
zine light during reinforcement cycles. 

All Ss were given VI reinforcement for 5 
days following conditioning. 60-sec. 
stimulus-on periods, alternated with 10-sec. 
stimulus-off periods, were given each day. 
The mean interval between reinforcements, 
not counting stimulus-off periods, was ap-
proximately 60 sec., with a range from 4 sec.
to 4 min. During stimulus-off periods, a 
shutter operated, removing the light from 
the key and leaving the experimental box 
dark. Reinforcements were never given 
during a stimulus-off period. One of the five 
VI sessions for each S was administered in 
Box 1, the test box, so as to accustom Ss 


to the test box and to the monochromator- 
produced stimulus. 

On the next day, after a 5-min. warm-up
period of VI in the test box, each S was sub-
jected to a generalization test in extinction.

Eleven different test stimuli were used:
510, 520, 530, 540, 545, 550, 555, 560, 570,
580, and 590 mμ. The 11 test stimuli were
randomized within a series and six different
random series were presented to each S.
This resulted in a schedule of 66 stimulus
presentations. Each stimulus presentation
was for 30 sec. and was followed by a 10-sec.


preceding period 
recorded and the stimulus was changed. 
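
A schedule of this kind can be generated as in the sketch below, which reads the test wave lengths as 510 through 590 mμ and uses a seeded pseudo-random generator; both the generator and the function name are assumptions made for illustration.

```python
import random

# Test wave lengths in millimicrons, as listed in the text.
TEST_STIMULI = [510, 520, 530, 540, 545, 550, 555, 560, 570, 580, 590]


def generalization_schedule(n_series, seed=0):
    """Concatenate n_series different random orders of the 11 test stimuli."""
    rng = random.Random(seed)
    seen = set()
    schedule = []
    while len(seen) < n_series:
        series = TEST_STIMULI[:]
        rng.shuffle(series)
        if tuple(series) in seen:
            continue  # redraw so that all series differ from one another
        seen.add(tuple(series))
        schedule.extend(series)
    return schedule
```

Six series give the 66 presentations of the preliminary test; the same routine with 3 or 12 series gives schedules of 33 or 132 presentations.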

On the day following the (preliminary) 
generalization test, discrimination training 
was begun. All three discrimination groups 
were trained with the same positive stimulus 
(S+), 550 mμ, and differed only with respect
to the negative stimulus (S−). Each
discrimination group consisted of 18 Ss, 6 at each
of the three body weight levels. These groups
may be identified by the negative stimulus
used: 590 mμ (trained in Box 2), 570 mμ
(Box 3), and 558 mμ (Box 4). During
discrimination training, responding to the
positive stimulus was reinforced according
to the same VI schedule previously used.
Responding to the negative stimulus was 
never reinforced. The positive and negative 
stimuli were presented successively in a pre- 
arranged random order. Fifteen 1-min. 
intervals of S+ and 15 of S— were presented 
each day. All stimulus changes were made 
during the 10-sec. blackout periods. The 30 
stimulus presentations comprised three blocks 
of 10, and within each block there were 5 
positive and 5 negative stimuli. Discrimina- 
tion training was continued until a criterion 
of no responding in five successive periods of 
S— combined with continued responding to 
S+ was achieved. 

During the course of discrimination train- 
ing, a three-series generalization test of 33 
stimulus presentations was administered at 
the completion of every even numbered daily 
session of discrimination training. If the 
discrimination criterion was met on a day 
on which an interpolated generalization test 
was scheduled, the test was omitted. On the 
day following the achieving of the criterion 
for discrimination, a final generalization test 
was administered to each S. The final test 
was carried out in the same manner as the 
preliminary and interpolated tests, with the 
exception that 12 test series (132 stimulus 
presentations) were used. 
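
The preliminary, interpolated, and final tests described above differ only in the number of randomized series (6, 3, and 12). A minimal Python sketch of how such a schedule could be generated is given here; the function name, the fixed seed, and the use of a software shuffle are illustrative assumptions and not the randomization procedure actually used in the experiment.

```python
import random

# Test wavelengths (in millimicrons) listed in the Method section.
TEST_STIMULI = [510, 520, 530, 540, 545, 550, 555, 560, 570, 580, 590]


def generalization_schedule(n_series, rng):
    """Return a presentation order built from n_series shuffled series.

    Every test stimulus appears exactly once in each series, and the
    order within a series is randomized independently.
    """
    schedule = []
    for _ in range(n_series):
        series = TEST_STIMULI[:]   # copy, so the master list is untouched
        rng.shuffle(series)        # randomize order within this series
        schedule.extend(series)
    return schedule


rng = random.Random(0)  # fixed seed: an assumption, for reproducibility only
preliminary = generalization_schedule(6, rng)    # 6 series of 11 stimuli
interpolated = generalization_schedule(3, rng)   # 3 series of 11 stimuli
final = generalization_schedule(12, rng)         # 12 series of 11 stimuli
```

The lengths of the three schedules (66, 33, and 132 presentations) follow directly from 11 stimuli per series.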


[Figure 1: percentage of total responses as a function of wavelength (mμ).]

Fig. 1. Postdiscrimination generalization
gradients after three different discrimination
problems, pooled over three drive levels.


RESULTS AND DISCUSSION 


Data bearing on three different 
problems will be presented in this 
report. The first problem concerns 
the roles of hunger drive level and 
(S+, S—) difference in determining 
the rate of formation of a discrimina- 
tion. The present experiment is fully 
in agreement with Hanson (1959) in 
the finding that the rate of formation
of a discrimination varies inversely
with the physical difference
between the stimuli to be discrimi- 
nated. Analysis of variance indicates 
that minutes to criterion varies as a 
function of the (S+, S—) difference 
(F = 23.50, df = 2/45, P < .01). A 
beneficial influence of drive level is 
suggested, but is not demonstrated 
at a statistically acceptable level 
of confidence (F = 2.96, df = 2/45,
.05 < P < .10). The results seem
sufficiently promising, however, to 
warrant a more direct attack of the 
problem, without the interpolation 
(as in the present case) of other 
experimental treatments (preliminary 
and interpolated generalization tests). 
The interaction between the (S+, S−)
difference and drive level is not
significant (F < 1, df = 4/45).

In the present experiment depriva- 
tion levels down to 60% of ad lib. 
weight were employed. Thus, the 
limits of the hunger motive were
assessed. We may therefore conclude
that even at its extremes, the hunger 
drive exercises far less control over 
the rate of discrimination learning 
than does the physical difference 
between the stimuli to be discriminated.

The second problem to be con- 
sidered here concerns the roles of drive 
level and the (S+, S—) difference in 
determining the location of the PDGs. 
In Fig. 1 are presented the PDGs of 
the three problem groups pooled over 
all drive levels. The PDGs show 
the characteristic displacement of the 
peak from the S+ an amount in- 
versely related to the (S+, S—) dif- 
ference. A convenient measure of 
the location of a gradient is derived 
by treating it as a grouped frequency 
distribution and computing the mean 
value. The difference between the 
means of the 18 individual mean 
scores in the three problem groups is 
significant (F = 10.74, df = 2/45, 
P <.01). The effect of the (S+,S—) 
difference on the location of the PDG 
is mirrored at all three levels of drive. 
However, the effect of drive level on 
the means of the final generalization 
gradients fails to achieve a statis- 
tically acceptable level of confidence 
(F= 2.56, df = 2/45, .05 < P < .10). 
Neither is the interaction between 
the (S+, S—) difference and drive 
level significant (F = 1.07, df = 4/45). 
Thus, Hanson's findings with regard
to the effect of the (S+, S−) difference
on the location of the central tendency
of the PDG are replicated at three
different levels of drive.
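
The location measure used above treats a gradient as a grouped frequency distribution and takes its mean. A minimal Python sketch of that computation follows; the function name and the response counts are hypothetical and serve only to illustrate the arithmetic, not to reproduce any S's data.

```python
# Test wavelengths (in millimicrons) from the Method section.
WAVELENGTHS = [510, 520, 530, 540, 545, 550, 555, 560, 570, 580, 590]


def gradient_mean(wavelengths, responses):
    """Mean of a gradient treated as a grouped frequency distribution.

    Each wavelength is weighted by the number of responses made to it.
    """
    total = sum(responses)
    if total == 0:
        raise ValueError("gradient contains no responses")
    return sum(w * r for w, r in zip(wavelengths, responses)) / total


# Hypothetical response counts (not data from the experiment).
# A gradient symmetric about 550 has its mean at 550.
symmetric = [1, 2, 4, 8, 10, 12, 10, 8, 4, 2, 1]
# The same counts moved toward the shorter wavelengths lower the mean.
shifted = [2, 4, 8, 10, 12, 10, 8, 4, 2, 1, 1]

mean_symmetric = gradient_mean(WAVELENGTHS, symmetric)
mean_shifted = gradient_mean(WAVELENGTHS, shifted)
```

Under these assumed counts the second mean is smaller than the first, which is the kind of displacement away from a longer-wavelength S− that the measure is meant to capture.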

In agreement with Hanson, we have 
found that both the time to criterion 
and the amount of displacement of the 
PDG are inversely related to the 
(S+, S—) difference. This suggests 
the possibility that amount of training 
is the vehicle through which the 
(S+, S—) difference has its effect on 
the PDG. The analysis of this question
was the third major problem to
which the present study addressed it- 
self. The administration of a short 
generalization test to all Ss after 2 days 
of training (with the exception of 
those Ss which had already met the 
learning criterion) makes possible an 
evaluation of the effects of the (S+, 
S—) difference, with amount of train- 
ing held constant. On the other hand, 
the analysis of the series of generaliza- 
tion tests given to all Ss during the 
course of discrimination training re- 
veals the role of amount of training 
with the (S+, S—) difference con- 
trolled. 

Although only 1 S from the 590-mμ
group failed to achieve the criterion
of discrimination learning within
2 days, 14 Ss from the 570-mμ group
and 16 Ss from the 558-mμ group
could be compared in this manner.
For each S a "displacement score"


TABLE 1

DISPLACEMENT OF THE MEAN OF THE GENERALIZATION GRADIENT WITH TWO SESSIONS OF DISCRIMINATION TRAINING AT TWO DIFFERENT (S+, S−) VALUES

     570-mμ Group               558-mμ Group
S No.    Displacement       S No.    Displacement
         of Mean (mμ)                of Mean (mμ)
  2         12.37            15          7.91
  5          2.39            54          3.11
 14          7.77            66          4.55
 56         14.72            51          7.90
 70         −0.62            12          6.49
 46          6.59            18          8.42
 47          7.04            44          9.52
 59          4.77            48          2.40
 72         −0.97            60          4.25
 43          2.22            67          7.91
 40          3.21            32         −2.25
 37          0.12            38          2.75
 31          7.82            25          9.64
 62         10.27            22         14.44
                             29          7.07
                             68          3.37
Mean         5.6            Mean         5.7


[Figure 2: mean value of the gradient (mμ) plotted against test number.]

Fig. 2. Mean of the gradient as a function of
ordinal position of test.


was obtained by subtracting the mean 
of the second generalization test from 
the mean of the first. The means of 
these displacement scores for the two 
groups were virtually identical, 5.6
mμ for the 570-mμ group vs. 5.7 mμ
for the 558-mμ group. The mean
displacement is the same in spite of 
the fact that in one case the (S+,S—) 
difference is more than twice that of 
the other! 

In Fig. 2 are presented the means 
of the preliminary generalization gra- 
dients obtained from all Ss, the means 
of the second obtained gradient, the 
third, etc. The data from all groups 
have been pooled since it appears that 
neither the (S+, S−) difference nor
drive level has any marked effect
on the location of the PDG if the 
amount of training is held constant. 
Under each of the values is recorded 
the number of Ss whose scores are 
represented in that mean value. It 
should be noted that the points 
plotted were not obtained from dif- 
ferent groups of Ss. On the con- 
trary, the 5 Ss which received five 
generalization tests were included 
among the 22 Ss which received (at 
least) four generalization tests, which,
in turn, were among the 32 Ss which
received (at least) three generalization 
tests, etc. The value of N con- 
tinually decreases indicating that all 
Ss had at least two generalization tests 
whereas progressively fewer Ss had 
three tests, four tests, etc. The figure 
shows a negatively accelerated decreasing
function. Even as few as two
sessions of discrimination training
produce a marked shift in the location
of the gradient.


The present procedure has made 
possible the separation of the effects of 
the (S+, S—) difference and amount of 
training on the location of the PDG. 
When this is done it appears that the 
effects of the (S+, S—) difference are 
mediated through the amount of training. 
There is no obvious explanation of the 
failure of the (S+, S—) difference to have 
a direct effect on the location of the PDG. 
It is clear, however, that this finding 
raises serious doubt about the adequacy 
of a Spence-type theory of inhibition 
(Spence, 1936, 1937) to account for the 
process of successive discrimination learn- 
ing in the operant free-responding situa- 
tion. It is to be hoped that future 
studies will suggest an alternative theo- 
retical framework which will have greater 
predictive and explanatory value within 
this context. 


SUMMARY 


The effects of drive and discrimination 
training on stimulus generalization were 
studied in the pigeon. Three groups of 18 Ss 
each, maintained at 60%, 70%, and 80% of 
ad lib. weight, respectively, were trained to
respond to a key illuminated by a light of 550
mμ. After 5 days of VI training, Ss were tested
for generalization to stimuli ranging from 500
mμ to 600 mμ. Then the three groups were
subdivided into three discrimination problem
groups, 550 mμ positive for each, but with
590 mμ, 570 mμ, and 558 mμ negative,
respectively. During discrimination training,
responding to S+ was VI reinforced; responding
to S− was never reinforced. Periodically
during the course of discrimination training 
short generalization tests were given to all 
Ss. After the criterion of discrimination was 
met, all Ss were subjected to a final generali- 
zation test. 

The major conclusions were: (a) minutes to 
criterion varies inversely with the (S+, S—) 
difference; a beneficial effect of drive is sug- 
gested but not conclusively demonstrated 
(.05 < P < .10). (b) Discrimination training
produces a general steepening of the PDG, a 
lowering of the gradient in the region of S—, 
and a shift of the central tendency from the 
region of S+ in the direction away from S—, 
the amount of shift varying inversely with 
the (S+, S—) difference. This finding is 
replicated for all three levels of drive. (c) 
Generalization gradients obtained during the 
course of discrimination training reveal that 
the mean of the gradient shifts in a negatively 
accelerated manner as a function of the 
amount of discrimination training. The
amount of training appears to be the vehicle
through which the effect of the (S+, S−)
difference on the location of the PDG is
mediated. Displacement varies with amount of
discrimination training, independent of any 
direct effect of the (S+, S—) difference. 


REFERENCES 


Guttman, N., & Kalish, H. I. Discriminability
and stimulus generalization. J. exp.
Psychol., 1956, 51, 79-88.

Hanson, H. Effects of discrimination training
on stimulus generalization. J. exp.
Psychol., 1959, 58, 321-334.

Spence, K. W. The nature of discrimination
learning in animals. Psychol. Rev., 1936,
43, 427-449.

Spence, K. W. The differential response in
animals to stimuli varying within a single
dimension. Psychol. Rev., 1937, 44, 430-444.


(Received May 19, 1961) 


Journal of Experimental Psychology
1962, Vol. 64, No. 1


THE LEARNING OF RESPONSES TO MULTIPLE
WEIGHTED CUES¹


STANLEY A. SUMMERS²


University of California, Los Angeles 


In many situations a response is 
dependent simultaneously on several 
different cues, i.e., on several poten- 
tially useful stimulus attributes. The 
purpose of this experiment was to 
analyze the learning of responses to 
simultaneously presented cues of dif- 
ferent validities, in order to deter- 
mine how much the responses came 
to depend on each cue. The implica- 
tions are relevant especially to situa- 
tions in which individuals react to 
several influences in reaching a judg- 
ment, decision, or interpretation. 


Questions about the effects of complex 
stimuli have been answered by several 
related types of experimental analysis, 
such as investigations of stimulus-com- 
pound patterning, of probability learn- 
ing, and of the utilization of multiple 
cues. In studies of cue utilization, cor- 
relations may be taken between each 
of several cues and the response variable. 
This correlational approach, which is 
particularly associated with the work 
of Brunswik (1956), was used in the 
present experiment. Brunswik devel- 
oped the concepts of ecological validity 
and functional validity. Ecological va- 
lidity is the correlation between the cue 
and an environmental variable pre- 
dicted by it. Functional validity is the 
correlation between the cue and a re- 
sponse, such as a judgment, directed 


1 This paper is based upon a PhD disserta- 
tion presented to the Graduate School of the 
University of California, Los Angeles. The 
author wishes to express his gratitude to 
Wendell E. Jeffrey for guidance and assistance 
throughout the investigation. He also wishes 
to thank Joseph A. Gengerelli and Irving 
Maltzman for their suggestions. 

2 Now at San Fernando Valley State
College.


toward the environmental variable. 
Stimulus attributes, then, can be dealt 
with as joint cues to some criterion vari- 
able. A correlational approach to cue 
utilization provides a method of abstract- 
ing from a series of responses the simul- 
taneous effects of several cues. 

Brunswik assumed that the functional 
validity of a cue will correspond, in most 
cases, to its ecological validity; the extent 
to which each cue is used should come, 
through learning, to conform to the ex- 
tent to which the cue predicts. With 
correlational analysis Brunswik investi- 
gated, for the most part, established 
patterns of cue utilization, rather than 
the learning process by which individuals 
acquire such patterns. He attempted to 
ascertain the use of cues in settings repre- 
sentative of the natural environment. 
To isolate the effect of cue validity, 
however, it seems desirable to establish 
ecological validities for originally mean- 
ingless cues in a laboratory setting and to 
observe the responses learned to these 
cues. 

An attempt to study the learning of 
functional validities in a controlled 
situation was made by R. Goodnow 
(Bruner, Goodnow, & Austin, 1956). 
The validity of a cue for predicting a 
correct category was defined as the rela- 
tive frequency of the cue’s association 
with that category. The results did not 
indicate any close correspondence be- 
tween cue use and objective validity. 
However, there are limitations in the 
treatment of this question in terms of 
frequencies rather than correlations. 
One limitation is that probability match- 
ing, which was considered the criterion 
of appropriate functional validity, would 
not have maximized success in the situa- 
tion and, therefore, did not have the 
adaptive character of a match of functional
to ecological validity in a correlational
framework. Smedslund (1955),
dealing less directly with the relation
between ecological and functional validity,
approached the topic of cue learning
through a correlational framework.
He established joint visual cues to a
predicted variable. A correlation was
imposed between each of the cues and
the correct response, giving each cue a
different validity. The main conclusion
was that Ss do learn "to utilize many
probabilistic cues simultaneously . . ."
(Smedslund, 1955, p. 26). In Smedslund's
design, however, a particular cue
was always associated with the same
validity, so that the effect of validity
could not be studied independent of the
effects of saliency or other cue characteristics.


[Figure 1: three example stimulus forms, labeled COLOR = 3 (yellow-orange), ANGLE = 6, AREA = 7; COLOR = 5 (yellow-green), ANGLE = 2, AREA = 2; COLOR = 7 (blue), ANGLE = 8, AREA = 8.]

Fig. 1. Three stimulus forms.


In the present study, correlations 
were imposed between each of three 
simultaneously presented visual cues 
and a variable, correct line length, 
whose magnitude varied with the 
magnitude of all three cues. Each 
of the experimentally imposed correla- 
tions will be called a cue weighting. 
Correlations were taken between cor- 
rect line length and S's response 
throughout a learning session, and 
between each of the individual cues 
and S's response. These empirically
derived correlations will be called
response weightings.

It was expected that the order of 
utilization of the three cues would 
come to conform to the order of cue 
validity. Specifically, it was pre- 
dicted that the response weightings 
would be different and would be 
ranked in the order of the cue weight- 
ings, and that the magnitude of the 
response weightings would approach 
that of the cue weightings. 


METHOD 


Subjects —The Ss were 30 members of a 
ninth grade class, 20 boys and 10 girls. This 
was a special class composed of students of 
high academic performance. 

Materials—The stimulus materials were 
384 geometrical forms photographed on 35- 
mm. Kodachrome slides and projected on a 
screen from an automatic projector. Each 
slide bore a number giving its position in the 
standard order of presentation. The pro- 
jected form was approximately 2 ft. in diam- 
eter. Each form consisted of an isosceles 
triangle on a circular white background. A 
portion of the triangle was black; the remainder
was of some hue. Three characteristics
of these stimuli varied from slide to
slide: color, angle, and area, that is,
the hue of the triangle, its orientation
with respect to the vertical, and the proportion
of the triangle that was colored rather
than black. Each of the three characteristics
varied over eight values; these were designated
1 through 8 (see Fig. 1).

A hue of red was assigned Value 1. The 
other hues and their values were red-orange 
(2), yellow-orange (3), yellow (4), yellow- 
green (5), green (6), blue (7), and violet (8). 
The two equal sides of the triangle were the 
long sides; the ratio of long to short was 5:2. 
If the point opposite the short side is regarded 
as an indicator, the orientation of the triangle 
can be described in terms of degrees of clock- 
wise rotation from the 0° (12 o'clock) position. 
Values were assigned to these positions as 
follows: 22.5° (1), 67.5° (2), 112.5° (3),
157.5° (4), 202.5° (5), 247.5° (6), 292.5° (7), 
337.5° (8). The triangles varied along the 
dimension of area from almost completely 
black (1), in equal increments of altitude, to 
almost completely colored (8). The black 
strip always paralleled the short side of the 
triangle. The values (1 through 8) of the
three dimensions varied independently of
one another. A standard stimulus series was
presented to all Ss.

The response materials consisted of book- 
lets of paper on which were printed horizontal 
lines approximately 7 in. long. Each line was 
an elongated arrow, pointing toward the right. 
The first 2 pages of a booklet contained 64 
lines, numbered 1 to 64, corresponding to 
numbers on the first 64 slides. The next 16 
pages contained 256 bracketed pairs of lines, 
each pair given a number corresponding to the 
number in the slide sequence. The lower 
line of each pair was completely covered by a 
cardboard strip, which was lightly pasted 
at both ends. Hidden by this strip was a 
small red mark which designated a correct 
length for that line. The last 2 pages of 
the booklet were identical to the first 2. 

Procedure and design.—All Ss were tested 
in a single session of approximately 2 hr. Two 
Es were present throughout the experiment. 
The instructions were a fairly straight-for- 
ward description of the task and its signifi- 
cance. The following is an abstract of the 
instructions.


The experiment will investigate how we 
learn to make certain kinds of judgments. 
You will see a series of slides projected on a 
screen—like this. Each of these forms 
contains the information you need to tell 
you how to mark off a particular length 
along an arrow on answer sheets like this. 
You will just make a small dash, measuring 
from the left, for how long you think the 
line should be. Sometimes it will go here, 
sometimes there. Each slide contains all 
the information you need to mark off the 
correct length. Your task in the experi- 
ment is to learn how to use this informa- 
tion. At first you will get no help in esti- 
mating how long the line should be. Later 
you will be able to find out how long the 
line should be after making your own guess. 
The answer sheets for most of the experi- 
ment will have two arrows attached like 
this. The correct answers are covered 
by paper strips. You can make a mark 
right on the line over the strip and then 
pull down or tear back the strip to see the 
correct answer. Then you can compare
this answer with the mark you made and 
with the picture. Be careful not to look 
at the correct mark until after you make 
your own. Each design on the slide tells 
you how long the line should be by the 
features that change from slide to slide. 
Which way the figure points, which color 
is on the slide, and the amount of color are 
all clues to how long the line should be,
and just how, you will have to learn
during the experiment. Using all these 
features at the same time you may learn 
to get the line length just right. This is 
difficult but the closer you can come the 
better. The first two pages are a prelimi- 
nary measurement before you get help in 
estimating how long the line should be. 
The last two pages, also, do not have correct 
answers marked. You will see each slide 
only a short time so you will have to judge 
pretty much by your first quick impression. 


A series of 384 slides was presented; each 
slide was projected for 10 sec., with a 2-sec. 
interval between slides. During each ex- 
posure S viewed the slide, made a mark on the 
upper line of the corresponding line-pair, 
tore back the cardboard strip, and compared 
the correct line length marked on the lower 
line with his mark and with the stimulus 
pattern. For the first 64 and the last 64 
trials, where no correct length was given, S
simply viewed the stimulus and marked the 
line. This provided a measure of operant 
level and an extinction series. A 10-min. rest 
period was given in the middle of the presen- 
tation. 

The stimulus series of 384 figures was 
composed of six blocks of 64 each. The last
block (E) was identical to the first (O).
Within each block of 64, each of the eight 
values of each cue appeared equally often in 
combination with each value of the other 
two cues. Each block was based on one of 
five different latin squares, selected from a 
complete set of 8 × 8 latin squares (Fisher
& Yates, 1948, p. 63). The effect of this 
arrangement was to provide a zero correlation 
between any two cues within each block and 
to make almost all stimuli different, aside from
the extinction series. Within each block 
the order of the various combinations of the 
three cues was randomized, and the distribu- 
tion of values for each cue was rectangular. 
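
The way a latin square yields uncorrelated cues within a block can be illustrated with a short Python sketch. The cyclic square used here is an assumption chosen for simplicity; the experiment drew its squares from Fisher and Yates, and the function names and seed below are likewise hypothetical.

```python
import random
from itertools import combinations
from math import sqrt


def latin_square_block(n=8, rng=None):
    """Build n*n stimuli from a cyclic n x n latin square.

    Row index gives the first cue, column index the second cue, and the
    cell symbol the third cue. All values run from 1 to n.
    """
    block = [(i + 1, j + 1, (i + j) % n + 1)
             for i in range(n) for j in range(n)]
    if rng is not None:
        rng.shuffle(block)  # randomize presentation order within the block
    return block


def pearson(x, y):
    """Pearson product-moment correlation of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


block = latin_square_block(rng=random.Random(1))  # seed is illustrative
correlations = {
    (a, b): pearson([s[a] for s in block], [s[b] for s in block])
    for a, b in combinations(range(3), 2)
}
```

Because every pair of values of any two cues occurs exactly once in the 64 stimuli, each of the three pairwise correlations is zero and each cue has a rectangular distribution.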

The relation of the cue values to the cor- 
responding series of correct line lengths was 
defined in the following way. Correct line 
length was made a function of the three 
cue values, as expressed by the equation 
CLL = 2C_1 + 1.5C_2 + C_3, where CLL is
correct line length, and the Cs are values of 
the three cues. The coefficients of the Cs 
determine the cue weightings. Since the 
minimum cue value was 1, and the maximum 
8, correct line length varied from 4.5, when 
all cue values were 1, to 36, when all cues had 
the value 8. The relation between each cue 
and correct line length can also be expressed 
in terms of a correlation. Since the cues were
uncorrelated, the correlations of the three cues
with correct line length were in the ratio
2:1.5:1. Since the square of a correlation
corresponds to the proportion of total variance
predicted, it is possible to calculate the value
of the correlation coefficient for a cue by the
equation:

(2r)^2 + (1.5r)^2 + r^2 = 1.00
r = .371

Thus the three correlations, or cue weightings,
were .371, .557, and .743. The procedures
provided S with all the information necessary
to make a correct response if he used each cue
correctly.


TABLE 1

EXPERIMENTAL CONDITIONS: WEIGHTINGS ASSIGNED EACH CUE IN DETERMINING CORRECT LINE LENGTH

             Cue Type and Weighting
Condition    .74      .56      .37
1            Color    Angle    Area
2            Color    Area     Angle
3            Angle    Color    Area
4            Angle    Area     Color
5            Area     Color    Angle
6            Area     Angle    Color
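
The cue weightings derived from the equation for correct line length can be checked numerically. The Python sketch below is an illustration only: it assumes the full 8 × 8 × 8 factorial set of cue values (in which the cues are exactly uncorrelated) rather than the 64-trial latin-square blocks of the experiment, and its names are hypothetical.

```python
from itertools import product
from math import sqrt

# Coefficients of the three cues in CLL = 2*C1 + 1.5*C2 + C3.
COEFFICIENTS = (2.0, 1.5, 1.0)


def pearson(x, y):
    """Pearson product-moment correlation of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


# Assumption: every combination of the eight values of the three cues.
stimuli = list(product(range(1, 9), repeat=3))
cll = [sum(a * c for a, c in zip(COEFFICIENTS, s)) for s in stimuli]

# Correlation of each cue with correct line length.
weightings = [pearson([s[k] for s in stimuli], cll) for k in range(3)]

# Closed form: solve (2r)^2 + (1.5r)^2 + r^2 = 1 for r, then scale.
r = 1.0 / sqrt(sum(a * a for a in COEFFICIENTS))
closed_form = [a * r for a in COEFFICIENTS]
```

Both routes give the same three values, which round to the .743, .557, and .371 reported in the text, and the range of correct line length runs from 4.5 to 36.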

In order to ascertain the effect of various 
cue weightings independent of other proper- 
ties of the cue, six experimental conditions 
were provided. The design was completely 
counterbalanced with respect to cue type 
(color, angle, area) and cue weighting and 
their combinations (see Table 1). These six 
conditions were six different sets of correct 
line lengths for the single series of slides. The
procedure for the various conditions differed
only in the construction of the response
booklet, that is, in the correct line lengths.
All six experimental groups participated in
the experiment together, being presented with
the stimulus series simultaneously. The Ss
were randomly assigned to experimental 
conditions. Data were obtained from 33 Ss; 
data from 1 randomly chosen S in each of 
three conditions were discarded to provide 
5 Ss in each condition. 


RESULTS 


For each block of 64 presentations, 
four Pearson correlation coefficients 
were computed for each S. These
were the correlations between his
response and each cue, as well as
between his response and correct
line length. The correlations were
averaged over Ss by converting each
correlation to Fisher's z, averaging the
z's, and converting the average to a
Pearson r. Curves showing average
response weightings for each cue 
weighting are presented in Fig. 2. 
The vertical position of the curves 
corresponds to the order of the cue 
weightings. 
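
The averaging procedure just described (convert each correlation to Fisher's z, average, convert back) can be sketched in a few lines of Python. The function name and the sample correlations are hypothetical and are given only to show the steps.

```python
from math import atanh, tanh


def average_correlations(rs):
    """Average Pearson correlations through Fisher's z transformation.

    z = atanh(r) for each correlation; the mean z is converted back
    to a correlation with r = tanh(z).
    """
    if not rs:
        raise ValueError("no correlations to average")
    zs = [atanh(r) for r in rs]
    return tanh(sum(zs) / len(zs))


# Hypothetical correlations for five Ss (not data from the experiment).
sample = [0.10, 0.25, 0.30, 0.45, 0.60]
mean_r = average_correlations(sample)
```

For correlations of unequal size the value obtained this way is somewhat larger than the arithmetic mean of the raw coefficients, because the z transformation stretches the upper end of the scale.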

An analysis of variance was per- 
formed on z transformations of the 
response weightings for the learning 
blocks, the four blocks in which correct
line length was presented to S.
The main question to be answered 
by the analysis is whether Ss were 
learning to respond differentially to 
the different cue weightings; that is, 
whether there was a significant dif- 
ference in vertical placement of the 
curves. The analysis (Lindquist, 
1953, Type VI) is summarized in 
Table 2. The F for cue weighting is 
significant beyond the .001 level. 
This result confirms the prediction 
that the response weightings would 
differ with the cue weightings. 

In addition, there is an over-all
trend toward increasing cue utilization
over these four blocks. The
block effect is significant beyond the
.005 level. In order to ascertain
whether there is a trend toward
increasing utilization of each separate
cue, the block effect was tested for
each cue weighting separately. The
block effect was significant for the
.74 weighting (P < .05, F = 2.79,
df = 3/72) and for the .56 weighting
(P < .025, F = 3.34, df = 3/72), but
not for the .37 weighting (F < 1).
The form of the learning curves does
not differ for different cue weightings;
the W × B interaction is not significant.


[Figure 2: mean response weighting plotted against blocks.]

Fig. 2. Mean response weightings (correlations
of cue with response) for different
cue weightings. (Correct line length was
presented in Blocks 1 through 4.)


TABLE 2

ANALYSIS OF VARIANCE OF RESPONSE WEIGHTINGS USING A z TRANSFORMATION

Source                    df       MS        F
Between Ss
  Conditions (C)           5     .1382      <1
  Error                   24     .2529
Within Ss
  Cue weighting (W)        2    1.3726     8.96**
  Blocks (B)               3     .1858     5.40*
  W × B                    6     .0233     1.03
  W × C                   10     .2071     1.35
  B × C                   15     .0154      <1
  W × B × C               30     .0276     1.22
  Error (w)
    Error_1               72     .0344
    Error_2               48     .1531
    Error_3              144     .0227

* P < .005.
** P < .001.

The question of the final magni- 
tudes of the response weightings could 
not be answered since there was no 
indication that learning had reached 
a limit. It is possible, however, to 
take the observed correlation of 
response with correct line length on 
a given block and to compute what 
the three response weightings would 
be if they contributed to this total 
correlation in the expected ratio 
2:1.5:1. The response weightings 
computed on this basis are presented 
in Table 3, along with the corre- 
sponding observed weightings. Table 
3 indicates that cue utilization,
throughout the learning trials, is
roughly proportional to cue validity, 
although the highest-weighted cue is 
being used slightly more, and the 
lowest slightly less, than would be 
predicted on the basis of propor- 
tionality. 
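
One reading of the proportionality computation is as follows. With uncorrelated cues, the correlation of response with correct line length equals the sum of each cue weighting times the corresponding response weighting; if the response weightings are k times the cue weightings, that sum is k times the sum of squared cue weightings, which is 1, so k equals the observed correlation. The Python sketch below applies this reading; the function name is hypothetical, and the block correlations are those given in the CLL row of Table 3.

```python
# Cue weightings derived in the Method section.
CUE_WEIGHTINGS = (0.743, 0.557, 0.371)

# Observed correlation of response with correct line length, by block
# (CLL row of Table 3).
OBSERVED_CLL = {1: 0.203, 2: 0.271, 3: 0.313, 4: 0.418}


def proportional_weightings(r_cll):
    """Response weightings expected if cues are used in the ratio 2:1.5:1.

    Each expected weighting is the observed response-CLL correlation
    multiplied by the cue weighting, rounded to two places as in Table 3.
    """
    return tuple(round(r_cll * w, 2) for w in CUE_WEIGHTINGS)


expected = {block: proportional_weightings(r)
            for block, r in OBSERVED_CLL.items()}
```

Under this reading the computed values agree with the P rows of Table 3 for all four learning blocks, which suggests, though it does not establish, that this is the computation the author performed.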


DISCUSSION 


The results indicate that Ss responded 
simultaneously and differentially to the 
multiple cues. The differential response 
is shown by the highly significant cue 
weighting effect. Simultaneous response 
to the multiple cues, rather than use 
by each S of only a single cue, is indi- 
cated by the fact that this weighting 
effect transcends the S × Weighting
interaction contained in the error term
(Error_2). Also, once Ss are exposed to
correct line length, the correlation of 
response with correct line length is higher 
than with the most heavily weighted cue 
(see Fig. 2), indicating successful use of 
more than the most valid cue by itself. 
Individual Ss must have responded to 
more than one cue during any given 
block of trials. The curves suggest that
there is some response to the lowest-weighted
cue, although in the statistical
test this was not rising significantly.
In general the Ss made an appropriate
use of the cues available.


TABLE 3

OBSERVED MEAN RESPONSE WEIGHTINGS (O) COMPARED WITH RESPONSE WEIGHTINGS COMPUTED ON THE BASIS OF PROPORTIONALITY (P)

                           Block
Cue weighting       1       2       3            4
CLL               .203    .271    .313         .418
.74     P         .15     .20     .23          .31
        O         .19     .24     [illegible]  .32
.56     P         .11     .15     .17          .23
        O         .10     .15     .17          .24
.37     P         .08     .10     .12          .16
        O         .04     .01     .06          .07

The importance to adaptive behavior 
of appropriate response to multiple cues 
has been emphasized for various situa- 
tions: in terms of the learning of func- 
tional relations (Miller, 1959), for pre- 
diction of judgmental behavior (Johnson, 
1955), with application to clinical ap- 
praisal (Hammond, 1955; Hoffman, 
1960). Important differences of em- 
phasis are implied in the choice of any 
one methodological framework for the 
study of cue utilization. In particular 
one may contrast the orientation implied 
by a correlational framework with that 
implied by an event frequency frame- 
work, such as is used in studies of prob- 
ability matching. Both frameworks may 
be used to study adaptation to environ- 
mental uncertainties. The event fre- 
quency situation provides for the success 
of a given response to be probabilistic. 
Usually the best adaptation possible is 
consistent choice of the most frequent 
alternative, with probability matching 
implying less than optimal adjustment. 
In a correlational framework, the prob- 
abilistic character of the individual cues 
does not necessarily adhere to the cues 
in combination, which may be unequivo- 
cally related to the criterion as they 
were in the present study. In terms of 
behavior, the matching of response 
weighting to cue weighting would imply 
fully optimal adjustment to the situa- 
tion; there may be consistently accurate 
responses on the basis of individually 
limited cues. A correlational framework,
then, emphasizes cue utilization as a
process which tends to remove uncertainty.
Much of our judgmental and discriminative
behavior is characterized by a sufficient
degree of sureness and adaptiveness to be
best studied within such a framework.


SUMMARY 
This experiment investigated the relation
between the objective validity of certain cues
and the extent to which these cues were used.
The purpose was to analyze the learning of
responses to multiple cues of different validi- 
ties, in order to determine how much the 
responses came to depend on each cue. The 
independent variable was the correlation 
imposed between a criterion and each of three 
simultaneously presented visual cues. The 
dependent variable was the correlation 
between the cues and Ss’ responses. 

The Ss were 30 members of a ninth grade 
class. The stimulus materials were geometric 
forms. Three characteristics of these stimuli 
varied, acting as cues for the prediction of a 
line length. These characteristics were based 
on the color, angle, and area of parts of the 
forms, and were counterbalanced to establish 
independent cue validities. The cue validities
of .74, .56, and .37 together permitted a per- 
fect prediction of line length. During four 
blocks of 64 trials each, Ss responded and 
then were presented with the correct line 
length. 
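
The statement that validities of .74, .56, and .37 together permit perfect prediction can be checked with a short calculation. The sketch below assumes that the three cues were mutually uncorrelated and combined linearly, in which case the squared multiple correlation is the sum of the squared validities; the variable names are illustrative only.

```python
# Cue validities as reported in the summary.
cue_validities = [0.74, 0.56, 0.37]

# Assumption: uncorrelated cues combined linearly, so that
# R^2 equals the sum of the squared cue-criterion correlations.
r_squared = sum(r ** 2 for r in cue_validities)
multiple_r = r_squared ** 0.5

print(f"R^2 = {r_squared:.4f}, R = {multiple_r:.3f}")
```

Under that assumption the squared validities sum to approximately .998, which is within rounding of a perfect multiple correlation.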

Successful cue utilization increased during 
the learning session. The Ss responded to
different cues simultaneously, and the extent 
to which cues were used differed with validity. 
Cue utilization was roughly proportional to 
cue validity throughout the learning trials. 


REFERENCES 


Bruner, J. S., Goodnow, J. J., & Austin,
G. A. A study of thinking. New York:
Wiley, 1956.

Brunswik, E. Perception and the representative
design of psychological experiments.
Berkeley: Univer. California Press, 1956.

Fisher, R. A., & Yates, F. Statistical tables
for biological, agricultural, and medical
research. London: Oliver & Boyd, 1948.

Hammond, K. R. Probabilistic functioning
and the clinical method. Psychol. Rev.,
1955, 62, 255-262.

Hoffman, P. J. The paramorphic representation
of clinical judgment. Psychol. Bull.,
1960, 57, 116-131.

Johnson, D. M. The psychology of thought
and judgment. New York: Harper, 1955.

Lindquist, E. F. Design and analysis of
experiments. New York: Houghton Mifflin,
1953.

Miller, N. E. Liberalization of basic S-R
concepts. In S. Koch (Ed.), Psychology:
A study of a science. Vol. 2. General
systematic formulations, learning, and special
processes. New York: McGraw-Hill, 1959.

Smedslund, J. Multiple-probability learning.
Oslo: Akademisk Forlag, 1955.


(Received May 23, 1961) 




Journal of Experimental Psychology 
1962, Vol. 64, No. 1, 35-39 


STIMULUS-RESPONSE CONTIGUITY IN CLASSICAL
AVERSIVE CONDITIONING¹


R. A. CHAMPION 
University of Sydney 


Stimulus-response contiguity is 
widely accepted as a necessary, if not 
sufficient, condition of learning. In 
particular, it has been suggested that 
even though reinforcement may be 
required for complete learning, a 
response which is being learned can- 
not be reinforced until it has begun 
to occur and that S-R contiguity
may operate as a crucial factor in 
initiating any learning process (Mason, 
1959). In classical conditioning, de- 
gree of S-R contiguity is represented 
in the temporal interval between the 
CS and the UCR, and the following 
experiment was designed to test the 
hypothesis that, early in conditioning, 
performance is proportional to degree 
of CS-UCR contiguity. In order to 
make this test, a short-latency
response (eyeblink) and a long-latency
response (GSR) were conditioned 
simultaneously in each S and the 
time relations between CS and UCR 
were varied so as to permit greater 
contiguity of eyeblink and CS in one 
group of Ss and greater contiguity of 
GSR and CS in a second group. This
type of arrangement is represented 
in Fig. 1; with an air puff as the UCS, 
the CS (tone, T) may be presented 
near the unconditioned eyeblink as 
in the case of T1, or near the uncondi- 
tioned GSR as with T2. In terms of 
the contiguity hypothesis it was pre- 
dicted that the eyeblink would condi- 
tion better than the GSR with T1 
whereas the reverse would hold with 
T2, i.e., that there would be interaction
between the CS-UCS interval
and the latency of the UCR as independent
variables affecting conditioning.
By the same token it was
expected that the early conditioning
of the eyeblink would be better with
T1 than with T2 but that T2 would
produce the superior performance in
the case of the GSR.

¹ Part of the apparatus used in this study
was provided by Commonwealth of Australia
Research Grant No. 2002.

The experiment provided an op- 
portunity to confirm an earlier finding 
that backward conditioning produced 
some true learning of the long-latency 
GSR (Champion & Jones, 1961). 
It is apparent from Fig. 1 that UCS-
CS trials must be used in the condi- 
tioning of the GSR if optimum CS- 
UCR contiguity is to be achieved and 
this arrangement was therefore em- 
ployed in the second of the two groups 
described above. After these two 
groups had been trained, however, 
the question arose as to the real mean- 
ing of “backward conditioning.” The 
accepted definition seems to be put 
simply in terms of the presentation 
of the UCS before the CS, but this 
does not guarantee that the UCR will 
precede the CS. In fact, in the second 


FIG. 1. Schematic representation of time
relations between stimuli and responses in
the three groups of the experiment. (T1, T2,
and T3 represent the location of the CS in
Groups 1, 2, and 3, respectively.)
[Figure: time axis in msec.; curve labeled CR (GSR).]




arrangement described above, favor- 
ing contiguity of GSR and tone (T2 
in Fig. 1), the CS still precedes the 
UCR. A third group was therefore 
trained with an even longer backward 
UCS-CS interval so as to ensure that 
the unconditioned GSR preceded the 
CS (T3 in Fig. 1). An attempt was 
made to set this interval so that 
degree of contiguity was nevertheless 
approximately the same as in the 
second group, i.e., so that CS-UCR 
contiguity matched UCR-CS con- 
tiguity. The use of this third group 
allowed a test of the possibility that 
R-S and S-R contiguity are equally 
effective. 


METHOD 


Subjects.—The Ss were 49 volunteers from
courses in psychology at the University of 
Sydney. 

Apparatus.—The general experimental sit- 
uation and the apparatus for measuring the 
GSR have been described elsewhere (Cham- 
pion & Jones, 1961). The eyeblink was 
measured with the system developed at the 
State University of Iowa laboratory, i.e., 
a microtorque potentiometer linked me- 
chanically to S's right eyelid and electrically 
to an ink-writing recorder through a dc 
amplifier. The CS was a 2000-cps, 90-db. 
tone of 50-msec. duration delivered to S 
through headphones. The UCS was a 3.0-psi 
air puff applied to S’s right eye through a 
.062-in. diameter tube and set at 50 msec.
duration by means of an ac solenoid valve. 
All time intervals except the intertrial interval 
were controlled with electronic timers. 

Procedure.—In the course of general in- 
structions Ss were asked to keep their eyes 
open except for normal blinking. The series 
of trials for each S began with one UCS-alone 
trial followed by one CS-alone trial. The
remaining presentations of stimuli consisted 
of training trials (CS-UCS or UCS-CS) inter- 
spersed with test trials (CS alone) in the 
following order: 3 training, one test, 7 train- 
ing, one test, 10 training, and one test trial. 
Thus test trials to assess the progress of 
conditioning were administered after 0, 3, 10 
and 20 training trials. The training series 
was limited to 20 trials because the hypothesis 
under test dealt only with the early stages 
of conditioning. The intertrial interval
averaged 30 sec. and varied among 25, 30, and
35 sec. in prearranged random order. No
ready signal was given at any stage.
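
The trial sequence just described can be laid out programmatically as a check on the stated test points. The following is only a sketch: the function names are illustrative, and the intertrial scheme (equal numbers of 25-, 30-, and 35-sec. intervals in shuffled order) is an assumption, since the prearranged order actually used is not reported.

```python
import random

def build_schedule():
    """Ordered trial types: pretests, then training blocks each followed by a test."""
    # One UCS-alone trial, then the CS-alone trial that serves as the
    # test after 0 training trials.
    trials = ["UCS-alone", "test"]
    for n_training in (3, 7, 10):
        trials += ["training"] * n_training + ["test"]
    return trials

def training_trials_before_tests(trials):
    """Number of training trials completed before each test trial."""
    counts, done = [], 0
    for trial in trials:
        if trial == "training":
            done += 1
        elif trial == "test":
            counts.append(done)
    return counts

def intertrial_intervals(n_intervals, seed=0):
    """Assumed scheme: equal numbers of 25-, 30-, and 35-sec. intervals, shuffled."""
    if n_intervals % 3:
        raise ValueError("n_intervals must be divisible by 3 for a 30-sec. mean")
    intervals = [25, 30, 35] * (n_intervals // 3)
    random.Random(seed).shuffle(intervals)
    return intervals

schedule = build_schedule()
itis = intertrial_intervals(len(schedule) - 1)
print(training_trials_before_tests(schedule))  # [0, 3, 10, 20]
print(sum(itis) / len(itis))                   # 30.0
```

The 25 trials are separated by 24 intervals, so an equal split among the three durations gives the reported 30-sec. average exactly.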

As represented in Fig. 1, Groups 1 and 2 
were conditioned with a CS-UCS interval of
400 msec. and a UCS-CS interval of 1200 
msec., respectively. Group 3 was then 
trained with a UCS-CS interval of 2800 msec. 
A conditioned eyeblink was defined as occur- 
ring on test trials with a pen deflection of 1 
mm. or more within the interval 200-500 
msec. following the onset of the CS. No 
attempt was made to exclude “voluntary
responders.” A GSR on test trials was taken 
to be any response occurring in the interval 
1-4 sec. after the CS. There was some adap- 
tation of the GSR to the air puff and the 
results of 4 Ss who failed to give a response 
on 6 or more of the 20 training trials were 
discarded. This left 15 Ss in each of the three 
groups. There were 7, 8, and 6 women in 
Groups 1, 2, and 3, respectively; a survey of
the results showed that the men gave more
CRs of each type than the women, but not
significantly more.


RESULTS 


In view of the predicted interaction 
between the length of the CS-UCS 
interval and the latency of the UCR 
it was necessary to make a direct 
comparison between the performance 
of the eyeblink and the GSR. For 
this purpose a technique was used 
which had already proved effective 
(Mason, 1959) and which allowed a 
measure of the frequency of the conditioned
GSR. For each S the GSR
scores (change in conductance in 
micromhos) obtained on test trials 
during training were expressed as a 
ratio of the response on the first 
test trial, before training; this trans- 
formation overcame individual dif- 
ferences in general sensitivity of 
response. In the few cases where 
there was no response on the first test 
trial the least measurable response 
of 0.1 micromhos was assumed. The 
transformed scores, ranging from 0 
to 35.0, were pooled and found to have 
a median of 2.6. Each value at or 
above 2.6 in the transformed scores 
was then taken as a conditioned GSR. 
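
The scoring rule for the conditioned GSR can be illustrated with a small example. This is a sketch under stated assumptions: the conductance scores below are invented for illustration and are not data from the study, and the names `transform`, `FLOOR`, and `subjects` are hypothetical. The pooled median of the transformed scores plays the role that 2.6 played in the experiment.

```python
from statistics import median

FLOOR = 0.1  # least measurable response (micromhos), used when the pretest response is zero

def transform(pretest, test_scores):
    """Express each test-trial score as a ratio of the pretest (first test trial) response."""
    base = pretest if pretest > 0 else FLOOR
    return [score / base for score in test_scores]

# Hypothetical change-in-conductance scores: (pretest, [scores on later test trials]).
subjects = {
    "S1": (0.5, [1.0, 2.0, 1.5]),
    "S2": (0.0, [0.2, 0.5, 0.1]),  # no pretest response, so FLOOR is substituted
    "S3": (1.0, [0.0, 3.5, 2.5]),
}

ratios = {s: transform(pre, scores) for s, (pre, scores) in subjects.items()}

# Pool all transformed scores and take the median as the criterion.
pooled = [r for rs in ratios.values() for r in rs]
cutoff = median(pooled)

# A response at or above the pooled median counts as a conditioned GSR.
conditioned = {s: [r >= cutoff for r in rs] for s, rs in ratios.items()}
print(cutoff, conditioned)
```

Because the criterion is the pooled median, about half of all test-trial responses are necessarily scored as conditioned; the measure is informative only for comparisons between groups and between stages of training.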




The eyeblink and the GSR were 
conditioned simultaneously in each 
S and it was possible to test for the 
predicted interaction by comparing 
across groups the proportion of Ss 
giving more conditioned eyeblinks 
than conditioned GSRs on the test 
trials during training. These data 
are set out in Table 1; the result for 
Groups 1 and 2 was in accord with 
the prediction and a Fisher exact 
probability test (Siegel, 1956), ap- 
plied with cases of equal frequency of 
response omitted, showed the inter- 
action to be significant (P < .01 for 
a two-tailed test). Comparison of
these results with the data from Group
3 showed the interaction to be significant
for Groups 1 and 3 (P < .05)
but not for Groups 2 and 3.


TABLE 1

NUMBER OF Ss GIVING MORE, EQUAL, AND
FEWER CONDITIONED EYEBLINKS
COMPARED WITH CONDITIONED
GSRs ON TEST TRIALS
DURING TRAINING

Order               Group 1    Group 2    Group 3
                    (N = 15)   (N = 15)   (N = 15)
Eyeblinks > GSRs       9          0          3
Eyeblinks = GSRs       3          4          2
Eyeblinks < GSRs       3         11         10
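
The Fisher exact probabilities can be recomputed from the frequencies in Table 1 with the tied cases omitted. The sketch below is not the author's procedure: it assumes one common two-tailed convention (summing the probabilities of all tables no more likely than the one observed), and the function name is illustrative.

```python
from math import comb

def fisher_exact_two_tailed(a, b, c, d):
    """Two-tailed Fisher exact probability for the 2 x 2 table [[a, b], [c, d]]."""
    row1, row2, col1 = a + b, c + d, a + c
    total = comb(row1 + row2, col1)

    def prob(x):
        # Hypergeometric probability of x cases in the upper-left cell.
        return comb(row1, x) * comb(row2, col1 - x) / total

    observed = prob(a)
    lowest, highest = max(0, col1 - row2), min(row1, col1)
    # Sum over all tables at least as extreme (no more probable) as the observed one.
    return sum(
        p for p in map(prob, range(lowest, highest + 1))
        if p <= observed * (1 + 1e-9)
    )

# Rows: (Ss with eyeblinks > GSRs, Ss with eyeblinks < GSRs); ties omitted.
group1, group2, group3 = (9, 3), (0, 11), (3, 10)

p_1_vs_2 = fisher_exact_two_tailed(*group1, *group2)
p_1_vs_3 = fisher_exact_two_tailed(*group1, *group3)
p_2_vs_3 = fisher_exact_two_tailed(*group2, *group3)
print(p_1_vs_2, p_1_vs_3, p_2_vs_3)
```

Under this convention the three comparisons fall below .01, between .01 and .05, and above .05, respectively, which agrees with the pattern of significance reported in the text.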

The other aspect of the predicted 
interaction involved within-response 
comparisons over the various CS-UCS 
intervals. Performance curves for 
the eyeblink CR are shown in Fig. 2. 
The performance of Group 1 was 
consistently superior to that of Group 
2, as predicted, but the application 
of Fisher tests showed that a sta- 
tistically significant difference did not 
emerge until 20 training trials 
(P < .01); at 10 trials the difference
approached significance (.10 > P > .05).
The same relationship held between
Groups 1 and 3 but there was no
significant difference at any stage
between Groups 2 and 3. An overall
test at 20 trials proved significant
(χ² = 12.09, P < .01, df = 2). These
results conform to the general finding
that a 400-msec. interval is almost
optimum for the eyeblink and that
backward conditioning of this response
is ineffective.


FIG. 2. Performance curves for eyeblink
conditioning. (Ordinate: percent conditioned
responses; abscissa: trials.)

The more important within-re- 
sponse comparison concerned the 
GSR, and performance curves for 
the conditioning of this response 
are shown in Fig. 3. As predicted, 
Group 2 was consistently superior 
to Group 1, but according to Fisher 
tests the difference was only sta- 
tistically significant at 3 trials 
(P < .05). Groups 1 and 3 did not 
differ significantly at any stage. An
overall test at 3 trials gave a significant
result (χ² = 6.66, P < .05, df = 2).
More sensitive measures than frequency
of response were available
for the GSR and the transformed
amplitude scores were studied for
further information. Nonparametric
statistical tests were used because of
the marked positive skew in the
distribution of these scores. The
results given above were confirmed;
in addition, the application of a U
test at 20 trials showed that the
difference between Groups 1 and 2
approached significance (U = 68,
.10 > P > .05). At this stage there
was a significant difference in amplitude
of response between Groups 2
and 3, with the former superior
(U = 51, P < .05), but not between
Groups 1 and 3. An overall test
applied to the three groups at 20
trials proved significant (H = 6.99,
P < .05, df = 2).


FIG. 3. Performance curves for GSR
conditioning. (Ordinate: percent conditioned
responses; abscissa: trials. Separate curves
for Groups 1, 2, and 3.)

As a check on the possible intro- 
duction of some artifact in the trans- 
formation of the GSR scores, Mood’s 
test of trend (McNemar, 1955) was 
applied to the raw change-in-con- 
ductance scores for each group. The 
observed trends in these data were 
virtually identical with those in the 
frequency data, depicted in Fig. 3; 
they were significant for Groups 2
(χ² = 11.8, P < .01, df = 3) and 3
(χ² = 11.4, P < .01), but not for
Group 1 (χ² = 2.20, P > .50).


DISCUSSION


The predictions about the effects of 
S-R contiguity in classical conditioning 
were confirmed by the data. It may be 
asked why exact contiguity of CS and 
UCR was not chosen as the condition 
most likely to yield optimum perform- 
ance early in conditioning; with the 
assumption of latencies of 200 msec. 
and 2000 msec. for the unconditioned 
eyeblink and GSR, respectively, this
would have been achieved with the use
of corresponding backward UCS-CS 
intervals. A pilot study conducted 
along these lines before the present 
experiment produced results exactly in 
keeping with the contiguity hypothesis, 
but the level of conditioning in both 
groups was so low that the interaction 
was not statistically significant. Poor 
conditioning with a short backward 
interval is an established result for a 
short-latency response (e.g., Wolfle,
1932) and the results of Groups 2 and 3 
in the present experiment suggest that 
there is a decrease in the effectiveness of 
GSR conditioning in early stages with 
a backward interval somewhere between 
1200 msec. and 2800 msec., possibly of 
the order of 2000 msec. An obvious 
inference to be drawn from these con- 
siderations is that the required con- 
tiguity lies not between CS and UCR as 
overt events but rather between cor- 
responding physiological processes, lag- 
ging behind the CS in stimulus reception 
and preceding the UCR in response 
evocation. 
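
The time relations discussed above reduce to simple arithmetic on the stated intervals. The sketch below takes the CS placements from the Procedure and the UCR latencies (200 and 2000 msec.) that the Discussion treats as assumptions; the dictionary and function names are illustrative.

```python
# CS onset in msec. relative to UCS onset (negative means the CS comes first).
cs_onset = {"Group 1": -400, "Group 2": 1200, "Group 3": 2800}

# Assumed UCR latencies in msec. after UCS onset, as given in the Discussion.
ucr_latency = {"eyeblink": 200, "GSR": 2000}

def cs_to_ucr(group, response):
    """Signed CS-to-UCR interval: positive when the CS precedes the UCR."""
    return ucr_latency[response] - cs_onset[group]

for group in cs_onset:
    for response in ucr_latency:
        print(group, response, cs_to_ucr(group, response))
```

On these figures the GSR follows the CS by 800 msec. in Group 2 and precedes it by 800 msec. in Group 3, which corresponds to the attempt to match CS-UCR and UCR-CS contiguity; exact contiguity would require backward intervals equal to the latencies themselves.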

It is tempting to suppose that the CS 
must precede the UCR for contiguity 
to be effective. This would explain why 
the conditioning of the eyeblink did not 
occur in Groups 2 and 3, and why Group 
3 was inferior to Group 2 and comparable 
with Group 1 on some counts in the 
conditioning of the GSR. The supposi- 
tion is contradicted, however, by the 
comparable performance of Groups 2 
and 3 with respect to the relative fre- 
quency of the two types of CR (Table 1) 
and by the presence of a significant trend 
in the GSR conditioning of Group 3. 
Both these latter results are probably due 
to the early rise in the performance of
Group 3, at three training trials. Contiguity
appears to have had some effect
when the UCR preceded the CS (Group
3), but the performance was inferior to
and not as sustained as that obtained
with a comparable degree of CS-UCR
contiguity (Group 2). Such an outcome 
could be due to two factors which are not 
mutually exclusive. First, if allowance 
is made for the physiological processes
delineated above, then the contiguity
between stimulus reception and response 
initiation would be greater in Group 2 
than in Group 3 (Fig. 1); under the 
conditions of Group 3, of course, some 
physiological mechanism must be found 
which permits contiguity to act when 
response initiation precedes stimulus 
reception. Second, if delay of reinforce- 
ment is represented in the interval be- 
tween CR and UCS, then it is greater in 
Group 3 than in Group 2 and perform- 
ance in the former group should not have 
been so sustained as in the latter group 
for that reason; this consideration brings 
with it the problem of “backward” 
reinforcement when the UCS precedes 
the CR. More generally, it may seem 
improper to invoke the effects of rein- 
forcement if attention is limited to the 
initial stages of learning, where con- 
tiguity has been taken as the prime 
factor operating. Reinforcement may 
take effect, however, as soon as the 
learned response appears, no matter 
how weakly, and the precise separation 
of this effect from that of contiguity 
awaits the use of some more refined 
technique than mere reference to early 
and late stages of learning. 

Attention should be drawn to two 
other features of the data. There is 
some conflict with the results of the pre- 
vious study (Champion & Jones, 1961), 
where the backward conditioning of the 
GSR (with a UCS-CS interval of 500 
msec.) was inferior from the outset to 
forward conditioning (with a CS-UCS 
interval of 500 msec.) whereas the per- 
formance of Group 2 was superior to that 
of Group 1 in the present study. The 
discrepancy might be accounted for in 
terms of differential degrees of contiguity 
and reinforcement, but the differences 
in procedure in the two experiments are 
too great to permit this explanation to 
be pursued with any confidence. A 
second point of interest was the appear- 
ance in the present data of a statistically 
significant contiguity effect early in 
training with the GSR but relatively 
later with the eyeblink. This result 
would emerge if greater contiguity was
achieved by chance in Group 2 with the
GSR than in Group 1 with the eyeblink. 
It should be noted that the only set of 
conditions favoring performance on the 
grounds of both contiguity and rein- 
forcement was that prevailing in Group 
1 for the eyeblink and that only under 
these conditions did the performance 
curve rise throughout its course. 


SUMMARY 


A test was made of the hypothesis that 
CS-UCR contiguity is an important factor 
in the initial stages of classical aversive 
conditioning. The short-latency eyeblink 
and long-latency GSR were conditioned 
simultaneously in each S, with tone as CS 
and air puff as UCS. A forward CS-UCS 
interval of 400 msec. and a backward UCS-CS 
interval of 1200 msec. were used in separate 
groups to favor contiguity of CS with eye- 
blink and GSR, respectively. The acquisition 
of the eyeblink was superior with the forward 
interval and inferior with the backward 
interval when compared with that of the GSR. 
Within-response comparisons showed that the 
eyeblink conditioned better with the forward 
interval whereas the GSR conditioned better 
with the backward interval. To test the effect 
of UCR-CS contiguity in the case of the GSR 
a third group was trained with a backward 
UCS-CS interval of 2800 msec.; this condition
produced an initial rise in performance
which was not sustained. The results of the 
experiment were interpreted as supporting 
the hypothesis. 


REFERENCES 


Champion, R. A., & Jones, J. E. Forward,
backward, and pseudoconditioning of the
GSR. J. exp. Psychol., 1961, 62, 58-62.

McNemar, Q. Psychological statistics. (2nd
ed.) New York: Wiley, 1955.

Mason, J. E. The joint action of contiguity
and reinforcement in classical conditioning.
Unpublished doctoral dissertation, University
of Sydney, 1959.

Siegel, S. Nonparametric statistics. New
York: McGraw-Hill, 1956.

Wolfle, H. M. Conditioning as a function
of the interval between the conditioned
and the original stimulus. J. gen. Psychol.,
1932, 7, 80-103.


(Received May 27, 1961) 


Journal of Experimental Psychology
1962, Vol. 64, No. 1, 4


ASSOCIATIONS, SETS, AND THE SOLUTION OF
WORD PROBLEMS¹


MIRIAM A. SAFREN²
Johns Hopkins University 


Investigators have used word prob- 
lems (anagrams and skeleton words) 
to explore the “category set,” which
can be defined as a readiness to re- 
spond to words belonging to a common 
category or class, and can be measured
by the speed or frequency³ with
which certain responses occur. For 
example, Starch (1911) and others 
demonstrated the existence of the 
category set when Ss solved problems 
made from lists of words belonging 
to a common class more rapidly than 
problems made from random word 
lists. Later, Rees and Israel (1935) 
and others measured the strength of 
the category set by the frequency 
with which Ss solved multiple solution 
problems as category words after 
practice on unique solution problems 
which could be solved only as words 
of the selected category. Past Es 
reported that telling Ss the category
name did not always have a consistent
effect; they are more certain that
the set arises from verbal context
during problem solving. Unfortunately,
the past Es selected the category
words and category name on an
a priori basis, and none of them
systematically investigated the verbal
context triggering the set. The
present study aims to show that
associative strength between words from
which problems are made is a pertinent
variable to manipulate in the
operation of the category set in
anagram solving.

¹ This study is based upon a dissertation
submitted to the Johns Hopkins University
in partial fulfillment of the requirements for
the PhD degree, and was supported in part
by Public Health Service Fellowship Number
MF 13,450 from the National Institute of
Mental Health. The author expresses her
thanks to James E. Deese for his guidance
throughout the study.

² Personnel Research Psychologist, Bureau
of Old-Age and Survivors Insurance, Social
Security Administration, Department of
Health, Education, and Welfare.

³ The probability of occurrence of response
can be considered the definition of set. The
probability of occurrence of a response is then
inferred from the time it takes the response
to occur or the frequency with which it occurs.


This approach stems from work in free 
recall showing that associative strength 
between words presented to Ss signifi- 
cantly determines Ss’ verbal productions. 
Additional support for this approach is 
given by Mayzner and Tresselt (1958) 
who suggest that processes in anagram 
solving may be described by laws similar 
to those in verbal recall. 

Especially relevant to the problem 
investigated here is a study by Deese 
(1959) who investigated the effect on 
recall of presenting Ss with lists of words 
all of which are associated, with the 
average strength of association among 
the group expressed in an index he called 
interitem associative strength. He found 
that in “organized” lists of words high 
on this index, Ss recalled more and had 
fewer verbal intrusions. Postulating 
that free recall is in part free associating, 
Deese explained that Ss recalled more 
with the organized lists because there 
was a greater probability of Ss’ associa- 
tions actually being listed items. 


The present study extends Deese’s 
model of free recall to anagram solving 
and hypothesizes that the category 
set in anagram solving operates by 
associations between words selected 
by Es as belonging to a given cate- 
gory. On the basis of this hypothesis, 


TABLE 1

THE SIX ORGANIZED LISTS USED IN THE EXPERIMENT

          List 1                               List 2                               List 3
Word      Anagram   Labels             Word      Anagram   Labels             Word      Anagram   Labels
                    (Cond. O-L)                            (Cond. O-L)                            (Cond. O-L)
COMMAND   CANDOMM                      WHISTLE   WHELIST                      CHAIR     RAICH
ORDER     REDOR     Military           TRAIN     TARNI     Railroad           SOFT      FOST      Comfort
ARMY      RAMY      Armed Forces       NOISE     NESOI     Station            SOFA      OFAS      Relaxation
OBEY      OYEB      Discipline         SOUND     UNDOS     Sounds             CUSHION   CUNSHIO   Furniture
SOLDIER   SODLERI                      SHRILL    SHLIRL    Noises             PILLOW    WOLPIL
NAVY      VANY                         LOUD      OLDU                         COUCH     COCHU

          List 4                               List 5                               List 6
MILK      LIMK                         DOCTOR    DOROCT                       SQUARE    SAQURE
CREAM     RECAM     Beverages          NURSE     URNSE     Hospital           CIRCLE    CIRLEC    Geometry
SUGAR     USGRA     Breakfast          HEALTH    EHALHT    Physical Health    ROUND     DUNRO     Geometric
COFFEE    EFECOF    Food               SICK      KISC      Medical Care       CUBE      BECU      Figures
SWEET     SEWTE     Taste              MEDICINE  MEDCIENI                     BLOCK     CLOBK     Shapes
DRINK     DINRK                        CURE      ECRU                         BALL      LALB


the following predictions were made: 
(a) Anagrams made from organized 
lists of associatively related words 
will be solved more quickly than 
anagrams made from random word 
lists, because in the former, associa- 
tions called up by solved problems 
should aid Ss in the solution of sub- 
sequent ones. (b) There should be a 
rapid decrease in solution time with 
trials when Ss solve anagrams made 
from associatively related words as 
associations called up by solved 
problems begin to aid Ss in the solu- 
tion of subsequent ones. 


METHOD 


The study consisted of two parts: the 
selection of stimulus materials and the 
experiment itself.

The experiment.—The experiment consisted 
of three major conditions. In Cond. O and 
Cond. O-L groups of Ss were presented with 
anagrams made from “organized” lists of 
words (words that elicit each other in free 
association). In addition, Ss assigned to 
Cond. O-L were told labels describing the list
of words from which the anagrams were made. 
These labels were selected on the basis of 
normative data gathered prior to the experi- 
ment. The labels were associates to the words 
of the list and in turn elicited them. In Cond. 
R, Ss were given anagrams made from “ran- 
dom” lists of words (words with essentially 


zero probability of eliciting each other in 
free association). The experimental design 
was such that the same sample of words in 
the same anagram form was used for all 
experimental conditions. Thus, as many
factors as possible were constant for all 
conditions and whatever effects might occur 
could be attributed to the influence of 
associative context. 

Stimulus materials.—The stimulus materi- 
als consisted of 36 anagrams. The 36 words 
from which the anagrams were made came 
from 6 organized lists of 6 words each. These 
lists were constructed after administering 
several word association tests consisting of 
200 items including filler items to groups of 
from 50 to 100 Johns Hopkins University 
undergraduates. The instructions given to 
Ss in this part of the study were standard 
word association test instructions. 

The following governed the selection of 
the six words of each organized list: (a) the 
six words should have maximum interitem 
associative strength, i.e., the words should 
have a maximum probability of eliciting each 
other in free association, and (b) they should 
be words from which only unique⁴ solution
anagrams could be made. The six lists of six 
words each are shown in Table 1. 
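
The second selection criterion, that each word yield a unique-solution anagram, lends itself to a mechanical check. The sketch below verifies that each scrambled form in List 1 of Table 1 uses exactly the letters of its word and then searches a vocabulary for alternative solutions. The vocabulary here is a tiny hypothetical one chosen for illustration; a real check would need a full word list, and the function names are not from the study.

```python
def is_anagram(word, scrambled):
    """True when the scrambled form uses exactly the letters of the word."""
    return sorted(word.upper()) == sorted(scrambled.upper())

def solutions(scrambled, vocabulary):
    """All vocabulary words that can be formed from the scrambled letters."""
    return sorted(w for w in vocabulary if is_anagram(w, scrambled))

# Word and anagram pairs of List 1, as given in Table 1.
LIST_1 = [
    ("COMMAND", "CANDOMM"), ("ORDER", "REDOR"), ("ARMY", "RAMY"),
    ("OBEY", "OYEB"), ("SOLDIER", "SODLERI"), ("NAVY", "VANY"),
]

# Hypothetical miniature vocabulary, for illustration only.
VOCABULARY = {"SOFA", "OAFS", "CIRCLE", "CLERIC", "ARMY", "NAVY"}

all_valid = all(is_anagram(word, anagram) for word, anagram in LIST_1)
sofa_solutions = solutions("OFAS", VOCABULARY)
navy_solutions = solutions("VANY", VOCABULARY)
print(all_valid, sofa_solutions, navy_solutions)
```

A scrambled form with more than one entry in its solution list, such as OFAS (SOFA or OAFS), does not meet the uniqueness criterion in the strict sense.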

Labels describing the organized lists
(Cond. O-L) were obtained as follows: A
group of 60 Ss, Johns Hopkins University
undergraduates, were given a 6-page booklet
containing the six organized lists, one per
page. The Ss were told to look at each list
of words and to think of a label or labels that
described them or the way in which they were
similar, and to note this in the space provided.
The various labels and the frequency
with which they were given were tabulated.
The labels selected for each list were those
which included 50% or more of the responses.
The labels are shown in Table 1.

As the labels of each list were associates
to the words of that list, further normative
data were obtained to find out if the labels
would elicit the words of the list. A sample
of 40 Ss, Johns Hopkins University undergraduates,
were given a 6-page booklet containing
the six sets of labels, one per page.
The Ss were told to look at the labels on each
page and then to write down the words which
the labels called to mind, but to spend no
longer than 1 min. associating to each set of
labels. Table 2 shows the average number of
words of each organized list which the labels
of that list elicited. The mean number of
words called up by these labels in 1 min. was
10 (SD = 6.32).

⁴ Two of these words do not form unique
solution anagrams. These are SOFA which
could be structured as OAFS and CIRCLE which
could also be CLERIC. In the anagram form
in which they were presented, none of these
solutions occurred.


TABLE 2

OVER-ALL MEDIAN SOLUTION TIMES (IN SEC.) FOR SIX ORGANIZED LISTS OF
DIFFERING INTERITEM ASSOCIATIVE STRENGTHS WITH AND
WITHOUT LABELS

Interitem      No. Words Labels Elicit    Median Time    Median Time
Associative    (Variance)                 (No Labels)    (with Labels)
Strength

[illegible]    1.32                       13.65          6.60
[illegible]     .96                        7.65          3.85
[illegible]     .90                       10.85          2.90
[illegible]    1.06                        3.25          1.25
[illegible]    1.69                       16.45          2.10
[illegible]    1.18                        3.25          1.25

Note.—Median time scores for each list are based upon data of 6 Ss who solved six anagrams each.


TABLE 3

ANAGRAM SOLUTION TIMES (IN SEC.) FOR
THE THREE CONDITIONS

Condition     N     Median     Q      Range
R             36    12.2       6.2    2-67.8
O             36     7.4       5.6    .1-49.2
O-L           36     2.8       1.9    6-224

Note.—Time scores on which medians, Qs, and ranges
are calculated are the median solution times
of the 36 Ss tested under each condition.


For Cond. R in which Ss were presented 
with anagrams made from random lists of 
words, 36 random lists⁵ of six words each were
generated as follows: One word at a time was 
taken from each of the 6 organized lists to 
form a random list of six words. By a pre- 
arranged order, every word of each organized 
list occurred in 6 of the 36 random lists but 
never in the context of any other word of the 
same organized list. 
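
One way to satisfy these constraints can be sketched as follows. The prearranged order actually used is given only in the dissertation, so the rotation rule below is purely an assumption chosen because it meets the stated conditions; the function name is illustrative, and the words are those of Table 1.

```python
from collections import Counter

ORGANIZED = [
    ["COMMAND", "ORDER", "ARMY", "OBEY", "SOLDIER", "NAVY"],
    ["WHISTLE", "TRAIN", "NOISE", "SOUND", "SHRILL", "LOUD"],
    ["CHAIR", "SOFT", "SOFA", "CUSHION", "PILLOW", "COUCH"],
    ["MILK", "CREAM", "SUGAR", "COFFEE", "SWEET", "DRINK"],
    ["DOCTOR", "NURSE", "HEALTH", "SICK", "MEDICINE", "CURE"],
    ["SQUARE", "CIRCLE", "ROUND", "CUBE", "BLOCK", "BALL"],
]

def build_random_lists(organized):
    """Build n*n lists, each taking exactly one word from every organized list.

    Assumed rule: random list (i, j) takes word (i + k*j) mod n from
    organized list k, so every word is used exactly n times.
    """
    n = len(organized)
    random_lists = []
    for i in range(n):
        for j in range(n):
            random_lists.append(
                [words[(i + k * j) % n] for k, words in enumerate(organized)]
            )
    return random_lists

random_lists = build_random_lists(ORGANIZED)
usage = Counter(word for lst in random_lists for word in lst)
print(len(random_lists), set(usage.values()))
```

Because each random list draws one word from each organized list, no two words from the same organized list can appear together, and the 36 resulting lists are all different, so each S in Cond. R can receive a distinct list.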

In constructing the anagrams, an attempt 
was made to control for three factors that 
affect difficulty of solution (Mayzner & 
Tresselt, 1958, 1959; Sargent, 1940): (a) the 
degree of difference of the letter arrangement 
from the original order in the word, (b) the
transitional probabilities between the letters 
in the disarranged form, and (c) frequency of 
occurrence of the word in printed matter
(Thorndike & Lorge, 1944). 

Subjects.—A total of 108 Ss from the
introductory psychology class at the Johns
Hopkins University were randomly assigned,
36 per condition. For Cond. O and O-L there
were 6 lists of six anagrams each. Different
groups of 6 Ss received each of the 6 lists,
and within each group the order of occurrence 
of the six anagrams was counterbalanced over 
Ss. For Cond. R, there were 36 lists of six 
anagrams each. Each of the 36 Ss assigned 
to this condition received a different list. 


⁵ The 36 random lists are in the doctoral
dissertation on file at the library of the Johns
Hopkins University.




of paper and pencil, and then to give the 
solution orally; (d) they would have 4 min. 
to solve each problem and then would be 
given the solution; (e) they were to work as 
rapidly as possible as they were being timed 
on each problem. In the case of Cond. O-L, 
Ss were told that the anagrams were made 
from a list of words related by association, 
and given the labels that described the list. 
Solution times were recorded with a stop- 
watch, and all Ss were tested individually. 


RESULTS 


Because of the nature of the dis- 
tribution of Ss’ anagram solution 
times, medians rather than means 
were chosen to represent S's typical 
performance in the statistical analysis. 
Table 3 summarizes the data on solu- 
tion time for the three conditions 
showing that medians, Qs, and ranges 
decrease in going from Cond. R to O 
to O-L. The overall statistical significance 
of the difference among the three 
experimental conditions, using the 
Kruskal-Wallis one-way analysis of 
variance, is P < .005. 
A comparison of Cond. O with Cond. 
R by the Mann-Whitney U test 
yielded P < .02. A similar compari- 
son between Cond. O and O-L 
yielded P < .003. 
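As an illustration of the pairwise comparisons, the Mann-Whitney U statistic can be computed by counting, over all pairs of Ss from the two conditions, how often a score in one condition exceeds a score in the other. The sketch below is a minimal version of that counting definition; the solution times are hypothetical values, not the data of this experiment, and no significance level is computed.

```python
def mann_whitney_u(x, y):
    """Return (U_x, U_y) for two independent samples.

    U_x counts the pairs (a, b), a from x and b from y, in which a > b;
    tied pairs contribute one half. U_x + U_y equals len(x) * len(y).
    """
    u_x = 0.0
    for a in x:
        for b in y:
            if a > b:
                u_x += 1.0
            elif a == b:
                u_x += 0.5
    return u_x, len(x) * len(y) - u_x


# Hypothetical median solution times (sec.) for two groups of 6 Ss.
cond_o = [10.9, 7.7, 13.7, 3.3, 16.5, 5.2]
cond_ol = [6.6, 3.9, 2.9, 1.3, 2.1, 4.0]

u_o, u_ol = mann_whitney_u(cond_o, cond_ol)
u = min(u_o, u_ol)  # the smaller U is the value referred to tables
```

The smaller of the two U values is the one compared against tabled critical values for the given sample sizes.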

With a significant difference between 
Cond. R and Cond. O, a detailed 
analysis was made of the 
organized lists of differing interitem 
associative strengths. A Kruskal-Wallis 
one-way analysis of variance 
by ranks showed a significant difference 
between organized lists 
(P < .05). However, the correlation 
(Kendall's tau) between interitem 
associative strength index and median 
solution time for the organized lists 
was -.28 and nonsignificant. Table 
2 shows this information. 

Since the comparison between Cond. 
O and Cond. O-L was very significant, 
Mann-Whitney U tests were done 
comparing median solution times for 
the 6 Ss of each organized list with 


the comparable 6 Ss of the same 
organized list with labels. For Lists 
3, 5, and 6 there were significant 
differences between the two conditions 
(P = .021, .013, and .002, 
respectively). For Lists 1, 2, and 4 
the differences were not significant 
(P = .155, .242, and .155, respec- 
tively). Therefore, it may be con- 
cluded that anagram solution is 
facilitated when Ss are given ana- 
grams made from “organized” asso- 
ciatively related word lists. Anagram 
solution is facilitated still further 
when in addition, Ss are told labels 
describing the organized lists when 
these labels are (a) associates to the 
words and (b) in turn call up the 
words of the lists. 

The overall Trial effect (Ss under 
all conditions had six problems to 
solve), as tested by the Friedman 
two-way analysis of variance by ranks, 
was significant (P < .02). However, 
there appears to be an interaction 
between trials and conditions (see 
Fig. 1), for the Trial effect is nonsignificant 
for Cond. R (.50 < P < .70) 
and nonsignificant for Cond. O-L 
(.10 < P < .20). The Trial effect 
is significant for Cond. O (P < .05). 
The significant Trial effect for Cond. O 
verifies the predicted decrease in 
solution time with trials when Ss 
solve anagrams made from organized 
lists as associations called up by the 
context begin to aid Ss in the solution 
of subsequent problems. 

With a significant Trial effect for 
Cond. O, analysis was made of the 
Trial effect for each organized list. 
Only the Trial effect for List 3 was 
significant (P < .02). However, the 
sample size for each organized list 
was only 6. 

FIG. 1. Median anagram solution time per 
trial for each of the three experimental 
conditions (random lists, organized lists, and 
organized lists and labels). Ordinate: median 
time in seconds; abscissa: trials. (Each point 
is based upon the data of 36 Ss.) 

Although the six organized lists 
vary little in the mean probability 
with which the words elicit each other 
in free association, there is a significant 
difference in the variance about 
the mean (see Table 2). Bartlett's 
test for the difference between the K 
variance estimates was significant 
at P < .05. Furthermore, there seems 
to be a relationship between the interitem 
associative strength index and 
the decrease in solution time with 
trials for each organized list. Ranking 
each list by the magnitude of its 
variance and then ranking each according 
to the amount of change in 
solution time with trials results in a 
correlation (Kendall's tau) of -.60, 
P = .068. This suggests that the 
greater the variance about the interitem 
associative strength index the 
less the change in solution time with 
trials. 

A similar correlation (Kendall's 
tau) between the interitem associative 
strength of each list and the variance 
about this value is .60 (P = .068). 
Thus, it appears that organized lists 
with higher interitem associative 
strengths had a higher variance about 
this index which would tend to make 
the drop in solution time with trials less. 
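The probability attached to a tau of .60 with only six lists can be checked by enumeration. The sketch below assumes an exact one-tailed test with no tied ranks (the source does not state which form was used): it counts the proportion of the 720 possible rankings of six items whose Kendall S is at least as large as the observed one.

```python
from itertools import combinations, permutations


def kendall_s(x, y):
    """Kendall's S: concordant pairs minus discordant pairs."""
    s = 0
    for i, j in combinations(range(len(x)), 2):
        prod = (x[i] - x[j]) * (y[i] - y[j])
        s += (prod > 0) - (prod < 0)
    return s


def exact_one_tailed_p(n, s_observed):
    """Proportion of all n! rankings with S >= s_observed (no ties)."""
    base = list(range(n))
    hits = 0
    total = 0
    for perm in permutations(base):
        total += 1
        if kendall_s(base, perm) >= s_observed:
            hits += 1
    return hits / total


n = 6                          # six organized lists
pairs = n * (n - 1) // 2       # 15 pairs of lists
s_obs = round(0.60 * pairs)    # tau = S / 15, so S = 9
p = exact_one_tailed_p(n, s_obs)
print(round(p, 3))  # 0.068
```

Under these assumptions 49 of the 720 rankings reach S of 9 or more, which agrees with the reported P = .068; by symmetry the same value applies to a tau of -.60.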

Possibly, not controlling for the 
variance about the interitem associative 
strength index was one reason 
why it was not possible to show a 
positive relationship between the magnitude 
of the index and the median 
solution time for each organized list. 
It is also possible that other factors 
inhibiting this relationship were (a) 
the narrow range of interitem associative 
strengths used, and (b) the 
inability to make anagrams of all 
organized lists of equal difficulty. 


A table showing the trial effect for each 
organized list is in the doctoral dissertation. 


MIRIAM A. SAFREN 


DISCUSSION 


The results support the hypothesis 
that in the solving of word problems such 
as anagrams, the category set operates 
by associations between words selected 
by E as belonging to a common category. 
The results also suggest processes in the 
solving of word problems such as ana- 
grams may be similar to those in verbal 
recall; for in both verbal recall and 
anagram solving, associative strength 
between words significantly determines 
Ss’ verbal productions. 

A suggested model to explain the 
results is as follows: When Ss solve a list 
of anagrams, at least in part, they are 
calling up words to match the letters 
presented and sampling from a momen- 
tary response pool of available words. 
After Ss have determined some of the 
words from which the problems were 
made, these words implicitly evoke 
others which are associatively related. 
These associations become part of the 
momentary available pool of words 
from which Ss are sampling. Thus, 
implicitly evoked associations from the 
solution of preceding problems can 
facilitate the solution of subsequent pro- 
blems by converging upon words on the 
list from which the anagrams were made. 

When Ss are given anagrams made 
from associatively related words and 
also given labels for these words, these 
labels evoke at the outset a response pool 
of available words some of which con- 
verge upon solutions to the anagrams. 
Since both associations elicited by solved 
anagrams and those called up by labels 
increase the availability of anagram 
solutions, anagrams are solved most 
quickly under Cond. O-L. However, 
since from the outset the labels make 
some of the anagram solutions readily 
available, the Trial effect is not significant 
for Cond. O-L. 

Although couched in different lan- 
guage, the present model of the category 
set agrees basically with that of Maltz- 
man and Morrisett (1952). The latter 
explain the set as follows: An anagram 
belonging to a common class, e.g., 
“nature,” serves as a stimulus for the 
arousal of other words belonging to the 
selected class and in addition, a whole 
hierarchy of responses some of which 
belong to the selected class, some of 
which do not. However, since only the 
selected category responses are con- 
sistently reinforced in the experiment, 
these become dominant through medi- 
ated generalization. 

Maltzman and Morrisett do not 
explicitly describe the mediated generali- 
zation mechanism. However, in the 
present model, the Ss’ pool of associa- 
tively related words implicitly evoked 
by solved anagrams could be regarded 
as the mediated generalization mecha- 
nism. This pool of words would include 
the category name in so far as it is asso- 
ciated to the words comprising a common 
class and elicits these words. 

The present description of anagram 
solving in terms of verbal recall and free 
association may be considered an exten- 
sion of a model by Deese (1959), who 
interprets free recall as involving in part 
free association. Thus, the present 
study extends the range of tasks over which 
associative relationships between words 
play an important role. 

The present research also implies that 
in future studies on the category set, 
stimulus materials should be calibrated 
by existing associative relationships for 
more precise predictions on the rate of 
problem solving. For it is the contention 
of this study that associative strength 
between words may be considered a 
measure of existing response sets which 
when activated, facilitate solutions of 
word problems such as anagrams. 


SUMMARY 


This experiment tested the hypothesis 
that the category set in anagram solution 
operates by associative strength between 


words selected by E as belonging to some 
common class. In Cond. O and O-L, groups 
of Ss were given anagrams made from 
“organized” lists of words that elicited each 
other in free association. In Cond. O-L, Ss 
were also told labels describing the lists. The 
labels were associates to the words on the 
list from which the anagrams were made and 
in turn elicited these words. In Cond. R, Ss 
were given anagrams made from “random” 
word lists (words with essentially zero prob- 
ability of eliciting each other in free associa- 
tion). 

Time to solution was evidence for the 
operation of the set. It was predicted that 
(a) anagrams made from organized lists 
would be solved more quickly than anagrams 
made from random lists; for in the former, 
associations called up by the context would 
aid in problem solution; (b) there would be a 
significant Trial effect (a decrease in solution 
time with trials) for the organized condition 
as associations called up by context began to 
aid Ss in the solution of subsequent problems. 
The results verified these predictions and 
supported the present interpretation of the 
category set. A description of anagram 
solving as involving verbal recall and free as- 
sociation was advanced to explain the results. 


REFERENCES 


DEESE, J. Influence of inter-item associative 
strength upon immediate recall. Psychol. 
Rep., 1959, 5, 305-312. 

MALTZMAN, I., & MORRISETT, L., JR. Different 
strengths of set in the solution of 
anagrams. J. exp. Psychol., 1952, 44, 
242-246. 

MAYZNER, M. S., & TRESSELT, M. E. Anagram 
solution times: A function of letter 
order and word frequency. J. exp. Psychol., 
1958, 56, 376-379. 

MAYZNER, M. S., & TRESSELT, M. E. Anagram 
solution times: A function of transition 
probabilities. J. Psychol., 1959, 47, 
117-125. 

REES, H. J., & ISRAEL, H. E. An investigation 
of the establishment of mental sets. 
Psychol. Monogr., 1935, 46(6, Whole No. 
210). 

SARGENT, S. S. Thinking processes at various 
levels of difficulty. Arch. Psychol., N. Y., 
1940, 35(Whole No. 249). 

STARCH, D. Experiments in educational 
psychology. New York: Macmillan, 1911. 

THORNDIKE, E. L., & LORGE, I. The teacher's 
word book of 30,000 words. New York: 
Teachers College, Columbia University, 
1944. 

(Received May 27, 1961) 


Journal of Experimental Psychology 
1962, Vol. 64, No. 1, 46-51 


SEQUENTIAL INTERFERENCES DEMONSTRATED BY 
SERIAL RECONSTRUCTIONS ¹ 


E. B. COLEMAN 


Human Resources Research Office 


The findings of experiments on 
retroactive and proactive inhibition 
have been generalized to suggest that 
forgetting that occurs outside the 
laboratory is caused by interference 
similar to that found in the traditional 
laboratory experiment. Recently Un- 
derwood and Postman (1960) noted 
that “As plausible as this argument 
may seem, it is apparent that it would 
be highly desirable to specify possible 
sources of extraexperimental inter- 
ference, and to specify these in such 
a way that experimental tests of their 
influence on forgetting are possible” 
(p. 74). They suggested two such 
interferences: “‘letter-sequence inter- 
ferences” and “unit-sequence inter- 
ferences.” 

Letter-sequence interferences result 
when a list presented for learning 
conflicts with English spelling habits. 
Suppose S is presented with a consonant 
syllable JQB. Underwood 
and Postman assume that previously 
learned spelling habits will 
make this sequence hard to learn 
because Q never follows J; nor does B 
ever follow Q in English spelling. 
These previously learned spelling 
habits must be “extinguished” or 
“unlearned” or “inhibited” before JQB 
can be learned. Following Briggs 
(1954), they also assume that as time 
passes, the older sequential habits 
will spontaneously recover and interfere 
with JQB during a test for retention 
(proactive inhibition). The older 
spelling sequences may be used during 
the delay period also (common sequences 
being used more frequently 
than uncommon ones). This will 
further strengthen them and further 
interfere with JQB (retroactive inhibition). 
These assumptions suggest 
that as time passes, S may replace 
JQB with a more common sequence, 
perhaps JOB. 


¹ Based on a dissertation submitted to the 
Faculty of Philosophy of the Johns Hopkins 
University in partial fulfillment of the requirements 
for the degree of Doctor of Philosophy. 
The guidance of James Deese is most gratefully 
acknowledged. This research was 
supported in part by a National Science 
Foundation grant (30125) and in part 
by a Public Health Service Research 
Fellowship (MF-11, 727). 

The “unit” in unit-sequence inter- 
ferences is usually the word, although 
it could be any sequence of letters 
presented as an independent unit. 
Word-sequence interferences result 
when a sequence presented for learn- 
ing conflicts with syntactic habits. 
Suppose a list presented to S contains 
the sequence STAND DID LINGERIE 
THEY IN. Assumptions analogous to 
those given in the preceding para- 
graph predict that as time passes, S 
may replace such a sequence with a 
more familiar one—perhaps THEY DID 
STAND IN LINGERIE. 

To collect evidence for these two 
sources of interference, Underwood 
and Postman used serial anticipation 
learning and measured retention after 
1 week for four kinds of lists: common 
and uncommon words, and common 
and uncommon trigrams. As time 
passed, errors due to remembering an 
item in an incorrect order increased 
faster for word lists than for trigram 
lists. In a similar experiment, Postman 
(1961) also found more such 
intralist errors in recalling a list of 
common nouns than in recalling 
uncommon nouns. Both findings may 
be interpreted as evidence for word- 
sequence interferences. Underwood and 
Postman (1960) also found that as 
time passed, letter-sequence errors 
increased faster in the uncommon 
trigram lists than in the common 
trigram lists. This may be inter- 
preted as evidence for letter-sequence 
interferences. 

On the other hand, when the cri- 
terion is number of items recalled 
correctly, neither of the interferences 
has been revealed unless the analysis 
was restricted to the middle serial 
positions—to items that had not been 
overlearned. Thus evidence for the 
effect of these two sources of extra- 
experimental interference is not as 
strong as one would wish. 

The most direct and convincing 
evidence for sequential interferences 
would be to demonstrate that uncom- 
mon sequences are replaced by more 
common ones during recall. This 
investigation will attempt to demon- 
strate this by a variation of serial 
reproductions which might be called 
serial reconstructions. 


PROCEDURE 


The Ss were 27 introductory psychology 
students from the Johns Hopkins University. 

Experiment I.—The first S was presented 
with a list of letters and asked to memorize 
its order. Then he was given all the letters— 
each one typed on a separate card—and was 
asked to arrange the cards in the correct order. 
His ordering was typed and presented to the 
second S to memorize and to arrange in order, 
and this second ordering was typed and pre- 
sented to the third S, and so on. 
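The chaining of Ss in serial reconstruction can be sketched as a simple loop in which each S's ordering becomes the list presented to the next S. In the sketch below, `run_chain` follows the procedure as described, whereas `toy_subject` is a purely hypothetical stand-in for a human S (it moves one adjacent pair toward an assumed "habitual" order on each pass) and is not a model proposed in the source.

```python
def run_chain(original, reconstruct, n_subjects):
    """Return the original list followed by each successive S's ordering.

    Every S receives the previous S's ordering and returns a
    rearrangement of the same items, so no element is ever lost.
    """
    orderings = [list(original)]
    for _ in range(n_subjects):
        orderings.append(reconstruct(orderings[-1]))
    return orderings


def toy_subject(presented, habit_rank):
    """Hypothetical S: swaps the first adjacent pair that violates
    the assumed habitual order, leaving everything else as presented."""
    items = list(presented)
    for i in range(len(items) - 1):
        if habit_rank[items[i]] > habit_rank[items[i + 1]]:
            items[i], items[i + 1] = items[i + 1], items[i]
            break
    return items


# Assumed habitual order THEY DID STAND, for illustration only.
habit_rank = {"THEY": 0, "DID": 1, "STAND": 2}
chain = run_chain(
    ["STAND", "DID", "THEY"],
    lambda presented: toy_subject(presented, habit_rank),
    n_subjects=4,
)
```

Because each S only rearranges the cards he is handed, every ordering in the chain contains exactly the original elements, which is the property that distinguishes reconstruction from serial reproduction.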

Eleven successive Ss were instructed to 
memorize and reconstruct the order of six 
lists of scrambled letters. The Ss were tested 
individually, one after the other, each one 
trying to memorize the six orderings of the 
preceding S. The experiment was discontinued 
after the eleventh ordering because 
the orderings appeared to have reached a limit 
of no further approach toward English. 

Six sentences ranging from 25 to 33 letters 
in length (counting spaces as letters) were 
selected from a second grade reader, and the 
individual letters in each sentence were 
scrambled. For instance, HE SAT DOWN might 
yield SHT EAW NDO. 

The first S was given 20 sec. to study each 
list (by complete presentation). Then he was 
given all the original letters of the list—each 
letter typed on a separate card—and was 
asked to arrange them in the correct order. 
Before he began to arrange them, he was 
required to write down all he could recall. 
Because arranging the cards sometimes took 
as long as 10 min., S was allowed to refresh 
his memory by referring to these notes while 
he arranged the cards. The notes were 
discarded, but his ordering of the letters was 
typed and presented to the second S to 
memorize. The second S was given the same 
cards (after they had been shuffled) and asked 
to arrange them in order, and this second 
ordering was typed and presented to the 
third S, and so on. 

With each new ordering, the list gradually 
approached the order of English spelling, so 
the time given to study a list was gradually 
reduced. As far as possible, E adjusted the 
time so that S made from three to six errors 
when ordering a list. By the eleventh order- 
ing, the time had been reduced to 12 sec. for a 
list. 

To reduce the serial position effect, the Ss 
were told: “Most people remember the first 
and last portions of a list best. Since you 
must arrange all the cards, you will do better 
if you spend proportionately more of your 
time memorizing the middle of the list.” 

Experiment II.—For word sequences, Exp. 
I was repeated with the following changes: 
(a) For materials, six sentences ranging from 
18 to 24 words in length were selected from 
“Storyville Days and Nights” by Louis Arm- 
strong (1954) and the words in each sentence 
were scrambled. (b) The first S was given 35 
sec. to memorize a list, and by the final order- 
ing this had been reduced to 13 sec. (c) Six- 
teen Ss (or 16 orderings) were needed before 
the lists reached a limit of no further ap- 
proach toward English. 


RESULTS 


Letter-sequence interferences.—Experiment 
I clearly showed that Ss 
replaced uncommon letter sequences 
with more common ones. Although 
they were asked to reconstruct the 
presented order exactly, with the first 
four or five reconstructions at least, 
the lists came closer and closer to 
English spelling. 


TABLE 1 
ORIGINAL LIST AND ELEVENTH ORDERING FOR FIVE LISTS OF SCRAMBLED LETTERS 

     Original List                          Eleventh Ordering 

(a)  NEHSMFECR DI VARWER HOTIHTHF ER        MERCHIV REAHER SFDT THON HFWI 
(b)  HNEWTWY NHETO NT HTKN AOSNDO           SWYTHE TKNOT ANDNO WEHN TOHN 
(c)  WFD ALHOD HONIO NOOEK WDE ETTU         ALH KEETE WOUD HUOTN DOWD FOOI 
(d)  TEFTLH EAGN HGRE PULA EHT              TEATHELD TUG NAHL EFH ERG 
(e)  EHH NC TEHEHM IRATDL IRWCHDE           CHH CHEEME HUDWAR DIRLT HEIT 

Here are several orderings for a 
list selected at random: 


Original    IE TYOUW LHRHES MNHKWEIH CNE ETAR 
First       AI THYHLE MUHK WEIO EHRRWN ETCESN 
Second      AI HYTWE WENKO HICH ERRHN MELTESN 
Sixth       AI WEKNO HICH NERYHUE SWERT METHL 
Eleventh    OWI HWE HEUI ACKRE NTHSRN MEYTHEL 


The original and the eleventh 
orderings for the other five lists are 
given in Table 1. 

FIG. 1. Mean trigram frequency for six 
lists of letters arranged in order of successive 
orderings by Ss. Ordinate: mean trigram 
occurrence per 141,000 words; abscissa: 
successive orderings (and successive subjects), 
from the original through the eleventh. 


It may not be obvious that the final 
orderings in Table 1 resemble English 
more than the originals, but Fig. 1 
shows quantitatively that Ss did 
replace infrequent sequences with 
ones that more closely resembled 
English spelling. In Fig. 1, mean 
frequency per trigram for all six lists 
is plotted against successive orderings. 
For every list, all the three-letter 
sequences were written down. Spaces 
were not considered characters. (For 
instance, IE TYOUW LHRH yields TYO, 
YOU, OUW, LHR, HRH.) The frequency 
with which each trigram occurred in 
141,000² English words was determined 
(Underwood & Schulz, 1960, 
Total Count); and mean frequency 
was plotted against successive order- 
ings. For the first four or five order- 
ings at least, it is apparent that Ss 
replaced presented sequences with 
sequences that more closely resembled 
English spelling. Tau between fre- 
quency and ordering is .36 which is 
significant beyond the .05 level. 
Note that 6 of the 11 Ss were tested at 
the horizontal part of the curve; 
therefore this tau is a very conservative 
estimate of the correlation between 
frequency and ordering.³ 


² Underwood and Schulz (1960, p. 75) 
state that their Total Count represents 
frequencies of occurrence in 1,035,000 words. 
However, B. Underwood (personal 
communication) has indicated that this is in error. 
In Appendix D, the Thorndike-Lorge (1944) 
count was based on a sample of 2,080 of the 
19,440 words in The Teacher's Word Book 
of 30,000 Words, and so represents a base 
closer to 106,000 than 1,000,000 words. Thus, 
the Underwood and Schulz Total Count 
represents frequencies of occurrence in 
approximately 141,000 words. 


TABLE 2 
ORIGINAL LIST AND SIXTEENTH ORDERING FOR FIVE LISTS OF SCRAMBLED SENTENCES 

(a) Original List: ABOUT WAS GOOD-LOOKING WAY AND 
    TREATING MADE OF THAT A HIM THE 
    QUIET YOUNGSTER NICE HE MANNERS A 
    THEM GIRLS WILD GO WITH 
    Sixteenth Ordering: HE WAS A YOUNGSTER NICE QUIET WITH 
    MANNERS GOOD-LOOKING AND A WAY OF 
    TREATING THEM THAT MADE THE GIRLS GO 
    WILD ABOUT HIM 

(b) Original List: AS BE CHILDHOOD TO LIVED FROM FRIENDS 
    BEEN CONTINUE REAL HAD TRUE AS WE 
    LONG WE AND WOULD WE 
    Sixteenth Ordering: WE LIVED AS FRIENDS LONG TRUE FROM 
    CHILDHOOD AND WE WOULD BE REAL TO 
    CONTINUE AS WE HAD BEEN 

(c) Original List: I BACK AND LONG AND SITTING ALL WORK 
    DAY MULE MY MY IT SHOVELING GET 
    BEHIND TO AWFUL USED PAINS COAL IN 
    HARD WAS 
    Sixteenth Ordering: I AND MY ALL DAY LONG SHOVELING COAL IT 
    WAS HARD AWFUL WORK AND SITTING 
    BEHIND MULE USED TO GET PAINS IN MY 
    BACK 

(d) Original List: GOOD WE BIG AND ME A COOKED IRENE 
    GUMBO SEE OF HIM POT FOR HE AND CAME 
    TO 
    Sixteenth Ordering: GOOD CAME ME AND FOR HIM IRENE COOKED 
    A BIG POT OF GUMBO TO SEE HE AND WE 

(e) Original List: THEM HEART I WORLD OF MY FROM THE 
    TO THE BOTTOM BE REPLACE WILL ABLE 
    NEVER THAT SAY AND 
    Sixteenth Ordering: NEVER SAY THAT WORLD AND BE MY HEART 
    ABLE TO REPLACE THE I WILL FROM THE 
    BOTTOM OF THEM 

Figure 1 shows that after the fifth 
ordering—after the sequences reached 
a mean occurrence of about 130 
occurrences per 141,000 words— 
there was little if any further approach 
toward English spelling. Although 
the orderings changed somewhat, they 
remained at about the same level of 
nonsense. This suggests that the Ss 
deliberately tried to reproduce the 
same level of nonsense even when they 
had forgotten the exact sequence. 
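The trigram scoring behind Fig. 1 can be made concrete with a short sketch. It follows the worked example in the text, in which trigrams are taken within letter groups and never span a space. The frequency table is a hypothetical stand-in for the Underwood and Schulz Total Count, and treating unlisted trigrams as zero is an assumption.

```python
def within_word_trigrams(ordering):
    """List every three-letter sequence inside each letter group.

    Groups shorter than three letters contribute nothing, and no
    trigram crosses a space.
    """
    trigrams = []
    for group in ordering.split():
        trigrams.extend(group[i:i + 3] for i in range(len(group) - 2))
    return trigrams


def mean_trigram_frequency(ordering, freq_table):
    """Mean tabled frequency per trigram (unlisted trigrams count 0)."""
    trigrams = within_word_trigrams(ordering)
    if not trigrams:
        return 0.0
    return sum(freq_table.get(t, 0) for t in trigrams) / len(trigrams)


print(within_word_trigrams("IE TYOUW LHRH"))
# ['TYO', 'YOU', 'OUW', 'LHR', 'HRH']

# Hypothetical counts, for illustration only.
hypothetical_counts = {"YOU": 100, "TYO": 2}
score = mean_trigram_frequency("IE TYOUW LHRH", hypothetical_counts)
```

Scoring each successive ordering of a list in this way and averaging over the six lists would give one point per ordering on a curve like that of Fig. 1.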

Word-sequence interferences.—Experiment 
II replicated the above results 
for word sequences: the Ss 
replaced uncommon word sequences 
with more common ones. With each 
new reconstruction, the arrangement 
came closer and closer to a sensible 
English sentence. 

³ These reconstructions were also scored 
as to bigram frequency per 121,000 words. 
The curve was almost identical to Fig. 1. It 
reached a peak at the fourth reconstruction, 
fell slightly, and began to rise again at the 
ninth reconstruction. 


Several orderings for a list selected 
for its inherent interest are given: 


(Original) STAND DID LINGERIE THEY IN 
STORYVILLE THE DID AS NOT NEIGHBORHOOD 
FINE IN DOORWAYS SILK THEIR WEARING 
GIRLS OUR IN. 

(First) STAND THEY DID IN 
STORYVILLE LINGERIE FINE THEIR THE DID 
NOT IN SILK NEIGHBORHOOD AS DOORWAYS 
WEARING GIRLS OUR IN. 

(Second) STAND 
DID THEY STORYVILLE LINGERIE FINE THEIR 
IN SILK NEIGHBORHOOD OUR THE NOT GIRLS 
DID WEARING IN DOORWAYS IN AS. 

(Fourth) 
FINE DID THEY STAND IN STORYVILLE 
LINGERIE IN NEIGHBORHOOD SILK OUR NOT 
GIRLS THE WEARING DID IN DOORWAYS AS 
THEIR. 

(Seventh) AS THEY DID STAND 
STORYVILLE LINGERIE IN SILK NEIGHBORHOOD 
IN OUR DOORWAYS THEIR NOT WEARING 
GIRLS FINE DID IN THE. 

(Eleventh) 
AS THEY DID NOT STAND STORYVILLE 
LINGERIE IN THEIR FINE SILK DOORWAYS 
IN OUR NEIGHBORHOOD DID THE GIRLS 
WEARING IN. 

(Sixteenth) THEY DID NOT 
STAND IN THEIR FINE SILK STORYVILLE 
LINGERIE IN THE DOORWAYS WEARING AS 
OUR GIRLS DID IN NEIGHBORHOOD. 


FIG. 2. Mean ranking by judges for six 
scrambled sentences arranged in order of 
successive orderings by Ss. Abscissa: 
successive orderings (and successive subjects). 


In Table 2, the original and the 
sixteenth orderings are given for the 
other five lists. Even if it is not clear 
from the examples that the sixteenth 
orderings resemble English more than 
the originals, Fig. 2 shows quantitatively 
that Ss did replace unfamiliar 
word sequences with more familiar 
ones. In Fig. 2 successive orderings 
are plotted against mean rank as to 
sensibleness. For each scrambled 
sentence, the original and eight even 
orderings were typed on individual 
cards, shuffled, and 10 judges⁴ were 
asked to rank them as to their 
approximation to English. Tau between 
ordering and mean rank was 
.89, which is significant beyond .0005. 
Figure 2 suggests that the orderings 
reached a limit of no further approach 
toward English somewhat before the 
sixteenth ordering, and the examples 
show that the sixteenth ordering is 
still far from a sensible English 
sentence. 


⁴ The judges were all graduate students 
familiar with approximations to English. 
Six coefficients of concordance were computed, 
one for their orderings of each sentence. 
Mean coefficient of concordance was 
.88, significant beyond the .001 level. Clearly 
the judges agreed among themselves. 


DISCUSSION 


The fact that Ss replaced uncommon 
sequences with more common ones is a 
rather direct demonstration of the 
extraexperimental interference from language 
habits, and further discussion would add 
little of value; however, three procedural 
details are worth examining. They do 
not raise serious doubts about the main 
conclusions, but they may influence the 
shape of the functions in Fig. 1 and 2. 
First we should examine the conse- 
quences of E's attempt to hold errors 
constant by progressively reducing study 
time as the lists became progressively 
more simple. For the later Ss, this con- 
stant-error criterion forced errors on 
sequences that were almost correct 
English. Thus it raised the asymptote 
and reduced the negative acceleration 
somewhat. The more usual constant 
time limit produces a sharper negative 
acceleration: as soon as the list comes 
close to English, Ss are able to memorize 
it completely and the curve levels off. 
Second, each S was exposed to and 
tested upon six successive lists totaling 
over 130 successive items. On the later 
lists, he was clearly subject to consider- 
able proactive interference from the 
earlier sequences and from his own re- 
sponses on earlier tests. Proactive inter- 
ference from S's earlier lists should have 
no effect at all if we assume that there 
are not extraexperimental interferences 
from language habits. Interference from 
lists would cause him to replace one 
trigram (for instance) with another from 
an earlier list. Some replacements would 
be of more common occurrence in 
English, some of less common occurrence. 
If the replacements were consistently 
of more common occurrence, then this 
would argue for extraexperimental inter- 
ference from language habits. Proactive 
interference from earlier tests would 
lead S to replace a trigram with a trigram 
from his own earlier reconstructions. 
This would raise mean trigram fre- 
quency because the mean trigram fre- 
quency of the reconstructions tended to 
be higher than the lists. Clearly this is a 
second-stage effect of extraexperimental 
interference from language habits. 
Third, it should be emphasized that 
the test was not pure reconstruction: 
it was reconstruction confounded with 
recall because Ss were required to jot 
down their recall of the list and were 
allowed to consult these notes during 
reconstruction. The reconstruction technique 
was simply a stratagem to prevent 
the loss of elements that occurs in the 
usual serial reproduction technique. 

In addition to the three procedural 
points, a marginal finding is worth 
examining—the finding that long before 
the reconstructions became sensible Eng- 
lish, they reached a limit of no further 
approach toward English. After this 
point, although the orderings changed 
somewhat, they remained at about the 
same level of nonsense. Apparently Ss 
were deliberately trying to reproduce the 
same level of nonsense. 

In fact, several Ss reconstructing 
scrambled letters commented that when 
they were unsure of a sequence, they would 
be certain that it was not a real word. 
For if it had been a real word, they said, 
they were sure they would have remembered 
it. Therefore in such a situation, 
they would purposely arrange the letters 
so that they did not spell a word. 

To check the fact of the asymptote, 
two lists of each kind were reconstructed 
10 more times by another group of Ss. 
The letter lists dropped from a mean of 
128 occurrences per 141,000 words at the 
eleventh reconstruction to 124 at the 
twenty-first. In short, the fourth 
reconstruction resembled English spelling 
about as closely as the twenty-first 
reconstruction. 


SUMMARY 


Two extraexperimental sources of interference 
were examined: (a) letter-sequence 
interferences, which result when a list 
presented [...] Each succeeding S tried to 
reconstruct the order given by the preceding 
S. With each successive reconstruction, the 
scrambled lists came closer and closer to 
sensible English. However, the orderings 
reached an asymptote after which there was 
no further approach toward English. 


REFERENCES 


ARMSTRONG, L. Storyville days and nights. 
1a, Tien Sar w Bieg. New York: Mentor, 
1954. 


BRIGGS, G. E. Acquisition, extinction, and 
recovery functions in retroactive inhibition. 
J. exp. Psychol., 1954, 47, 285-293. 

POSTMAN, L. Extra-experimental interference 
and the retention of words. J. exp. 
Psychol., 1961, 61, 97-110. 

THORNDIKE, E. L., & LORGE, I. The teacher's 
word book of 30,000 words. New York: 
Teachers College, Columbia University, 
1944. 

UNDERWOOD, B. J., & POSTMAN, L. Extra-experimental 
sources of interference in 
forgetting. Psychol. Rev., 1960, 67, 73-95. 

UNDERWOOD, B. J., & SCHULZ, R. W. Meaningfulness 
and verbal learning. Chicago: 
Lippincott, 1960. 


(Received May 29, 1961) 


Journal of Experimental Psychology 
1962, Vol. 64, No. 1, 52-57 


THE EFFECT OF ORDER OF APPROXIMATION TO THE 
STATISTICAL STRUCTURE OF ENGLISH ON THE 
EMISSION OF VERBAL RESPONSES ¹ 
KURT SALZINGER, STEPHANIE PORTNOY, AND RICHARD S. FELDMAN 


Biometrics Research, New York State Department of Mental Hygiene and Columbia University 


The variables controlling the emis- 
sion of verbal behavior may be 
roughly divided into response-pro- 
duced stimuli (the speaker’s verbal 
responses surrounding the “controlled” 
response) and external stimuli (audi- 
ence, reinforcements, etc.). The pres- 
ent study deals with the effect of 
response-produced stimuli as specified 
by Miller and Selfridge’s (1950) 
passages of different orders of approxi- 
mation to the statistical structure of 
English. 

These passages were originally con- 
structed for the purpose of studying 
the effect of sequential association 
upon memory. It was hypothesized 
that the better memory for higher 
orders of approximation to English
is related to the greater number of 
words determining each subsequent 
word. Many investigations have 
made use of these or similarly con- 
structed passages as independent 
variables for such factors as memory 
(Deese & Kaufman, 1957; Marks & 

1 This investigation was carried out while 
the third author held a predoctoral fellowship 
from the National Institute of Mental Health, 
United States Public Health Service. This 
study was supported in part by Research 
Grant M1541, in part by Research Grant 
MY 4758, from the National Institutes of 
Health. The authors wish to thank J. Zubin 
for his help and interest in this research.
William Reynolds of Rutgers University
deserves special thanks for making available
a majority of the Ss. We appreciate the
assistance of Hilda Brody of Columbia 
University in providing additional Ss, and of 
Robert Keisner and Phyllis Zlotogura for 
analysis of part of the data. This paper was 
presented in part at a meeting of the Eastern 
Psychological Association, Philadelphia, 1961. 




Jack, 1952; Miller & Selfridge, 1950; 
Richardson & Voss, 1960; Sharp, 
1958), shadowing of one of two 
dichotic messages (Moray & Taylor, 
1958), “meaningfulness” (Marks &
Taylor, 1954), and eye-voice span 
(Lawson, 1961). Therefore it was 
thought worthwhile to obtain an 
exact measure of the amount of con- 
textual dependency in each passage 
in order to test the assumption that 
consecutive orders are equally distant 
from each other. Obviously, if they 
are not, the shape of any curves em- 
ploying this assumption in relating 
the above mentioned factors to ap- 
proximation to English would be 
inaccurate. 

The technique chosen for evaluating 
the amount of contextual dependency 
was the “cloze” procedure (Taylor,
1953, 1954, 1956), which was origi- 
nated as a measure of readability 
and which consists of having Ss 
guess the words which have been 
systematically deleted from a given 
passage. Thus, application of the 
cloze procedure to these passages 
will—in addition to supplying an 
exact measure of contextual depend- 
ency—provide standards for compar- 
ing different texts, e.g., one text 
might be described to be as readable 
as third-order, another as readable 
as seventh-order approximation to 
English. 

Finally, this study will provide the 
opportunity to examine the effect 
of order of approximation upon the 
number of guessed words which fall 
into the same grammatical category 




of speech as the “correct” word.
This will make it possible to evaluate 
the contribution which the syntax, 
resulting from these approximations 
to English, makes to such functions 
as memory. 


METHOD 


Passages.—The 50-word passages of Miller 
and Selfridge (1950), at each of the eight 
orders of statistical approximation to English 
(zero, first, second, third, fourth, fifth, 
seventh, and Text), were the basic material 
for this study. Each passage was mimeo- 
graphed on a separate sheet of paper, each 
deleted word being indicated by an underlined 
blank of constant size. The order of presenta- 
tion of the passages was randomized within 
each set of eight and varied from S to S. 

Experimental groups.—Ninety-three under- 
graduate students, with a median age of 18.9 
yr., were required to participate in this experi- 
ment as part of their regular course work. 

Group A1 (17 Ss, 11 male and 6 female) and
Group A2 (16 male Ss) were given the pas-
sages with every fifth word deleted, starting
with the fifth word of each passage and omit-
ting the last word. Groups A1 and A2 gave
essentially the same results on all measures,
and therefore they have been combined into
a single group (A1A2) in some of the analyses
presented below. For all analyses of gram-
matical category, only Group A2 is included
since it was assumed that further analysis of
Group A1 would have yielded the same results.

Group B (33 male Ss) was given the same
passages as Groups A1 and A2, with every
fifth word deleted, starting with the sixth
word of each passage and omitting the first
word.

Group C (27 male Ss) was given the same 
passages with every seventh word deleted, 
starting with the seventh word of each passage 
and omitting the last two words. 
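
The three deletion schedules just described can be sketched in a few lines. This is only an illustration under stated assumptions: the function names `cloze_positions` and `make_cloze` are hypothetical, positions are counted from 1, and "omitting" is read as leaving the named first or last words intact.

```python
def cloze_positions(n_words, step, start, skip_first=0, skip_last=0):
    """Return the 1-based positions of the words to delete.

    Every `step`-th word is deleted beginning at `start`; positions that fall
    within the first `skip_first` or the last `skip_last` words are kept.
    """
    lowest = skip_first + 1
    highest = n_words - skip_last
    return [p for p in range(start, n_words + 1, step) if lowest <= p <= highest]


def make_cloze(words, positions, blank="______"):
    """Replace the words at the given 1-based positions with a fixed-size blank."""
    deleted = set(positions)
    return [blank if i in deleted else w for i, w in enumerate(words, start=1)]


# Deletion patterns for the 50-word passages, as read from the Method section.
GROUP_A = cloze_positions(50, step=5, start=5, skip_last=1)   # Groups A1 and A2
GROUP_B = cloze_positions(50, step=5, start=6, skip_first=1)  # Group B
GROUP_C = cloze_positions(50, step=7, start=7, skip_last=2)   # Group C
```

Under this reading, the fifth-word schedules yield nine blanks per passage and the seventh-word schedule yields six, which agrees with the six blanks mentioned for Group C in the Results.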

Procedure.—The passages were adminis-
tered to Ss in a group. They were told in the
instructions read to them that the typescripts 
contained no punctuation and they them- 
selves would have to decide where a new 
thought began, that only one word was called 
for in each blank, and that, when they were 
not certain, they should guess rather than 
skip a blank. Thirty minutes were allowed 
for completion of the task. 


RESULTS AND DISCUSSION 


FIG. 1. Proportion of words guessed cor-
rectly as a function of order of approximation
to English.


Proportion of words correctly guessed.
—Figure 1 shows that the proportion
of correctly guessed words increased
from Orders 0 through 7, with Text
approximately equal to Order 5.
Friedman analyses of variance (Siegel, 
1956), used for this and all subsequent 
comparisons, showed that the three 
groups (A1A2, B, C) did not differ
significantly from one another (P =.53) 
and that, within each group, the 
increase over orders was statistically 
significant (P < .001 for each group). 
Therefore, the mean curve in Fig. 1, 
based on N = 93 and on about 40% 
of the words of each passage, repre- 
sents a more reliable summary of 
these data than each curve separately. 
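
For readers unfamiliar with the Friedman analysis, the within-group test over orders can be sketched as follows. This is a minimal illustration, not the authors' computation: the function names are hypothetical, the scores are invented, and the statistic is the textbook form without a correction for ties.

```python
def average_ranks(values):
    """Rank values from 1 to k, giving tied values the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1  # mean of the 1-based ranks i+1 .. j+1
        for idx in order[i:j + 1]:
            ranks[idx] = mean_rank
        i = j + 1
    return ranks


def friedman_chi_square(table):
    """Friedman statistic for N rows (Ss) by k columns (conditions)."""
    n = len(table)
    k = len(table[0])
    rank_sums = [0.0] * k
    for row in table:
        for col, rank in enumerate(average_ranks(row)):
            rank_sums[col] += rank
    sum_sq = sum(r * r for r in rank_sums)
    return 12.0 / (n * k * (k + 1)) * sum_sq - 3.0 * n * (k + 1)


# Hypothetical scores: 4 Ss (rows) by 3 orders (columns); not data from the study.
scores = [[1, 2, 3], [1, 3, 2], [1, 2, 3], [2, 1, 3]]
chi_r = friedman_chi_square(scores)
```

The statistic is referred to a chi-square distribution with k - 1 degrees of freedom when N and k are not too small; for small designs exact tables are used instead.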

This result also makes clear that 
having 6 words on either side of the 
blank does not produce more correct 
guesses than having 4 words on either 
side of the blank, thus reinforcing 
the conclusion of Taylor (1954) and 
MacGinitie (1961) that the words 
predicted for every fifth blank are 
independent of each other, and the 
conclusion of Aborn, Rubenstein, and 
Sterling (1959) that 11-word sentences 
result in optimum predictability in 
comparison to 6-word and 25-word 
sentences. Apparently Ss either do 
not or cannot make use of a context 
of more than 5 words on either side 
of each blank. 

TABLE 1

MEAN NUMBER AND PROPORTION OF EXACT CORRECT WORDS AS A FUNCTION OF
ORDER OF APPROXIMATION TO THE STATISTICAL STRUCTURE OF ENGLISH

                                  Order of Approximation
Group   Score            0     1     2     3     4     5     7   Text
A1A2    Number a         0   .03   ...  1.64  2.45  1.42  2.61  2.97
        Proportion b     0  .003  .068  .182  .272  .158  .290  .330
B       Number a         0   .09   ...  1.39  1.58  3.24  3.55  2.15
        Proportion b     0  .010  .023  .154  .176  .360  .394  .239
C       Number a         0   .29   ...  1.61  1.61  2.72  3.78  2.45
        Proportion b     0  .032  .032  .179  .179  .302  .420  .272
Mean Number              0   .14   .37  1.55  1.88  2.46  3.31  2.52
Mean Proportion          0  .016  .041  .172  .209  .273  .368  .280

a Number of correct words per S.
b Proportion of correct words to total number of words guessed.


The fact that the relationship be-
tween order and proportion of correct
words is not linear means that the
assumption of equal distances between
consecutive statistical orders is not
tenable. If we utilize the values in 
Table 1 as correct estimates of the 
distances between orders of approxi- 
mation, then the distance between 
Orders 2 and 3 is more than five times 
the distance between Orders 1 and 2. 
For memory improvement on these 
same passages² (Selfridge, 1949), the
corresponding ratio is less than 1.5.
Figure 2 shows the memory data 
plotted in two ways: (a) according 
to the assumption of equal intervals 
between orders of approximation, 
and (b) with orders spaced according 
to number of words correctly guessed 
in the present study. From Orders 3 
through 7, the curves are essentially 
the same. Up to Order 3, however, 
the intervals between successive orders 
vary greatly in size, indicating that 

2 The set of passages used in the present 
study is available in Miller and Selfridge 
(1950). However, their memory data are 
based on averages of two passages at each 
order of approximation. The percentages
of words correctly recalled, for the single set
of passages used here, were obtained from
Selfridge’s (1949) thesis, and
her first set of lists.


the equal-intervals assumption is least 
tenable in this region. One additional
fact is that Text gives rise to fewer
correct words than Order 7. This indicates
that it was misplaced on the order
continuum and also suggests that
different texts might well appear in 
different positions on this continuum. 
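
The comparison of distances between orders can be checked directly from the mean numbers of correct words in Table 1. The short sketch below only restates that arithmetic; the variable names are illustrative.

```python
# Mean number of exact correct words at each order (Mean Number row of Table 1).
MEAN_CORRECT = {"0": 0.0, "1": 0.14, "2": 0.37, "3": 1.55,
                "4": 1.88, "5": 2.46, "7": 3.31}

orders = list(MEAN_CORRECT)

# Distance between each pair of consecutive orders on the cloze scale.
steps = {
    f"{a}-{b}": MEAN_CORRECT[b] - MEAN_CORRECT[a]
    for a, b in zip(orders, orders[1:])
}

# Ratio of the Order 2-3 distance to the Order 1-2 distance.
ratio_23_to_12 = steps["2-3"] / steps["1-2"]
```

With these values the Order 2 to 3 step is 1.18 words and the Order 1 to 2 step is 0.23 words, a ratio of roughly 5.1, in line with the "more than five times" stated above.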
Proportion of different words.—Fig- 
ure 3 shows a plot of the proportion 
of the total number of guessed words 
which were different. With the ex- 
ception of Text, which is again out of 
line, it is clear from this graph that 
increasing order of approximation not 
only increases the probability of 
evoking a correct response but also 
limits the variety of different re- 
sponses. The difference among the 
three groups of Ss approaches signifi- 
cance (.05 < P < .08), but the graphs 
of the separate groups each show the 
general tendency to decrease over 
orders. The decrease is significant 
in each case (P < .008 for A1A2,
P < .001 for B, P = .002 for C).
Figure 4 shows a plot of the propor- 
tion of the total number of different 
incorrect words. With the exception 
of Text, there is a general decrease 
in the proportions with increasing 




order, although the overall drop 
is only about half of what it is in 
Fig. 3. The differences among the 
three groups approach significance 
(.05 < P < .08) as before, but here 
this is due to the fact that Group C 
does not show a significant decrease 
over orders (P = .20) while Groups 
A1A2 and B do show significant de-
creases (P < .05 and P < .04, re-
spectively). This is the only measure
on which one of the groups differed 
significantly from the others. A
suggested explanation is that, with
only six blanks for each passage in
which every seventh word was de-
leted, the total number of guessed
words may not have been large
enough to allow significant changes in
number of different words from order
to order when only incorrect guesses
are considered.


FIG. 2. Percentage of words correctly
recalled (Selfridge, 1949) as a function of
order of approximation to English, with an
equal-interval scale for orders (dashed line),
and with the intervals determined by the
average number of words guessed correctly
in the present study (solid line).
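
The two "different words" measures plotted in Fig. 3 and 4 are simple ratios, sketched below. The function names are hypothetical, the guesses are invented, and comparing words without regard to capitalization is an assumption; the paper does not say how responses were pooled over blanks.

```python
def proportion_different(guesses):
    """Number of distinct guessed words divided by the total number of guesses."""
    if not guesses:
        raise ValueError("no guesses supplied")
    normalized = [g.strip().lower() for g in guesses]
    return len(set(normalized)) / len(normalized)


def proportion_different_incorrect(guesses, correct):
    """The same ratio, restricted to guesses that differ from the deleted word."""
    target = correct.strip().lower()
    wrong = [g for g in guesses if g.strip().lower() != target]
    return proportion_different(wrong) if wrong else 0.0


# Hypothetical guesses for one blank whose deleted word was "the".
guesses = ["the", "a", "the", "house", "The", "dog"]
all_ratio = proportion_different(guesses)
wrong_ratio = proportion_different_incorrect(guesses, "the")
```

With few blanks per passage the second ratio is computed on a small number of incorrect guesses, which is the situation offered above as an explanation for Group C.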

Grammatical classification.—The 
grammatical classification was based 
on Fries’ (1952) system of analysis. 
The two groups of words called 
general grammatical categories below 
are lexical words (roughly equivalent 
to the classical categories of nouns, 
pronouns, verbs, adjectives, and ad-
verbs) and function words (roughly
equivalent to articles, conjunctions,
prepositions, auxiliary verbs, inter-
jections, and quantity words). The
five specific grammatical categories
are nouns and pronouns considered
as one class, verbs, adjectives, ad-
verbs, and function words as already
described.


FIG. 3. Number of different words in-
cluding the correct word (expressed as a
proportion of the total number of words
guessed) as a function of order of approxima-
tion to English.


Figure 5 shows that, as order of 
approximation increases from Orders 0 
to 7, an increasing proportion of the 
words emitted by Ss belongs to the 
same specific grammatical category 
as the deleted words. The increase 
is significant for each group (P < .001
for A2, B, and C), and the groups do
not differ from one another (P =.24). 


FIG. 4. Number of different incorrect
words (expressed as a proportion of the total
number of incorrect words) as a function of
order of approximation to English.


FIG. 5. Proportion of guessed words be-
longing to the same specific grammatical
category as the correct word, as a function of
order of approximation to English.


This graph again shows performance 
on Text to be worse than at Order 7 
and only slightly better than at Order 
5. It should be noted that at Order 0
Ss perform very close to chance level, 
i.e., when the emitted words are put 
into the five specific grammatical 
categories, about one-fifth fall into 
the correct category. 
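
The scoring of grammatical category can be sketched as a lookup followed by a match count. This is an illustration only: the part-of-speech tags, the function names, and the example pairs are hypothetical, and the mapping merely follows the five specific and two general categories described above rather than Fries' full system.

```python
LEXICAL_TAGS = {"noun", "pronoun", "verb", "adjective", "adverb"}


def specific_category(tag):
    """Map a part-of-speech tag onto the five specific categories."""
    if tag in ("noun", "pronoun"):
        return "noun/pronoun"
    if tag in ("verb", "adjective", "adverb"):
        return tag
    return "function"


def general_category(tag):
    """Map a part-of-speech tag onto the two general categories."""
    return "lexical" if tag in LEXICAL_TAGS else "function"


def proportion_matching(pairs, categorize):
    """pairs holds (tag of guessed word, tag of deleted word) tuples."""
    hits = sum(categorize(guess) == categorize(deleted) for guess, deleted in pairs)
    return hits / len(pairs)


# Hypothetical tagged responses; not data from the study.
pairs = [("noun", "pronoun"), ("verb", "noun"),
         ("article", "preposition"), ("adverb", "adjective")]
specific_rate = proportion_matching(pairs, specific_category)
general_rate = proportion_matching(pairs, general_category)
```

The chance levels of one-fifth and one-half quoted in the text correspond to treating the five (or two) categories as equally likely.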

Figure 5 differs from Fig. 1 (num- 
ber of correct words) in that it seems 
to approach an asymptote at about 
Order 3 with only a relatively small 
increase after that, while the number 
of correct words rises as much after 
as before Order 3. It is also apparent 
that the maximum proportion of 
words in the correct grammatical 
category is more than twice as large 
as the proportion of exact correct 
words, i.e., while Ss may not react 
to the meaning they react to the 
syntax. This type of “knowledge” 
may well be used as a further way of 
distinguishing different texts. 

Again comparing Fig. 5 and 1, it is 
seen that the increase from Orders 1 
to 2 for words in the correct specific 
grammatical category accounts for a 
considerably greater proportion of the 
overall increase than does the cor- 
responding change for exact correct 
words. Beyond Order 3, as already 
stated, correct specific grammatical




classifications appear to approach 
an asymptote while number of exact 
correct words continues to increase. 
Therefore, if grammatical classifica- 
tion of guessed words is taken as a 
measure of amount of syntactical 
structure, and exact correct words are 
taken as a measure of amount of 
meaning, improvement in memory 
must be attributed primarily to in- 
creased syntactical structure between 
Orders 1 and 2, about equally to syn- 
tactical structure and to meaning 
between Orders 2 and 3, and primarily 
to meaning beyond Order 3. 

If we examine the relationship be- 
tween order and specific grammatical 
category, taking into account only 
those responses which differed from 
the deleted words, it is found to be 
nearly identical to that shown in 
Fig. 5, where correct responses are 
included. The increases over orders 
are significant (P < .001 for A2, B,
and C) and the three groups do not 
differ from one another (P > .15). 

The relationship between order and 
general grammatical category also 
remains nearly the same whether or 
not correct responses are taken into 
consideration. Again at Order 0 Ss 
perform very close to chance level, 
i.e., when the emitted words are put 
into the two general grammatical 
categories of lexical and function 
words, without further subdivision, 
about one-half fall into the correct 
category. In general these relation- 
ships are very similar to the analogous 
ones for specific grammatical cate- 
gories. Although the increases over 
orders cover a smaller range when 
the words are divided into only two 
categories, they are significant for all 
three groups (including correct re-
sponses: P < .002 for A2, P < .001
for B and C; not including correct
responses: P < .02 for A2, P < .001
for B and C), and the groups do not 




differ from one another (P = .53 in 
both cases). 


These results are in general agreement 
with Epstein’s (1961) finding that learn- 
ing of a syntactically structured series 
of words is superior to that of a random, 
unstructured series of words, since the 
two series correspond to Order 1 and 
Text of the present study. Epstein 
found in addition that there was no 
significant difference between his Ss’ 
ability to learn syntactically structured 
nonsense syllable material and their 
ability to learn syntactically unstruc- 
tured meaningful material, and this 
result demonstrates that syntax or 
meaning alone contributes to learning. 
This makes more plausible our suggestion 
that both “meaning” and syntax make 
contributions to memory for Miller and 
Selfridge’s (1950) passages. 


SUMMARY 


Ninety-three undergraduate students were 
required to guess the words that were system- 
atically deleted from a series of passages vary- 
ing in order of approximation to the statis- 
tical structure of English. The Ss guessed a 
greater proportion of words the higher the 
order of approximation to English. Propor-
tion of words in the correct grammatical cate-
gory also showed an increase with increasing 
order of approximation. Proportion of words 
in the correct grammatical category increased 
most from Order 0 to Order 3 while proportion 
of correct words continued to increase as much
after Order 3 as before it. Thus analysis of the 
predicted words into their grammatical cate- 
gories may well be useful for further differen- 
tiation of text materials, since the two types 
of analysis do not yield the same type of 
information. Finally, it was pointed out 
that the assumption of equal intervals be- 
tween successive orders of approximation 
is untenable, and the relationship between 
memory and order of approximation to 
English can be explained in part by syn- 
tactical structure and in part by the “mean- 
ing” called for by the context. 


REFERENCES 


ABORN, M., RUBENSTEIN, H., & STERLING,
T. D. Sources of contextual constraint
upon words in sentences. J. exp. Psychol.,
1959, 57, 171-180.

DEESE, J., & KAUFMAN, R. A. Serial effects
in recall of unorganized and sequentially
organized verbal material. J. exp. Psychol.,
1957, 54, 180-187.

EPSTEIN, W. The influence of syntactical
structure on learning. Amer. J. Psychol.,
1961, 74, 80-85.

FRIES, C. C. The structure of English. New
York: Harcourt, Brace, 1952.

LAWSON, E. A. A note on the influence of
different orders of approximation to the
English language upon eye-voice span.
Quart. J. exp. Psychol., 1961, 13, 53-55.

MACGINITIE, W. H. Contextual constraint
in English prose paragraphs. J. Psychol.,
1961, 51, 121-130.

MARKS, M. R., & JACK, O. Verbal context
and memory span for meaningful material.
Amer. J. Psychol., 1952, 65, 298-300.

MARKS, M. R., & TAYLOR, W. L. The
influence of contextual and goal constraints
on the meaningfulness of “automatic
sentences.” J. soc. Psychol., 1954, 40,
43-51.

MILLER, G. A., & SELFRIDGE, J. A. Verbal
context and the recall of meaningful
material. Amer. J. Psychol., 1950, 63,
176-185.

MORAY, N., & TAYLOR, A. The effect of
redundancy in shadowing one of two
dichotic messages. Lang. Speech, 1958, 1,
102-109.

RICHARDSON, P., & VOSS, J. F. Replication
report: Verbal context and the recall of
meaningful material. J. exp. Psychol.,
1960, 60, 417-418.

SELFRIDGE, J. A. Investigations into the
structure of verbal context. Unpublished
honor’s thesis, Harvard University, 1949.

SHARP, H. C. Effect of contextual constraint
upon recall of verbal passages. Amer. J.
Psychol., 1958, 71, 568-572.

SIEGEL, S. Nonparametric statistics for the
behavioral sciences. New York: McGraw-
Hill, 1956.

TAYLOR, W. L. Cloze procedure: A new
tool for measuring readability. Journalism
Quart., 1953, 30, 415-433.

TAYLOR, W. L. Application of “cloze” and
entropy measures to the study of contextual
constraint in samples of continuous prose.
Unpublished doctoral dissertation, Uni-
versity of Michigan, 1954.

TAYLOR, W. L. Recent developments in the
use of cloze procedure. Journalism Quart.,
1956, 33, 42-48.


(Received June 5, 1961) 


Journal of Experimental Psychology
1962, Vol. 64, No. 1, 58-61


THE EFFECTS OF REWARD AND KNOWLEDGE OF 
RESULTS ON THE PERFORMANCE OF A SIMPLE 
VIGILANCE TASK 
RAYMOND R. SIPOWICZ, J. ROGER WARE, AND ROBERT A. BAKER
United States Army Armor Human Research Unit, Fort Knox, Kentucky 


Although the effects of reward and 
knowledge of results on human per- 
formance have been intensely studied, 
the effectiveness of such variables on 
human monitoring has received rela-
tively little attention. In view of the
practical importance of vigilance re- 
search and the need to sustain and 
maintain a high level of detection, 
this neglect is somewhat surprising.
Pollack and Knaff (1958) studied the 
effects of reward and punishment on 
monitoring, and reported that punish- 
ment was more effective than reward. 
Moreover, the reward condition proved 
relatively ineffective in improving 
performance beyond a level obtained 
without reward. 

Mackworth (1950); McCormack 
(1959); Loeb and Schmidt (1960);
and Weidenfeller, Baker, and Ware 
(1962) did, however, obtain signifi- 
cant improvement in vigilance per- 
formance using knowledge of results 
(KR). In the last study, not only 
KR but “false KR” was also found 
to be effective. Knowledge of results 
in this experiment was given by means 
of a bright white light. While it might 
be assumed that such added stimula- 
tion would raise S's activation level 
and hence increase detection prob- 
ability, the inclusion of a control 
group having the light alone (ad- 
ministered in synchrony with the 
typical vigilance decrement) showed 
no arousal effects. A similar finding 
has been reported by Davis, McCourt,
and Solomon (1960). 

The effectiveness of KR in these 
studies, and the apparent failure of 




“reward” in the Pollack and Knaff 
(1958) study, is somewhat contra- 
dictory—especially in view of the 
fact that feedback or KR can be, 
and often is, logically regarded as a 
form of reward or reinforcement. In 
the Pollack and Knaff study no men- 
tion was made of the amount of 
reward except to say that it was an 
extra hour of pay. Further, the con- 
ditions of reward were such as to make 
it difficult for more than 1 or 2 Ss in 
the group to be rewarded. 

If, however, in the typical vigilance 
study the reward is contingent upon 
the maintenance of a certain level 
of vigilance by each S, the effects 
of the reward might be more pro- 
nounced than in a situation where a 
reward is given for each signal de- 
tected or on a competitive basis where 
only the best are rewarded. More-
over, if reward and KR are indi-
vidually effective, a condition combin-
ing the two might be doubly effective.

In order to furnish answers to some 
of these questions, the effects of KR, 
reward, and the combination of the 
two in a typical vigilance task were 
studied under the conditions in which 
a monetary reward was individually 
administered and was contingent upon 
the maintenance of a high level of 
performance. 


METHOD 


Subjects.—Eighty Fort Knox armor train-
ees, aged 17 to 24 yr., free from visual defects,
served as Ss. Twenty Ss were arbitrarily
assigned to Group C (Control), 20 to Group 
KR, 20 to Group R (Reward), and 20 to a 




combined KR and reward group (Group 
R + KR). 

Apparatus.—The Ss’ task was to detect
aperiodic interruptions of a continuous light
source over a 3-hr. period. The light source
(12 v. dc bulb, .45 ft-c as a point source,
operating at 54 v.) was located at approxi-
mately eye level in a flat black plywood box.
The schedule of interruptions (signals) was
predetermined on the basis of 12 signals per
½ hr., with a total of 72 for the 3-hr. session.
The randomized intersignal intervals ranged 
from 24 to 360 sec. with an average interval 
of 150 sec. The schedule was presented by 
means of a film-tape, fed through a Gerbrands 
variable-interval programer, and a simple 
timing circuit. The latter determined the 
duration of the interruption (.03 sec. measured 
at the timing relay with a Hunter Klockoun- 
ter), and the former determined the intervals 
between the interruptions. The signal 
presentations and Ss’ signal detections were 
recorded by a 20-pen Esterline-Angus opera- 
tions recorder. 

Procedure.—Each S, wearing earphones to
reduce ambient noise, monitored the light in 
an isolated room. For the no-KR condition, 
S was told to press the response button as soon 
as he saw a signal. Each S was further told 
that the signals could occur at any time, so 
it was necessary to remain alert and watch 
the light at all times. To insure that S 
understood the requirements, a practice 
period of 10 signals (at 1-min. intervals) was 
given before the watch session began. 

For the KR conditions, a 1}-in. pilot lamp 
was installed at approximately eye level on 
the right side of the monitoring display. The 
S’s requirements were the same as before; S 
was told that the pilot lamp would light if he 
missed a signal, thus indicating a signal was 
missed and that he should be more alert 
in order to detect subsequent signals. This 


light, unlike the signal light, was covered by 
a white filter lens and operated at its maxi- 
mum voltage—6 v. dc. The light was flashed 
for a 2-sec. period if a given signal was not 
detected within 5 sec. after its presentation.
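
The 5-sec. scoring rule can be sketched as follows. This is an assumption-laden illustration: the function name is hypothetical, times are in seconds from the start of the watch, each button press is credited to at most one signal, and any press not credited to a signal is counted as a false response, a rule the paper does not spell out.

```python
def score_signals(signal_times, response_times, window=5.0):
    """Return (missed_signals, false_responses).

    A signal counts as detected if an unused response falls within `window`
    sec. after its onset; unused responses are treated as false responses.
    """
    responses = sorted(response_times)
    used = [False] * len(responses)
    missed = []
    for onset in sorted(signal_times):
        detected = False
        for i, pressed in enumerate(responses):
            if not used[i] and onset <= pressed <= onset + window:
                used[i] = True
                detected = True
                break
        if not detected:
            missed.append(onset)
    false_responses = [r for r, u in zip(responses, used) if not u]
    return missed, false_responses


# Hypothetical record: three signals and three button presses.
missed, false_responses = score_signals([10.0, 100.0, 200.0], [11.2, 150.0, 206.0])
```

In the KR conditions, each entry in `missed` would correspond to one 2-sec. flash of the pilot lamp.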

For the reward conditions, each S was told 
he would receive $3.00 if he detected all the 
signals. In addition, for each signal missed S 
was told that he would lose a portion of the
initial $3.00 in geometric progression begin-
ning with 5 cents for the first miss. By miss-
ing six or more signals during the watch
session, S lost the entire amount. Thus, if S 
missed only one signal in the 3-hr. period he 
received $2.95; for two misses, he received 
$2.85; for three, $2.65; for four, $2.25; for 
five, $1.45, and with six or more misses, 
nothing. As in the KR condition, a miss was 
scored if S failed to report a signal within 5 
sec. after its appearance. 
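
The payoff schedule follows from doubling the penalty with each miss (5, 10, 20, 40, and 80 cents), with the whole amount forfeited at the sixth miss. A minimal sketch, with a hypothetical function name and amounts in cents:

```python
def reward_cents(misses, initial=300, first_penalty=5, forfeit_at=6):
    """Amount retained (in cents) after a given number of missed signals.

    Penalties double with each miss, so the total lost after m misses is
    first_penalty * (2**m - 1); at `forfeit_at` misses everything is lost.
    """
    if misses < 0:
        raise ValueError("misses must be non-negative")
    if misses >= forfeit_at:
        return 0
    return initial - first_penalty * (2 ** misses - 1)


schedule = [reward_cents(m) for m in range(7)]

# Smallest proportion of the 72 signals that still earns some reward.
min_detection_rate = (72 - 5) / 72
```

The schedule reproduces the amounts listed above, and five misses out of 72 signals corresponds to detecting about 93% of the signals.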

Although the study was designed for 20 
Ss per group, several Ss fell asleep during the 
watch session. Since Ss could not be directly 
observed, it was arbitrarily decided to elimi- 
nate from the study all Ss who missed 12 or 
more consecutive signals. This was done on 
the assumption that Ss missing 12 or more 
consecutive signals were either asleep or 
uncooperative. On this basis, 5 of the original 
80 Ss were rejected—2 from Group C, 1 from 
Group KR, and 2 from Group R. None of the 
20 Ss, however, were eliminated from Group 
R+KR. In an attempt to equalize the 
number of Ss in each group, additional ses- 
sions were held. Due to experimental diffi- 
culties, however, data were obtained on only 
19 Ss in Group KR. 
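
The rejection criterion amounts to finding the longest run of consecutive misses in an S's record. A minimal sketch, assuming the record is a list of booleans in signal order (True for a detection); the function names are hypothetical.

```python
def longest_miss_run(detected):
    """Length of the longest run of consecutive missed signals."""
    longest = current = 0
    for hit in detected:
        current = 0 if hit else current + 1
        longest = max(longest, current)
    return longest


def should_exclude(detected, criterion=12):
    """True if S missed `criterion` or more consecutive signals."""
    return longest_miss_run(detected) >= criterion


# Hypothetical 72-signal records.
alert_record = [True] * 72
sleepy_record = [True] * 30 + [False] * 12 + [True] * 30
```

Note that the rule depends on consecutive misses only; an S with many scattered misses would be retained.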


RESULTS 


TABLE 1

MEAN PERCENTAGES OF SIGNALS MISSED FOR ALL GROUPS OVER SIX 30-MIN.
PERIODS AND OVER ENTIRE 3-HR. SESSION

                            Successive 30-Min. Periods
Group         1           2           3           4           5           6         Overall
          Mean   SD   Mean   SD   Mean   SD   Mean   SD   Mean   SD   Mean   SD   Mean   SD
C         18.2  12.6  21.6  16.5  19.2  13.6  25.0  15.1  27.4  17.4  34.1  27.5  24.3  12.1
KR         7     8.1  10.8  11.1  12.2  11.3  14.4  11.5  14.4  19.1  12.3  17.7  11.8  10.5
R          8.7  11.9   5.9   7.7   5.2   9.5   7.9   8.4  11.6  13.5  10.5  11.2   8.3   7.1
R + KR     4.4   4.9   2.0   3.5   6.2   7.1   5.8   6.7   3.6   4.9   3.6   4.9   4.3   2.4


The mean percentage of signals
missed by each group, by 30-min.
periods, is shown in Table 1. On an
overall basis, Group C missed 24.3%
of the signals; Group KR, 12%;
Group R, 8.4%; and Group R + KR,
4.3%. The raw data in terms of
number of signals missed by the 
groups were analyzed to obtain means 
and variances. A test for homo- 
geneity of variance was performed 
on these data and resulted in a 
significant F value. The data were 
transformed to a log scale and then 
subjected to an analysis of variance. 
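
Because each 30-min. period contained the same number of signals, the overall percentage missed should equal the plain average of the six period means. The sketch below checks this for the three groups whose period means are all legible in Table 1; the variable names are illustrative.

```python
# Period means (percentage of signals missed) read from Table 1.
PERIOD_MEANS = {
    "C": [18.2, 21.6, 19.2, 25.0, 27.4, 34.1],
    "R": [8.7, 5.9, 5.2, 7.9, 11.6, 10.5],
    "R + KR": [4.4, 2.0, 6.2, 5.8, 3.6, 3.6],
}

# Average of the six period means for each group.
overall = {group: sum(means) / len(means) for group, means in PERIOD_MEANS.items()}

# Percentage of signals detected by the combined group.
detected_r_kr = 100.0 - overall["R + KR"]
```

The averages come to about 24.3, 8.3, and 4.3, matching the Overall column, and the combined group's figure corresponds to detecting better than 95% of the signals.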

This analysis showed the groups 
were significantly different (F= 20.92, 
df = 3, P < .01). Using Duncan’s 
(1955) multiple range test, all the 
experimental groups were found to 
differ significantly from the control 
(P <.01). The mean difference be- 
tween Group R + KR and Group KR 
was also significant at the .01 level. 
The difference between Group R + KR
and Group R was, however, significant 
at only the .05 level. No significant 
difference was found between Group 
R and Group KR. 

An inspection of the performance 
curves also revealed the typical vigil- 
ance decrement within the first hour 
for Group C. This expected decre- 
ment, however, was not shown by any 
of the experimental groups. For the 
experimental groups, the greatest 
decrement occurred for Group KR 
between the first and fourth 30-min. 
periods. A correlated t test for the
decrement, however, showed the dif- 
ference was nonsignificant. 

Although it might be argued that 
knowledge of results should lead to an 
increase in the number of false re- 
sponses, an analysis of the present 
data showed no significant differences 
between groups. The Ss in Group KR
averaged 1.05 false responses as com- 
pared with 0.84 for Group C, 0.72 for 
Group R, and 0.25 for Group R + KR.


DISCUSSION 


Of most importance was the finding 
that the combination of reward and 
knowledge of results led to the detection 
of better than 95% of the signals—a 
performance at a “near perfect” level. 
Further, the extremely small variance 
of this group may be some indication 
of the extent to which individual differ- 
ences in performance can be decreased 
by the use of such motivational tech-
niques. Classification of Ss into “good
performers” and “bad performers” as
was done in other studies (Buckner, 
Harabedian, & McGrath, 1960; Fraser, 
1953) may be adequate in vigilance tasks 
in which no attempt is made to sustain 
a high level of motivation. Results from 
this study, however, suggest the per- 
formance of all Ss in monitoring tasks 
may be sustained at a high level. Thus, 
the problem of selection and assignment 
of Ss to vigilance tasks on the basis of 
their initial ability can be eliminated if 
effective motivational techniques are 
discovered and properly employed. 

Of additional interest is the fact that 
reward alone proved to be only slightly 
more effective than KR. It should be 
noted, however, that in the reward 
situation S had no way of knowing 
whether or not a signal was missed and,
consequently, was unable to adjust 
effectively his monitoring behavior to 
meet the requirements of the task. 

The relatively high level of per- 
formance, as well as the failure to find 
the usual vigilance decrement for the 
two reward conditions, focuses consider- 
able attention upon the method used 
in dispensing the reward. First of all, 
Ss did not have to detect all of the signals 
in order to be rewarded. The S, how- 
ever, was required to detect at least 93% 
of the signals. Moreover, Ss were told 
before the session began that they would 
be given the reward. Their subsequent 
performance determined whether or not 
it would be retained. This technique, it 
is believed, was particularly effective 
in sustaining a high level of motivation 
over the entire watch period. When 
specific knowledge of the reward situa- 
tion was then made available throughout 
the watch session, S’s motivation was 
further increased and an even higher 
level of performance was obtained. 

In previous studies of KR such as 
Mackworth’s (1950), Ss were informed 
whenever they made a false response, 
when they failed to detect a given signal, 
and when they correctly detected a 
signal. It is likely, however, that much 
of this information is superfluous. 
Knowledge of results as used in this study 
gave Ss information only when they 
failed to detect a given signal. In terms 
of the mean number of signals missed, 
this technique was as effective as those 
used by Mackworth. Mackworth’s ex- 
perimental group missed 17.3% of the 
signals presented in a 2-hr. session, 
whereas Ss in the present study missed 
only 12% over a 3-hr. period. 


SUMMARY 


Four groups of Ss (20 to a group) monitored 
aperiodic and brief interruptions of a contin- 
uous light source under isolated conditions 
for a 3-hr. period. The Ss in Group R were 
given $3.00 if they detected all signals pre- 
sented during the watch session, but lost 
.05, .15, .35, .75, 1.55, or all if they missed one, 
two, three, four, five, or six signals. Group 
KR was informed of all signals missed by a 
bright flash of light. Group R + KR re- 
ceived both KR and reward according to 
the schedule for Group R. Group C, a con- 
trol, received neither reward nor KR. 
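
The loss schedule quoted for Group R follows a doubling pattern: the cumulative loss after n misses (n = 1 to 5) equals 5(2^n - 1) cents, and a sixth miss forfeits the whole $3.00. The short Python sketch below only restates that schedule. The function name `reward_retained`, the use of integer cents, and the treatment of more than six misses as a total loss are illustrative assumptions, not part of the reported procedure.

```python
# Cumulative loss in cents for 0-5 missed signals, as listed in the summary.
LOSS_CENTS = {0: 0, 1: 5, 2: 15, 3: 35, 4: 75, 5: 155}
FULL_REWARD_CENTS = 300  # the $3.00 given before the session


def reward_retained(misses: int) -> int:
    """Return the cents S keeps after `misses` missed signals."""
    if misses < 0:
        raise ValueError("misses must be non-negative")
    if misses >= 6:
        # Six misses lose everything; more than six is assumed to do the same.
        return 0
    return FULL_REWARD_CENTS - LOSS_CENTS[misses]


# The listed losses match the doubling rule 5 * (2**n - 1) cents for n = 1..5.
doubling_rule_holds = all(LOSS_CENTS[n] == 5 * (2**n - 1) for n in range(1, 6))

for n in range(7):
    print(n, "missed ->", reward_retained(n), "cents kept")
```

Under this schedule the first misses cost little, so most of the reward is still at stake late in the watch.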

Although all experimental groups were 
significantly better than the control group, 
the combination of reward and KR produced 
the highest level of signal detection. The 
results are interpreted as indicating that 
either reward or KR can be effective in main- 
taining a high level of vigilance. The effec- 
tiveness of reward, however, is highly de- 
pendent upon the manner in which it is used. 
The effectiveness of such incentives in im- 
proving performance and reducing inter-S 
variability also attenuated the importance 
previously assigned to individual differences. 
It is further suggested that such differences are 
primarily motivational and, as such, are 
susceptible to experimental manipulation and 
control. 


REFERENCES 


Buckner, D. N., HARABEDIAN, A., & Mc- 
Grath, J. J. A study of individual dif- 
ferences in vigilance performance. USA 
TAGO Hum. Factors Res. Br. tech. res. Rep., 
1960, No. 2. 

Davis, J. M., McCourt, W. F., & SOLOMON, 
P. The effect of visual stimulation on 
hallucinations and other mental experiences 
during sensory deprivation. Amer. J. 
Psychol., 1960, 116, 889-892. 

Duncan, D. B. Multiple range and multiple 
F tests. Biometrics, 1955, 11, 1-42. 

Fraser, D. C. The relation of an experi- 
mental variable to performance in a pro- 
longed visual task. Quart. J. exp. Psychol., 
1953, 5, 31-32. 

Loeb, M., & Schmidt, E. A. Influence of 
time on task and false information on 
efficiency of responding to pure tones. 
USA Med. Res. Lab. Rep., 1960, No. 426. 

Mackworth, N. H. Researches on the 
measurement of human performance. Spec. 
Rep. Ser. Med. Res. Coun., Lond., 1950, No. 
268, 25-29. 

McCormack, P. D. Performance in a 
vigilance task with and without knowledge 
of results. Canad. J. Psychol., 1959, 13, 
68-71. 

Pollack, I., & Knapp, P. R. Maintenance 
of alertness by a loud auditory signal. 
J. Acoust. Soc. Amer., 1958, 30, 1013-1016. 

WEIDENFELLER, E. W., BAKER, R. A., & 
Ware, J. R. The effects of knowledge of 
results (true and false) on vigilance per- 
formance. Percept. mot. Skills, 1962, 14, 
211-215. 


(Received June 7, 1961) 


Journal of Experimental Psychology 
1962, Vol. 64, No. 1, 62-66 


SOME CONDITIONS INFLUENCING THE ACQUISITION 
AND UTILIZATION OF CUES 1 


LOY S. BRALEY 2 
University of Buffalo 


A paper by Bruner, Matter, and 
Papanek (1955) posed the question 
as to the variables that control the 
range of environmental information 
that an organism acquires during 
some behavioral sequence. This prob- 
lem of the “breadth of learning” 
assumes that an organism is exposed 
to a wide range of possible cues under 
most circumstances, but that re- 
sponses are acquired to only a portion 
of these with the others remaining 
as undifferentiated context. While it 
is clear that the range of cues acquired 
may be differentially advantageous 
depending on the task requirements, 
the focus of their paper and the 
present study was on the acquisition 
of cues that were not immediately 
task-relevant, but which became so 
at a subsequent time. The above 
authors demonstrated, as conjectured 
by Tolman (1948), a reduction in sub- 
sequent utilization of initially irrele- 
vant cues as a product of overlearning 
and higher levels of motivation. 

This was but a start on an empirical 
question. Recent reviews by Easter- 
brook (1959) and Kausler and Trapp 
(1960) on cue utilization demonstrate 
the continued interest in this problem. 
Easterbrook, concerned with the rela- 
tion of emotion to behavioral organiza- 
tion, took the position that an in- 
crease in emotional arousal functions 
to reduce the utilization of cues. 
Consequently, degree of behavioral 
organization depends on task de- 
mands for “breadth” of cues. Kausler 
and Trapp were in substantial agree- 
ment although their focus was on the 
effects of motivation on the utilization 
of cues in incidental learning. They 
suggested additional relevant varia- 
bles of motivation as generalized D 
versus incentive-oriented set, spatial 
contiguity of relevant and irrelevant 
cues, and task difficulty. 


1 This article is based on a PhD disserta- 
tion submitted to the Faculty of the Graduate 
School of Arts and Sciences of the University 
of Buffalo. The author is greatly indebted to 
Ira S. Cohen, under whose guidance this 
study was conducted, and to Walter Cohen 
for many valuable suggestions throughout 
all phases of the investigation. 

2 Now at University of California, Santa 
Barbara. 

The present study was designed 
(a) to demonstrate with human Ss, 
as Bruner et al. (1955) have done with 
rats, that exposure to initially irrele- 
vant cues will facilitate later learning 
when these cues become the relevant 
discriminanda, (b) to test the hypothe- 
sis that high scores on a response- 
inferred motivational variable (Tay- 
lor MA scale) are inversely related 
to acquisition and utilization of such 
cues, and (c) to determine whether 
the presence of the previously correct 
cue from earlier learning would dif- 
ferentially facilitate or retard the 
shift to new cues as a function of low 
and high motive strength, respec- 
tively. Certain clinical conceptions 
of anxiety (Cameron, 1951; Sullivan, 
1953) would predict such a per- 
severative tendency. 


METHOD 


Subjects.—Eighty extreme scorers on the 
MA scale were selected as Ss from a sample 
of 525 undergraduate questionnaires obtained 
at the University of Buffalo and Springfield 


$ 


ACQUISITION AND UTILIZATION OF CUES 63 


College. The upper 20% of this sample 
had an MA score range of 20 to 40, with a 
mean of 25.3; the lower 20% a range of 0 to 7, 
with a mean of 4.59. The mean of all scores 
was 13.77. Twenty-four Ss were female, 13 
of whom were high MA scale scorers. All Ss 
were contacted and asked to participate in 
such a manner as to avoid any association 
between the MA scale and the experiment 
proper. 

Because of contradictory evidence (Taylor, 
1955) on the relationship of IQ to MA score, 
a short form of the Wechsler-Bellevue, Form 
I (Herring, 1952) was administered. The 
correlation between IQ and MA was .05. 
Learning rate on the experimental task and 
IQ revealed a nonsignificant r of —.22. 

Task and stimulus materials.—The basic 
task was one of concept learning with simul- 
taneous presentation of a positive and negative 
instance on each trial. This is similar to the 
Bruner et al. (1955) procedure and that of Blum and 
Blum (1949) and Lashley (1942) in their con- 
tributions to the continuity controversy. Each 
instance consisted of two small geometric figures 
inside a larger geometric figure, drawn on a white 
4 × 6 in. card. The three figures on any card 
were combinations of circles, triangles, and 
squares each of which could be red, blue, 
green, or yellow. An instance, then, might 
be a small red circle, small green triangle, 
within a large blue square. The stimuli 
for the three stages of the experimental 
procedure were as follows: 

Stage I: All Ss learned a concept where the 
positive cue was two small like-colored figures 
within a large figure of a different color. 
Only 12 positives were used as pretesting had 
indicated rapid learning for this simple 
concept. Since only two colors appeared on 
these positives, 12 combinations were possible. 
This also permitted appropriate combinations 
of the three types of figures. 
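
The count of 12 positives can be checked by enumeration. The sketch below assumes that a positive is defined by an ordered pair of different colors (one shared by the two small figures, one for the large figure) drawn from the four colors named above. That reading of the text, and the variable names, are assumptions made for illustration.

```python
from itertools import permutations

COLORS = ("red", "blue", "green", "yellow")

# Each pair is (color of both small figures, color of the large figure).
# permutations() yields ordered pairs of two different colors.
positive_color_pairs = list(permutations(COLORS, 2))

for small_color, large_color in positive_color_pairs:
    print(f"two small {small_color} figures inside a large {large_color} figure")

print(len(positive_color_pairs))  # 4 * 3 = 12
```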

The same negative instances were used 
in all three stages so the possibility of memori- 
zation had to be obviated. The 30 negative 
instances were combinations of 4 values of 
color, 3 of shape, and 2 of size (small or large 
figure). In addition, 6 negatives with all 
figures of the same color were included to 
avoid concept attainment by a different- 
colors cue. 

Stage II: While all Ss continued to be 
reinforced for the Stage I positive cue, half 
of the Ss were exposed to 10 trials of irrelevant 
cues. Five of these consisted of cross-hatching 
in the large figure and 5 consisted of the 
substitution of an L shape for one of the small 
figures. All new irrelevant cues appeared on 
Stage I positive instances. 


Stage III: The positive cues were now 
either the presence of a cross-hatched large 
figure or an L shaped small figure. The 30 
positive instances were identical to the nega- 
tives except for the addition of cross-hatching 
on half, or substitution of a small L shape on 
the remainder. No two instances that were 
the same in original form were opposed during 
exposure to S. Half of the Ss in Stage III had 
the Stage I positive cue present as a negative 
instance on 50% of the trials. For this 
condition a set of 20 negatives was used 
consisting of the last 10 Stage I positives and 
the last 10 of the regular negatives. 

Design and procedure.—The experiment 
was a 2 × 2 × 2 factorial design with 10 Ss 
per cell. Conditions were: (a) high and low 
anxiety, (b) presence or absence of irrelevant 
cues in Stage II, and (c) presence or absence 
of Stage I positive cue during Stage III. 

The Ss were randomly assigned to one 
of the four experimental conditions according 
to anxiety grouping and were run individually. 
The IQ measure was obtained at the conclu- 
sion of the experimental task. 
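
The design just described can be laid out as eight cells of 10 Ss each, with random assignment carried out separately within each anxiety group. The following is a minimal sketch of such an assignment, not a record of how the original assignment was made. The subject labels, the seed, and the function name `assign` are hypothetical.

```python
import random
from collections import Counter
from itertools import product

ANXIETY = ("high", "low")
IRRELEVANT_CUES_STAGE_II = ("present", "absent")
STAGE_I_CUE_IN_STAGE_III = ("present", "absent")
SS_PER_CELL = 10


def assign(subject_ids_by_anxiety, seed=0):
    """Randomly assign the Ss of each anxiety group to the four conditions."""
    rng = random.Random(seed)
    conditions = list(product(IRRELEVANT_CUES_STAGE_II, STAGE_I_CUE_IN_STAGE_III))
    assignment = {}
    for anxiety in ANXIETY:
        ids = list(subject_ids_by_anxiety[anxiety])
        if len(ids) != SS_PER_CELL * len(conditions):
            raise ValueError("each anxiety group needs 40 Ss")
        rng.shuffle(ids)
        for i, condition in enumerate(conditions):
            for sid in ids[i * SS_PER_CELL:(i + 1) * SS_PER_CELL]:
                assignment[sid] = (anxiety, *condition)
    return assignment


# Hypothetical subject labels: 40 high and 40 low scorers.
subjects = {
    "high": [f"H{i:02d}" for i in range(40)],
    "low": [f"L{i:02d}" for i in range(40)],
}
cells = assign(subjects)
cell_sizes = Counter(cells.values())
print(len(cell_sizes), "cells,", len(cells), "Ss in total")
```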

The S faced a plywood screen with two 
exposure windows at approximately eye level. 
With the simultaneous presentation of two 
cards on each trial S was to indicate his choice 
of the correct card by depressing one of two 
response keys fixed to the table immediately 
below the windows. A correct choice closed 
a circuit and reinforcement was delivered 
immediately by a small white bulb recessed 
between and above the windows. Position 
of the correct card was varied according to a 
prearranged randomly chosen order. The Ss 
were instructed that their job was to choose 
the correct one of two cards and that they 
would have to begin by guessing. However, 
the reinforcing stimulus would indicate when 
they were correct in their choice so that even- 
tually they were to reach a point where they 
could choose the correct card each time. 

In Stage I, trials were continued to a cri- 
terion of 10 successive correct responses. 
Eight Ss were discarded for failure to meet 
criterion by Trial 70. Stage II began with 
completion of the criterion trials and was, 
to S, continuous with Stage I in that there 
was no interruption in presentation of the 
stimulus materials. The experimental group 
was exposed to the series of 10 irrelevant 
cues. The control Ss were simply continued 
for 10 trials beyond criterion on the Stage I 
cue, giving an equal number of overlearning 
trials to both groups. Stage III was again 
continuous with Stage II, and rein- 
forcement could be achieved only by re- 
sponding to either of the previously irrelevant 
cues. 


TABLE 1 

MEANS AND SDs OF TRANSFORMED TRIALS TO 
CRITERION FOR CONCEPT LEARNING: 
STAGE III 

                          Irrelevant Cues     Irrelevant Cues 
          Positive Cue    Present in          Absent in 
Anxiety   from Stage I    Stage II            Stage II 
                          Mean      SD        Mean      SD 
High      Present         5.55      2.55      3.17      2.04 
          Absent          6.35      2.93      5.68      2.91 
Low       Present         5.45      1.51      3.93      1.56 
          Absent          8.13      1.16      4.40      2.97 

RESULTS 


There was no substantial difference 
in Stage I acquisition trials between 
control Ss (Mean = 10.20, SD = 11.60) 
and those exposed to the irrelevant 
cues in Stage II (Mean = HESI. 
SD = 12.92). While not shown, it 
might also be noted that neither 
experimental nor control Ss made 
any errors during the 10 Stage II 
trials. 

Table 1 presents the transformed 
means and SDs of the eight experi- 
mental conditions. A Bartlett test 
revealed nonhomogeneous variances 
among the conditions, necessitating 
a square-root transformation to meet 
the requirements of the analysis of 
variance. 
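
To illustrate why a square-root transformation can help when variances are heterogeneous, the sketch below computes Bartlett's statistic by hand for two groups of invented trials-to-criterion scores, before and after the transformation. The scores are hypothetical and are not the data of this study. The function name `bartlett_statistic` is likewise an illustrative choice.

```python
import math
import statistics


def bartlett_statistic(groups):
    """Bartlett's statistic for homogeneity of variance across groups."""
    k = len(groups)
    sizes = [len(g) for g in groups]
    variances = [statistics.variance(g) for g in groups]  # sample variances
    n_total = sum(sizes)
    pooled = sum((n - 1) * v for n, v in zip(sizes, variances)) / (n_total - k)
    numerator = (n_total - k) * math.log(pooled) - sum(
        (n - 1) * math.log(v) for n, v in zip(sizes, variances)
    )
    correction = 1 + (
        sum(1 / (n - 1) for n in sizes) - 1 / (n_total - k)
    ) / (3 * (k - 1))
    return numerator / correction


# Hypothetical scores: the slower group is also far more variable.
fast_group = [4, 6, 9, 5, 7, 8, 3, 6, 5, 7]
slow_group = [25, 40, 60, 30, 55, 20, 70, 35, 45, 50]

raw = bartlett_statistic([fast_group, slow_group])
transformed = bartlett_statistic(
    [[math.sqrt(x) for x in fast_group], [math.sqrt(x) for x in slow_group]]
)
print(f"raw scores: {raw:.2f}; after square root: {transformed:.2f}")
```

A smaller statistic after transformation indicates variances that are closer to equal, which is the requirement of the analysis of variance referred to above.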

Table 2 indicates a significant F 
for the Stage II irrelevant cue ex- 
posure condition. However, rather 
than facilitating performance in Stage 
III, exposure to the irrelevant cues 
impaired performance. It appears 
then that the facilitation of subse- 
quent learning through utilization of 
previously irrelevant cues, as found by 
Bruner et al. (1955), cannot be 
demonstrated with human Ss under 
these experimental conditions. 

It is also clear from Table 2 that 
differences in the motivational condi- 
tion (MA scale) bear no relationship 
to performance in the Stage III con- 
cept learning. Further, an analysis 
of variance of Stage I cue learning 
and MA score (within MS larger than 
between MS for df = 7/72) was not 
significant. 

For the third condition the analysis 
of variance indicates that continua- 
tion of the Stage I positive cue into 
Stage III facilitated performance in- 
dependent of MA scale level. Dif- 
ficulty in shifting from a successful 
(positively reinforced) cue to a new 
one while the old one is still present 
was no more characteristic of the 
high-MA Ss than of the low-MA Ss. 

With two cues required for concept 
attainment in Stage III it is of some 
interest to determine differential cue 
difficulty. Error scores were com- 
puted for the two cues in the Stage 
III task and an analysis of variance 
run on the irrelevant cue exposure 
condition and the type of cue (cross- 
hatched ground vs. L shaped figure). 
A significant F (F = 10.62, df = 1/156) 
emerged for the irrelevant cue condi- 
tion, confirming the analysis by trials, 
as well as for the type of cue (F = 8.88, 
df = 1/156). The cross-hatched cue 
covering the entire background of the 
large figure was clearly an easier cue. 
With no procedure available for 
scaling the difficulty of cues it seems 
reasonable to assume that this was a 
more novel cue generating higher 
attention value. 


TABLE 2 

ANALYSIS OF VARIANCE OF TRANSFORMED 
TRIALS TO CRITERION FOR CONCEPT 
LEARNING: STAGE III 

Source                               df      MS        F 
Anxiety (A)                           1     1.69 
Irrelevant cue exposure (IC)          1    86.15    14.49* 
Stage I positive cue present (PC)     1    52.14     8.78* 
A × IC                                1     6.06     1.02 
A × PC                                1      .04 
IC × PC                               1      .31 
A × IC × PC                           1    19.17     3.22 
Within                               72     5.95 

* P = .01. 
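
Each F in Table 2 is the mean square for an effect divided by the within-cells mean square. As a rough arithmetic check, the sketch below recomputes F for three rows of the table. The dictionary and variable names are illustrative, and small discrepancies in the second decimal are to be expected because the tabled mean squares are themselves rounded.

```python
MS_WITHIN = 5.95  # within-cells mean square, df = 72

# Mean squares for three effects as printed in Table 2 (each with df = 1).
MS_EFFECT = {
    "Irrelevant cue exposure (IC)": 86.15,
    "A x IC": 6.06,
    "A x IC x PC": 19.17,
}

# F ratio for each effect: effect mean square over within mean square.
f_ratio = {name: ms / MS_WITHIN for name, ms in MS_EFFECT.items()}

for name, f in f_ratio.items():
    print(f"{name}: F(1, 72) = {f:.2f}")
```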


DISCUSSION 


The failure of the motivational vari- 
able to predict differential acquisition 
and utilization of cues deserves little 
comment. Strictly speaking, these re- 
sults are applicable only to this task, 
although they raise further questions 
about the status of the MA scale as an 
independent selection device. 

While several studies (Bruner et al., 
1955; Jeeves & North, 1956; Peterson & 
Peterson, 1957; Weiss & Margolius, 
1954) have demonstrated positive trans- 
fer of context or irrelevant stimuli, the 
present results suggest additional factors 
that demand attention. Here, exposure 
to initially irrelevant cues functioned 
so as to inhibit rather than facilitate 
utilization of these cues when subse- 
quently they became the basis for correct 
discrimination. Berlyne (1958) and 
Dember (1960) have presented data 
showing the preference for, and attention 
value of, stimulus change and novelty. 
Luria (1957) and Sokolov (1954) have 
found that orienting reactions are in- 
duced by changes in the stimulus field 
that serve to increase the accessibility 
of the organism to deal with these 
changes. These orienting responses are 
maintained or extinguished as a function 
of the relevance of new or changed 
stimuli. Consequently, in the present 
study, despite the similarity of immediate 
negative reinforcement at the start of 
Stage III, the stimuli constituting the 
positive instances were novel and atten- 
tion-compelling for the control Ss while 
for the experimental group they repre- 
sented the familiar. Both the fact that 
53% of the control Ss reached criterion 
in the first 10 trials as opposed to only 
18% of the experimental group, and the 
greater ease of learning the cross-hatched 
ground cue are subject to the same 
interpretation. It might also be noted 
that the greater ease in learning the 
Stage III task where the positive Stage 
I cue was present as a negative instance 
fits well with this interpretation. This 
familiar cue constituted a clearly de- 
fined negative instance 50% of the time 
and thereby served to orient S to the 
other card which was now a positive 
instance. Had the primary cue been 
randomly associated with either positive 
or negative instance in Stage III it would 
not have had the effect of increasing 
novelty and, therefore, discriminability 
of the positive instance. 

Two further task variables are prob- 
ably crucial to the evaluation of the 
effects of the acquisition of initially 
irrelevant information. One is that 
positive transfer of previously irrelevant 
cues may occur (Babb, 1956; Bruner 
et al., 1955; Lawrence, 1949, 1950) only 
where the irrelevant stimuli are not 
competing in the same sensory field, e.g., 
a main visual task with irrelevant audi- 
tory cues. Different sensory channels 
would seem to reduce cue interference 
and the necessity of “adapting out” 
(Restle, 1955) cues. The second factor 
is that where the irrelevant cues are 
randomly present throughout a Stage I 
task, and then systematically related 
to the positive cue in Stage II only for 
the experimental group, then the novelty 
advantage of these cues for the control 
group in Stage III should be nullified. 


SUMMARY 


A three-stage concept-learning task was 
used to investigate the effects on subsequent 
cue utilization of prior exposure to irrelevant 
cues, differences in level of motivation 
(MA scale), and the shift of a previously 
positive cue to the status of a negative in- 
stance 50% of the time. The experiment 
was in the form of a 2 × 2 × 2 factorial 
design with 10 Ss per cell. 

The results showed no differences on 
any aspect of the concept-learning task as a 
function of level of motivation. The intro- 
duction of additional (but irrelevant) cues, 
after an initial discrimination has been 
learned, functions so as to inhibit rather than 
facilitate the utilization of these cues when 
they later become the basis for correct 
discrimination. A previously positive cue 
retained in a later learning task, where its 
reinforcement value is negative, facilitates 
the learning of a new discrimination. Results 
were discussed as suggesting an interpreta- 
tion in terms of stimulus novelty. 


REFERENCES 


Babb, H. Proportional reinforcement of 
irrelevant stimuli and transfer value. J. 
comp. physiol. Psychol., 1956, 49, 586-587. 

BERLYNE, D. E. The influence of complexity 
and change in visual figures on orienting 
responses. J. exp. Psychol., 1958, 55, 289- 
296. 

Blum, R. A., & Blum, J. S. Factual issues 
in the continuity controversy. Psychol. 
Rev., 1949, 56, 33-50. 

Bruner, J. S., MATTER, J., & PAPANEK, 
N. L. Breadth of learning as a function 
of drive level and mechanization. Psychol. 
Rev., 1955, 62, 1-10. 

CAMERON, N., & MAGARET, A. Behavior 
pathology. Boston: Houghton Mifflin, 1951. 

Dember, W. N. Psychology of perception. 
New York: Holt, 1960. 

EASTERBROOK, J. A. The effect of emotion 
on cue utilization and the organization 
of behavior. Psychol. Rev., 1959, 66, 
183-201. 

HERRING, F. H. An evaluation of published 
short forms of the Wechsler-Bellevue scale. 
J. consult. Psychol., 1952, 16, 119-123. 

JEEVES, A. A., & NORTH, A. J. Irrelevant 
or partially correlated stimuli in dis- 
crimination learning. J. exp. Psychol., 
1956, 52, 90-94. 

Kausler, D. H., & Trapp, E. P. Motivation 
and cue utilization in intentional and 
incidental learning. Psychol. Rev., 1960, 67, 
373-379. 

Lashley, K. S. An examination of the 
“continuity” theory as applied to dis- 
crimination. J. gen. Psychol., 1942, 26, 
241-265. 

Lawrence, D. H. Acquired distinctiveness 
of cues: I. Transfer between discrimina- 
tions on the basis of familiarity with the 
stimulus. J. exp. Psychol., 1949, 39, 
770-784. 

LAWRENCE, D. H. Acquired distinctiveness 
of cues: II. Selective association in a 
constant stimulus situation. J. exp. Psy- 
chol., 1950, 40, 175-188. 

Luria, A. R. The role of language in the 
formation of temporary connections. In 
B. Simon (Ed.), Psychology in the Soviet 
Union. Stanford: Stanford Univer. Press, 
1957. Pp. 115-129. 

PETERSON, L. R., & PeTerson, M. J. The 
role of context stimuli in verbal learning. 
J. exp. Psychol., 1957, 53, 102-105. 

Restle, F. A theory of discrimination learn- 
ing. Psychol. Rev., 1955, 62, 11-19. 

Sokolov, E. N. Higher nervous energy and 
the problem of perception. In Communica- 
tions to the XIV International Congress of 
Psychology, USSR, 1954. 

Sullivan, H. S. The interpersonal theory of 
psychiatry. New York: Norton, 1953. 

TAYLOR, J. A. The Taylor manifest anxiety 
scale and intelligence. J. abnorm. soc. 
Psychol., 1955, 51, 347-354. 

Tolman, E. C. Cognitive maps in rats and 
men. Psychol. Rev., 1948, 55, 189-208. 

Weiss, W., & Margolius, G. The effect 
of context stimuli on learning and retention. 
J. exp. Psychol., 1954, 48, 318-322. 


(Received June 12, 1961) 


Journal of Experimental Psychology 
1962, Vol. 64, No. 1 


EFFECT OF STIMULUS CONDITION AND REACTION 
TIME INFORMATION ON SPATIAL STIMULUS 
GENERALIZATION 1 


CHARLES Y. NAKAMURA AND JAQUES W. KASWAN 
University of California, Los Angeles 


Instructed voluntary response (VR) 
procedures for studying spatial gen- 
eralization (Brown, Bilodeau, & Bar- 
on, 1951) have recently received much 
attention. But as noted by Sherman 
and Knopf (1960) little has been done 
to define the variables that deter- 
mine error frequency, the measure 
of stimulus generalization (SG) in this 
procedure. Although there have been 
some attempts to generalize formula- 
tions derived from classical condition- 
ing situations to the VR situation, 
there is little agreement between 
results obtained from the two pro- 
cedures. One proposed reason for 
this is that conditioning studies of SG 
were in large part concerned with 
primary stimulus generalization, 
whereas the VR studies must give 
greater consideration to mediated 
generalization variables (Gibson, 1959; 
Mednick & Freedman, 1960). 


The concept of stimulus categorization 
appears to be a useful description of 
mediational processes involved in VR 
SG. Used this way categorization in- 
volves the assumption that if stimuli 
to which Ss are instructed to respond 
can be discriminated and conceptualized 
as belonging to a common class, this will 


1 This research was supported by Research 
Grant G-9589 of the National Science Founda- 
tion, and by grants from the Research Com- 
mittee of the University of California, Los 
Angeles. The authors wish to acknowledge 
the contributions of Marion Schulman, who 
assisted in the collection of the data, and 
Selma J. Brotsky, who assisted both in the 
collection and analysis of the data. Parts 
of this article were presented in a paper 
read at the 1961 American Psychological 
Association convention in New York City. 



facilitate categorization of these stimuli. 
Where categorization is so facilitated, 
generalization to other stimuli in the 
same continuum will be less likely to 
occur than when categorization is more 
difficult. The reasoning is similar to 
Wallach’s (1958) discussion of categori- 
zation as a determinant in SG-like be- 
havior, where he implied that ease of 
discrimination of stimuli is directly 
related to categorizability and inversely 
related to SG. A recent study by 
Evans (1961) provides direct evidence 
that spatial stimulus generalization is 
a function of whether Ss can perceive 
the relative spatial location of the train- 
ing and test stimuli. 

A major goal of the present study is to 
attempt the identification of some stimu- 
lus and response variables necessary 
to specify the role of categorization in 
VR spatial SG. A brief description of 
the experimental situation will assist 
in making the following discussion of 
hypotheses more explicit. The stimuli 
are a horizontal row of 11 lights at eye 
level on a circular panel presented under 
three different stimulus conditions. In 
Cond. 5-6-7, Ss are instructed to press 
a key only when Lights 5, 6, or 7 (posi- 
tive lights) come on. In Cond. 3-6-9, 
they are to respond only to Lights 3, 6, or 
9. In the third condition, essentially 
a control group, only the center Light 
6 is positive. 

Because of the spatial contiguity of 
Lights 5-6-7, Ss are expected to identify 
each as belonging to a group of three, 
thus facilitating categorization of these 
lights as positive. No such grouping 
is likely to be discerned in Lights 3-6-9 
so that categorization of these lights 
should be more difficult. Johnsgard 
(1957), in a study of stimulus-back- 
ground contrast, presented a similar 
explanation of SG effects. 

This arrangement of training and test 
stimuli also permits the comparison of 
error frequencies predicted by the cate- 
gorization hypothesis and those predicted 
by Hull’s gradient summation hypothesis 
based on the exponential summation 
of overlapping gradients of generalized 
habit strength. Bilodeau, Brown, and 
Meryman (1956) reported support for 
the latter hypothesis. Kalish and Gutt- 
man (1957), on the other hand, found 
only partial support for the Hullian 
hypothesis both in their review of the 
Bilodeau et al. (1956) results and in their 
own research. Under the conditions 
of the present experiment, the gradient 
summation hypothesis would predict 
approximately the same number of 
errors to Lights 4 and 8 in both Cond. 
5-6-7 and 3-6-9. (Lights 4 and 8 were 
selected for specific comparison of errors 
because in both stimulus conditions they 
are located ideally in the spatial con- 
tinuum where there are at each negative 
light three overlapping gradients gen- 
erated from the training lights.) If, 
however, spatial contiguity facilitates 
categorization, fewer errors to these two 
lights would be expected in 5-6-7 than 
in 3-6-9. Moreover, in accord with the 
categorization hypothesis that the three 
positive lights in 5-6-7 are conceptualized 
as belonging to a single class, both the 
total number of errors and the error 
gradients in 5-6-7 are expected to be 
similar to those obtained in Stimulus 
Condition 6 where only the single light 
Number 6 is positive. Such a prediction 
would not be made from the gradient 
summation hypothesis. 

Specific predictions of response la- 
tencies also follow from the categoriza- 
tion hypothesis. On the assumption 
that reaction time (RT) generally in- 
creases as a function of stimulus com- 
plexity (Flores, 1956; Grebb, 1954) and 
that categorization is presumably a 
determinant in accuracy of perception 
(Bruner, Goodnow, & Austin, 1956, p. 9), 
it would be expected that accurate 
response latencies should be shorter in 
5-6-7 than in 3-6-9. 

The above predictions of errors and 
latencies generated from the stimulus 
categorization hypothesis produce, as 
logical consequents, some specific pre- 
dictions of error-latency relations. If 
latencies reflect difficulty of categoriza- 
tion of positive lights, Ss who do not 
take the required time for accurate 
categorizing should make more errors 
than those who take adequate time. 
Accordingly, if Ss can be induced to 
reduce response latencies they should 
make more errors. Since there is some 
evidence that knowledge of performance 
may reduce RT (Bilodeau & Bilodeau, 
1961, p. 250), giving Ss information 
about their speed of reaction should 
reduce latency. Insofar as 5-6-7 laten- 
cies are expected to be close to simple 
RTs, further reductions in RT or in- 
crease in errors should be limited. In 
3-6-9, however, greater absolute reduc- 
tions in latencies are possible. Accord- 
ingly, greater amount of speed informa- 
tion should be associated with larger 
error differences and smaller latency 
differences between the two stimulus 
conditions, and these should be attribu- 
table to the speed information effect 
in Cond. 3-6-9. 


METHOD 


Apparatus.—The apparatus was a modi- 
fication of that used by Brown et al. (1951). 
The main component was a 180° semicircular 
panel on which was mounted a horizontal 
row of 11 lights separated by intervals of 4° 
of visual angle, at eye level. The essential 
modification was the use of 4° of visual angle 
with the lights at a distance of 30 in. from S's 
eyes instead of the 8° intervals at a distance 
of 60 in. employed by Brown et al. The 
lights, identified by numbering 1 through 11 
from left to right, were standard 6.3-v., .15- 
amp. miniature bayonet indicator lamps 
encased in milk-glass covers. A similar orange 
colored light located directly above the central 
light, Light 6, served as both fixation point 
and ready signal. The panel was 3 ft. high, 
painted flat black, covered over the top with 
black cloth, and placed in a semidark room 
to minimize extraneous visual cues. The S 
responded by pressing a telegraph key placed 
at his right hand (or left, if left handed). 
Trial intervals were regulated by an electric 
motor. The E, seated behind the panel and 
not visible to S, selected the stimulus light, 
randomly varied the foreperiod between 2, 3, 
and 4 sec. on a Hunter electric interval timer, 
and read the RTs to the nearest .01 sec. from 
a Hunter electric clock. 
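
The change in viewing geometry relative to Brown et al. can be expressed as the physical spacing between adjacent lights. The sketch below assumes that S's eyes are at the center of the arc on which the lights lie, so that spacing along the panel is simply distance times visual angle in radians. That assumption and the function name `arc_spacing` are illustrative and are not stated in the text.

```python
import math


def arc_spacing(distance_in: float, angle_deg: float) -> float:
    """Arc length in inches between adjacent lights at a given viewing distance."""
    return distance_in * math.radians(angle_deg)


present_study = arc_spacing(30, 4)  # 4 deg intervals at 30 in.
brown_et_al = arc_spacing(60, 8)    # 8 deg intervals at 60 in.
total_span_deg = (11 - 1) * 4       # 11 lights give 10 intervals

print(f"present study: {present_study:.2f} in. between adjacent lights")
print(f"Brown et al.:  {brown_et_al:.2f} in. between adjacent lights")
print(f"row of 11 lights spans {total_span_deg} deg of visual angle")
```

On these assumptions the lights of the present apparatus are about one quarter as far apart as those of the earlier apparatus.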

Procedure.—The basic 2 × 3 factorial de- 
sign involved six sets of Ss under two stimulus 
conditions and three RT information groups. 
The groups under Cond. 5-6-7 were instructed 
to respond to the lighting of either Lights 
5, 6, or 7 and those in Cond. 3-6-9 to either 
3, 6, or 9. All the groups were instructed not 
to respond when any of the other lights were 
lit. 

Within Cond. 5-6-7 and 3-6-9, Ss in the 
total information (TI) group were given 
information of speed of response after every 
trial and the partial information (PI) group 
after every fifth trial. The Ss were told their 
performance was “good” (short latency), 
“fair,” or “poor” (long latency) in equal 
proportion and E attempted to make this 
correspond with Ss’ actual variations in 
latencies. For the no information (NI) 
group, speed and accuracy were stressed 
equally and instructions to this effect were 
repeated after every 25 trials. All Ss were 
told when they made an error. 

After the sequence of ready signal, stimu- 
lus light, and S's response was demonstrated, 
S was given a series of 20 training trials 
distributed among the three positive lights 
in a predetermined random sequence. Trial 
intervals were 12 sec., beginning with the 
onset of the ready light. Following the 
training trials, S was told that now any of the 
lights might go on but he was to respond only 
to the positive lights (5, 6, 7, or 3, 6, 9, de- 
pending on the stimulus condition) and not 
to any other lights. The test series involved 
5 presentations of each of the negative lights, 
along with 40 booster trials of each of the 
three positive lights distributed in a pre- 
determined random order. 

In addition to Cond. 5-6-7 and 3-6-9 in 
the basic design, a third stimulus condition 
(Cond. 6) was included where Ss were in- 
structed to respond only to Light 6. This 
group was given total RT information similar 
to the other TI groups. The number of 
booster trials given Light 6 during the test 
series was equal to the sum of booster trials 
to the three positive lights in each of the other 
two stimulus conditions. 

Subjects.—The Ss were 130 students, both 
men and women, from a course in introduc- 
tory psychology. Within each of Cond. 5-6-7 
and 3-6-9, 20 Ss were randomly assigned to 
each PI and TI group and 15 to each NI speed
information group. There were 20 Ss in
Cond. 6. 
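
The group sizes above can be checked against the stated total of 130 Ss and against the error df of 104 reported in the analyses of variance. The following is a minimal sketch in Python; the dictionary layout and variable names are illustrative only, and the assumption that the 104 error df come from the six cells of the 2 × 3 design (with Cond. 6 excluded) is an inference from the text.

```python
# Cell sizes as stated in the Subjects paragraph.
cell_n = {
    ("5-6-7", "NI"): 15, ("5-6-7", "PI"): 20, ("5-6-7", "TI"): 20,
    ("3-6-9", "NI"): 15, ("3-6-9", "PI"): 20, ("3-6-9", "TI"): 20,
}
cond6_n = 20  # single-light group (Cond. 6), total information only

factorial_n = sum(cell_n.values())   # Ss in the 2 x 3 design
total_n = factorial_n + cond6_n      # all Ss in the experiment

# Error df for a between-Ss factorial: N minus the number of cells
# (assumed here to be how the reported 104 df arise).
error_df = factorial_n - len(cell_n)

print(total_n, error_df)
```
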
RESULTS 


Errors.—The mean percentages of 
errors to each of the negative lights 
are plotted for the different speed 
information conditions in Fig. 1 
for 5-6-7 and Fig. 2 for 3-6-9. The 
analysis of variance of total errors 
over all information groups showed 
that the stimulus condition effect 
was significant (F=38.19, df=1/104, 
P < .001) with fewer total errors in 
5-6-7 than in 3-6-9. The stimulus 
effect was also significant for errors 
to Lights 4 and 8 (F = 14.91, df = 1/104,
P < .001), with fewer errors in
5-6-7. This is in contrast to the 
nearly identical averages of the sum- 
mation hypothesis predictions to these 
two lights of 54% and 52% errors 


[Figure: per cent responses (vertical axis) by stimulus lamp number (horizontal axis); legend includes summation hypothesis predictions for TI group]

Fig. 1. Mean percentage of errors in
Cond. 5-6-7 to each negative (test) lamp, and 
percentages predicted by the summation 
hypothesis for the TI group. (The latter were 
obtained by erecting around each of Lights 
5, 6, and 7 the gradient obtained when only 
Light 6 was positive, and exponentially 
summating overlapping points. The two 
wings of the single Light 6, also shown, are 
displaced one spatial unit away from center 
so that there is correspondence of percentage 
of errors to each test light, for all gradients, on 
the dimension of distance from the nearest 
positive light.) 


70 CHARLES Y. NAKAMURA AND JAQUES W. KASWAN


expected for 5-6-7 and 3-6-9, respec- 
tively. (Lights 4 and 8 frequencies 
were combined in the analysis of 
variance since a chi square test of the 
difference between errors to these 
lights was not significant: χ² = .90,
df = 2, P > .10.) These results are 
consistent with the stimulus grouping 
hypothesis that the spatial contiguity 
of positive lights facilitated cate- 
gorization and led to the reduction 
of errors. Errors are clearly not 
attributable to a failure to dis- 
criminate lights as positive since Ss 
failed to respond appropriately to 
positive lights in only three trials. 
The hypothetical summation gradi- 
ents in Fig. 1 and 2 were plotted by 
erecting around each of Lights 5, 6, 
and 7 and around Lights 3, 6, and 9 
the gradient obtained when only 
Light 6 was positive. The values of 
the overlapping points, at each nega- 
tive light, were summated according 
to Hull's formula for exponential
summation of gradients (Hull, 1943, 
p. 200). The value 100 was sub- 
stituted for M in the formula where 
M is the physiological limit of the 
learning process. In this experiment, 
the maximum percentage of responses 
to the positive lights was 100%. 
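
The construction of the predicted gradients can be sketched as follows. The combination rule a + b − ab/M is the form usually given for Hull's summation formula and is assumed here; the source cites Hull (1943, p. 200) without reproducing it. The gradient values are hypothetical, not the Cond. 6 data, and all function names are illustrative.

```python
from functools import reduce

M = 100.0  # ceiling: maximum percentage of responses to a positive light


def hull_sum(a, b, m=M):
    """Combine two generalized tendencies as a + b - a*b/m (assumed form)."""
    return a + b - a * b / m


def predicted_errors(single_gradient, positives, lights=range(1, 12), m=M):
    """Erect the single-light gradient around each positive light and summate.

    single_gradient maps distance (in lights) from a positive light to the
    percentage of responses at that distance when only one light was
    positive; distances not listed contribute 0.
    """
    predictions = {}
    for light in lights:
        if light in positives:
            continue  # predictions are needed only for negative (test) lights
        contributions = [single_gradient.get(abs(light - p), 0.0) for p in positives]
        predictions[light] = reduce(lambda a, b: hull_sum(a, b, m), contributions, 0.0)
    return predictions


# Hypothetical single-light gradient (NOT the values obtained in Cond. 6).
example_gradient = {1: 30.0, 2: 10.0, 3: 5.0}
pred_567 = predicted_errors(example_gradient, positives=(5, 6, 7))
pred_369 = predicted_errors(example_gradient, positives=(3, 6, 9))
```

Because the rule subtracts ab/M, the combined value can approach but never exceed M, which is why the maximum response percentage of 100 serves as the ceiling.
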


[Figure: per cent responses (vertical axis) by stimulus lamp number (horizontal axis); legend includes summation hypothesis predictions for TI group]

Fig. 2. Mean percentage of errors in
Cond. 3-6-9 to each negative (test) lamp, and
percentages predicted by the summation
hypothesis for the TI group. (The latter
were computed as described for Fig. 1.)


[Figure: mean errors for Condition 3-6-9 and Condition 5-6-7 across no information (NI), partial information (PI), and total information (TI)]

Fig. 3. Mean number of errors to Lights 4
and 8 in two stimulus conditions and three 
speed information groups. 


Figure 1 shows that the error 
gradient of the TI group in Cond. 
5-6-7 closely approximated the gra- 
dient obtained in Cond. 6, also under 
TI. The mean difference between 
these sets of points was not significant 
(F=1.57, df=1/38), indicating that 
the number of stimuli to be cate- 
gorized did not affect generalization. 
Moreover, the large discrepancy be- 
tween summation hypothesis predic- 
tions and obtained error frequencies 
for the TI group in Cond. 5-6-7 indicates
that summation does not account
for the results obtained by this
VR procedure. The mean difference
between these two sets of points
was significant (F = 23.81, df = 1/8,
P < .01).² However, Fig. 2 suggests
that in Cond. 3-6-9 the results obtained
for the TI group approximated
fairly closely the predictions
of the summation hypothesis. The
nonsignificant F of 2.39 (df = 1/8)
for the difference between these
curves was consistent with this observation.

² In order to obtain scores for a variance
estimate of the hypothesized gradient,
necessary to the analyses of differences between
distributions, the summation procedure
was carried out separately for Cond. 5-6-7
and 3-6-9 by using the error scores, to the
relevant lights, made by each S in Cond. 6.
A statistical problem posed because several
Ss in Cond. 6 had zero error scores could not
be resolved by a transformation since the
subsequent scores were to be exponentially
summated. This was overcome by using
the mean scores of Ss randomly combined
into groups of 4, at the cost of reduction in df.

The curves in Fig. 3 plot mean
errors to Lights 4 and 8 as a function
of speed information for Cond. 5-6-7
and 3-6-9 and show that greater
amounts of speed information were
associated with higher occurrence of
errors (F = 7.39, df = 2/104, P < .01).
The information effect was also found
to be significant for total errors to all
negative lights (F = 12.15, df = 2/104,
P < .001). A further t test analysis of
errors to Lights 4 and 8 at each information
level showed that the stimulus condition
effect occurred only where information
was given (t = 2.56, df = 38,
P < .05, for the difference between
PI groups and t = 3.24, df = 38,
P < .01, for the TI groups). The
corresponding difference between the
NI groups was not significant.

Latencies.—All the latency com- 
putations were based on the recipro- 
cals of the latency scores (Edwards, 
1960, p. 131) and the weight of the 
statistical analyses was almost wholly 
on the latency scores to positive 
lights. The latter was necessary 
since the latencies for errors were not 
suited to the usual statistical tests 
of differences between means since 
mean latencies for Ss were com- 
puted from unequal numbers of 
scores because individual Ss made 
different numbers of errors to different 
lights. 
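
As an illustration of the reciprocal transformation, the sketch below converts latencies in seconds to speed scores (1/latency), averages them, and converts the mean back to seconds. The latencies are hypothetical, and the function names are assumptions made for this example.

```python
def reciprocal_scores(latencies_sec):
    """Convert latencies (sec.) to speed scores (reciprocals, per sec.)."""
    return [1.0 / t for t in latencies_sec]


def mean(values):
    """Arithmetic mean of a non-empty list."""
    return sum(values) / len(values)


# Hypothetical latencies for one S; not data from the study.
latencies = [0.40, 0.50, 0.80]

speed = reciprocal_scores(latencies)
mean_speed = mean(speed)
latency_from_mean_speed = 1.0 / mean_speed   # harmonic mean of the latencies
arithmetic_mean_latency = mean(latencies)
```

The latency recovered from the mean speed score is shorter than the arithmetic mean latency, because the reciprocal scale reduces the weight of occasional long latencies.
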

[Figure: mean latency to positive lights by speed information group (NI, PI, TI). Curves: Cond. 3-6-9, mean Lamps 3-9; Cond. 3-6-9, mean Lamps 3-6-9; Cond. 3-6-9, Lamp 6; Cond. 5-6-7, mean Lamps 5-7; Cond. 5-6-7, mean Lamps 5-6-7; Cond. 5-6-7, Lamp 6; single Lamp 6 cond., Lamp 6; single Lamp 6 cond., practice; Cond. 3-6-9, practice; Cond. 5-6-7, practice]

Fig. 4. Mean latencies to positive lights under different stimulus conditions
within different speed information groups.

[Figure: mean latency of error responses by stimulus lamp number for the NI, PI, and TI groups in Cond. 5-6-7 and Cond. 3-6-9]

Fig. 5. Mean latencies of error responses to negative (test) lights in Cond. 5-6-7 and 3-6-9.
(Mean latencies to positive lights are shown in boxes in the appropriate positions between
negative lights. The lowest box above Light 6 shows the combined mean latency of the three
information groups to positive lights under Cond. 5-6-7 since the three group means were
nearly identical.)

Figure 4 shows the marked effect of
stimulus condition on latencies to positive
lights at all levels of RT information.
The F of 148.55 (df = 1/104)
for the test between mean latencies 
to Lights 3, 6, and 9 and to Lights 
5, 6, and 7 of the test trials was signifi- 
cant (P < .001). Comparisons be- 
tween Cond. 5-6-7 and 3-6-9 at each 
level of information yielded t test
values significant beyond the .001
level. The t's were 8.94, 6.41, and
6.49 for comparisons between the
pairs of NI, PI, and TI groups,
respectively. Figure 4 also shows 
that the magnitude of the differences 
between stimulus conditions decreased 
with the giving of greater amounts 
of speed information and indicates
that the locus of this effect was in
Cond. 3-6-9. The apparent differen- 
tial effect of RT information on 
latency was confirmed by the significant
Stimulus × Information interaction
(F = 4.47, df = 2/104, P < .02).
Analyses by t tests of the differences
between NI vs. PI and NI vs. TI
within Cond. 3-6-9 yielded t's of 3.05
(P < .01) and 3.84 (P < .01), respectively,
whereas none of the t's
between information levels within 
Cond. 5-6-7 reached the .05 level of 
significance. 

It is seen in Fig. 4 that latency to
Light 6 in Cond. 3-6-9 was shorter 
than the mean latency to Lights 3 and 
9 (F = 46.84, df = 1/104, P < .001)
and similarly for the corresponding
difference between Light 6 and the 
mean for Lights 5 and 7 in Cond. 
5-6-7 (F=12.99, df=1/104, P<.001). 
It is probable that the shorter latency 
for Light 6 is attributable, in part, 
to the signal light above it which 
served as a discriminative cue. How- 
ever, it is unlikely that this alone 
could account for the main effects 
since the absolute mean latency to 
Light 6 differed among stimulus 
conditions. It was longest under 
Cond. 3-6-9, shorter in Cond. 5-6-7, 
and shortest in Cond. 6, as shown 
in Fig. 4. 

Comparison of the mean of the 
three mean latencies to positive lights 
for the TI group in Cond. 5-6-7, with 
the mean latency to Light 6 for Cond. 
6 yielded an F of 18.73 (df = 1/38, 
P < .001). This finding contradicted 
the assumption that latencies to the 
grouped stimuli should approximate 
those obtained when a single light 
was positive. Hence, though the 
error gradients did not differ for these 
two stimulus conditions, as noted in 
Fig. 1, the data suggest that cate- 
gorizing three contiguous stimuli as a 
class is in fact more difficult than 
distinguishing a single positive stimu- 
lus in a continuum of stimuli. 

Figure 5 plots error response laten- 
cies to the negative lights for the 
three RT information groups within 
each of Cond. 5-6-7 and 3-6-9. The 
upper part of Fig. 5 shows that in 
3-6-9, latencies for PI and TI groups 
tend to be shorter than those for the 
NI group. In contrast (lower part
of Fig. 5), there were no substantial
differences in latency gradients or in
latency to individual lights as a
function of speed information in
Cond. 5-6-7, and error latencies were
clearly much shorter than those in
Cond. 3-6-9 with practically no overlap.
Also, error latencies were consistently
much shorter than latencies
to adjoining positive lights for all
three information groups within Cond.
3-6-9 and the differences were slight
but in the same direction for Cond.
5-6-7.

Error-latency interaction.—Comparison
of latencies to positive lights
in Fig. 4 with errors for Cond. 5-6-7 
and 3-6-9 in Fig. 1 and Fig. 2, re- 
spectively, shows that although la- 
tency differences between the two 
conditions decreased with greater RT 
information, differences in error fre- 
quency generally increased, largely 
because of the greater effect of speed 
information in 3-6-9 than in 5-6-7. 
Wherever RT information reduced 
latencies to positive lights, as in 3-6-9, 
error frequency increased. Where RT 
information had no substantial effect 
on such latencies, as in 5-6-7, there 
was no significant difference in error 
frequency between the information 
groups within that stimulus condition. 

Further evidence of the interaction 
of errors and latencies was observed 
in that all except 1 of the 15 Ss in the 
NI group of Cond. 3-6-9 obtained 
mean latencies to positive lights of 
.5 sec. or greater, whereas 19 of the
40 Ss given RT information (PI and 
TI groups) in Cond. 3-6-9 had mean 
latencies shorter than .5 sec. Since 
the NI group obtained significantly 
fewer errors than the PI and TI groups 
in 3-6-9, the .5-sec. latency may be 
conceived as an optimum time for 
categorization of stimulus lights into 
positive and negative classes. Ac- 
cordingly, Ss responding faster than 
that should make more errors than Ss 
with latencies of .5 sec. or greater. 
When this comparison was made 
within TI and PI groups of 3-6-9, an 
F of 19.85 (df = 1/38, P < .001) was 
obtained. Moreover, comparison of 
error frequency of Ss above and below 
the median latency of .58 sec. within
the NI group in Cond. 3-6-9 also
yielded a significant result (t = 2.26,
df = 28, P < .05) for further indica- 
tion of the relation of shorter latency 
with greater errors. Similarly for 
Cond. 5-6-7, though neither error 
frequency nor latency had differed 
as a function of RT information, fre- 
quency of errors between Ss above 
and below the median latency of .4 
sec., for the three RT information 
groups combined, was significantly 
different (F = 4.19, df = 1/53, P < .05).

A third source of evidence for
error-latency interaction was the negative
Pearson product-moment r's between
frequency and latency of errors.
For Cond. 5-6-7, the r's for
each group were NI = −.51, PI = −.41,
and TI = −.50. Under Cond. 3-6-9,
they were NI = −.48, PI = −.81,
and TI = −.71. All except the r
of −.41 were significant at the .05
level or better. As expected, errors 
and latencies were most substantially 
correlated when RT information was 
given in Cond. 3-6-9, where stimulus 
categorization was presumably most 
difficult. However, the only significant
difference (P < .05) between the
correlations for Cond. 5-6-7 and
3-6-9 within any information level
was that between the −.81 and −.41
for the PI groups. 
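
The text does not say which test was used to compare the correlations. One common choice is Fisher's r-to-z transformation for two independent correlations, sketched below on the reported PI values (−.81 and −.41, with 20 Ss per group). The function names are illustrative, and applying this particular test is an assumption, not a statement of the authors' method.

```python
import math


def fisher_z_difference(r1, n1, r2, n2):
    """z statistic for the difference between two independent Pearson r's."""
    z1, z2 = math.atanh(r1), math.atanh(r2)          # Fisher r-to-z transform
    se = math.sqrt(1.0 / (n1 - 3) + 1.0 / (n2 - 3))  # standard error of z1 - z2
    return (z1 - z2) / se


def two_tailed_p(z):
    """Two-tailed probability of |z| under the standard normal distribution."""
    return math.erfc(abs(z) / math.sqrt(2.0))


# PI groups: Cond. 3-6-9 (r = -.81) vs. Cond. 5-6-7 (r = -.41), 20 Ss each.
z_pi = fisher_z_difference(-0.81, 20, -0.41, 20)
p_pi = two_tailed_p(z_pi)
```

Under these assumptions z comes out a little beyond −1.96, which is in line with the reported P < .05 for the PI groups.
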

When percentages of errors in 
Fig. 1 and 2 are compared with error 
latencies in Fig. 5, the direction of
changes in magnitude from light to
light is notably parallel. Though
this is particularly true of Cond. 3-6-9,
it is also seen in Cond. 5-6-7; where
decreases in error are greatest, as from
Lights 4 to 3 and from Lights 8 to 9,
error latencies show a corresponding
decrease in five of the six instances.
While latency gradients beyond these
points are somewhat irregular, variations
are within a narrow range,
suggesting an essentially horizontal
distribution consistent with the flat
error frequency gradients. Thus the 
results suggest that error frequency 
and error latency gradients are posi- 
tively related. 


DISCUSSION 


The assumption that stimulus condi- 
tions facilitate the categorization of lights 
as positive or negative was consistently 
supported. It seems, however, that the 
stimulus contiguity hypothesis does not 
entirely account for this facilitation. 
The persistent sharp drop in errors and 
latencies to lights adjoining Light 6 in 
Cond. 3-6-9 and the shorter latency to 
Light 6, compared to other positive 
lights, suggest that the central position 
of this light and (or) the signal light 
located above it served as discriminative 
cues which facilitated categorization 
of the adjoining lights as negative stimuli. 
It is possible that the central location 
of the positive lights and the signal light 
also played a role in Cond. 5-6-7, al- 
though the first negative light is two 
lights removed from the signal light. 
Occurrence of such effects had not been 
expected since Bilodeau et al. (1956), 
using a similar apparatus, found that 
error gradients and frequencies to single 
positive lights located 16° of visual angle 
from center were virtually identical with 
those obtained to a central positive 
light. Though this finding clouds the 
interpretation of results in terms of a 
stimulus pattern effect, the influence of 
the latter is apparent in the fact that 
latencies to Light 6 itself decrease from 
Cond. 3-6-9 to 5-6-7 to Cond. 6, sug- 
gesting an effect of stimulus arrange- 
ment on speed of categorization. 

Specification of stimulus arrangement 
and stimulus position as determinants 
of SG appears of special interest because
this involves the problem of specifying
the role of stimulus units in SG. In their
review, Mednick and Freedman (1960) 
indicate that most SG studies follow 
an empirical formulation which assumes 
that SG is a decreasing function of 
increasing physical difference between
test and training stimuli. Hull (1943,
p. 198) explicitly recognized the unten- 
ability of postulating such arbitrary 
simple physical stimulus units and pro- 
posed the jnd as the stimulus unit for 
establishing SG gradients. The present 
results re-emphasize that equal physical 
distance units from different positive 
lights do not correspond to equal decre- 
ments in SG in the VR paradigm. This 
was shown most explicitly by the com- 
parison of the error frequencies obtained in 
Cond. 5-6-7 with the higher frequencies 
of Cond. 3-6-9, and between the obtained 
curves and the summation hypothesis 
predictions in each of these conditions. 
It was further demonstrated by the fact 
that error frequencies to lights adjoining 
Light 6 in Cond. 3-6-9 were much lower 
than errors to lights surrounding Light 
3 and Light 9, although the difference 
in physical distance was identical. The 
results indicate the advantage of a 
stimulus unit or scale that could take 
into account the specification of the 
relation among stimuli. The proposal 
of an ordinal scale of stimulus values by 
Mednick and Freedman (1960) is an 
attempt in this direction. However, 
their approach would predict sharper 
gradients over the negative lights in 
Cond. 5-6-7 than obtained in the present 
study. Since Lights 4 and 8 are sepa- 
rated from the training stimuli by only 
one ordinal unit, Mednick and Freedman 
would expect higher percentages of 
errors on them, relative to the more 
peripheral lights, because their formula- 
tion does not take into account the effect 
of contiguity of the training lights. 

The overall results of this study show 
a close relation between error and 
latency. The negative error-latency 
relation wherein Ss who took less time 
before responding to training stimuli 
generally had greater error frequencies 
indicates that errors were due largely 
to insufficient time taken (for accurate 
categorization) before responding. An- 
other aspect of the close relation between 
categorization and latency is observable 
in the positive relation between error 
frequency and error latency which is 
also found in most other VR SG studies
(Gibson, 1939; Mednick, 1958; Rosenbaum,
1953).

The manifold role of latency in the
results of this study makes it reasonable
to suggest that latency may serve to 
specify both stimulus and response 
anchors of the term “categorization” 
used here to describe the verbal and other 
mediated processes which are undoubt- 
edly important determinants of SG in 
any VR paradigm. Stimulus referents
of categorization can be specified by 
noting that the time taken for responses 
varied with the arrangement or position 
of lights, as in the different latencies for 
Cond. 5-6-7 and 3-6-9. Response refer- 
ents of categorization can be specified 
by noting that latency was a major 
determinant of error frequency under 
given stimulus arrangement conditions, 
as in the effect of speed information in 
Cond. 3-6-9. Thus, difficulty or ease of 
categorization is measurable in terms 
of latency required to make a correct 
response, under specified conditions such 
as, in this study, relative emphasis on 
speed induced by providing reaction 
time information.


SUMMARY 


This study examined the effects of stimulus 
arrangement and experimentally induced 
differences in response latency on perform- 
ance in a spatial stimulus generalization 
situation involving voluntary responses. In 
Cond. 5-6-7, Ss were instructed to respond
to any one of three centrally and contiguously 
located lights (Lights 5, 6, and 7) of 11 lights 
in a row, but to inhibit responses to all 
other lights. In Cond. 3-6-9, the positive 
lights were noncontiguous (Lights 3, 6, and 9). 
The results showed that error frequency and 
latency were smaller in Cond. 5-6-7 than in 
Cond. 3-6-9. It was also found that giving 
Ss information on speed of responses reduced 
latencies and increased errors in Cond. 3-6-9 
but had practically no effect in Cond. 5-6-7. 
These findings were consistent with predic- 
tions of stimulus generalization generated 
from a stimulus categorization hypothesis. 
Predictions of error frequency by this hy- 
pothesis were compared with those made from 
Hull's gradient summation hypothesis of 
stimulus generalization. The latter failed 
to predict any of the obtained points on the 
gradient for Cond. 5-6-7, but did approximate
most of the points obtained in Cond. 3-6-9,
under conditions where Ss were given maxi- 
mum speed information. The close and 
consistent relationship found between error 
frequency and latency was interpreted as 
indicating that latency can be used to specify 
both stimulus and response anchors of the 
mediational process in spatial VR SG. 


REFERENCES 


Bilodeau, E. A., & Bilodeau, I. McD.
Motor-skills learning. Annu. Rev. Psychol.,
1961, 12, 243-280.

Bilodeau, E. A., Brown, J. S., & Meryman,
J. J. The summation of generalized reactive
tendencies. J. exp. Psychol., 1956, 51,
293-298.

Brown, J. S., Bilodeau, E. A., & Baron,
M. R. Bidirectional gradients in the
strength of a generalized voluntary response
to stimuli on a visual-spatial dimension.
J. exp. Psychol., 1951, 41, 52-61.

Bruner, J. S., Goodnow, J. J., & Austin,
G. A. A study of thinking. New York:
Wiley, 1956.

Edwards, A. L. Experimental design in
psychological research. New York: Rinehart,
1960.

Evans, W. O. Two factors affecting stimulus
generalization on a spatial dimension.
J. exp. Psychol., 1961, 61, 142-149.

Flores, I. The effect of organization upon
complex reaction time. J. Psychol., 1956,
41, 301-313.

Gibson, E. J. Sensory generalization with
voluntary reactions. J. exp. Psychol., 1939,
24, 237-253.

Gibson, E. J. A re-examination of generalization.
Psychol. Rev., 1959, 66, 340-342.

Gregg, L. W. The effect of stimulus complexity
on discriminative responses. J.
exp. Psychol., 1954, 48, 289-297.

Hull, C. L. Principles of behavior. New
York: Appleton-Century, 1943.

Johnsgard, K. W. The role of contrast
in stimulus intensity dynamism (V).
J. exp. Psychol., 1957, 53, 173-179.

Kalish, H. I., & Guttman, N. Stimulus
generalization after equal training on two
stimuli. J. exp. Psychol., 1957, 53, 139-144.

Mednick, S. A. Gradients of latency in a
generalized voluntary response. Amer. J.
Psychol., 1958, 71, 752-755.

Mednick, S. A., & Freedman, J. L. Stimulus
generalization. Psychol. Bull., 1960,
57, 169-200.

Rosenbaum, G. Stimulus generalization
as a function of level of experimentally
induced anxiety. J. exp. Psychol., 1953, 45,
35-43.

Sherman, J., & Knorr, I. J. Changes in the
gradient of stimulus generalization as a
function of two procedural variations.
Psychol. Rep., 1960, 7, 253-258.

Wallach, M. A. On psychological similarity.
Psychol. Rev., 1958, 65, 103-116.


(Received June 27, 1961) 




Journal of Experimental Psychology
1962, Vol. 64, No. 1, 77-80


STIMULUS GENERALIZATION AS A FUNCTION OF 
THE FRAME OF REFERENCE ¹


DAVID R. THOMAS AND CHARLES G. JONES
Kent State University 


In a study reported by Philip (1952) 
it was shown that the location of the 
original stimulus within a generaliza- 
tion test series modifies the shape 
of the obtained gradient. Philip re- 
quired his Ss to rank cards containing 
varying proportions of green and 
blue dots along a greenness-blueness 
scale. The frequency with which 
a given rank, say Number 3, was 
subjectively attributed to the different 
cards constituted a gradient of gen- 
eralization for that value. A separate 
gradient was thus generated for each 
rank employed. 


Philip systematically varied the length 
of the generalization test series. With the 
shortest series employed (six values) there was 
a tendency for judgments to accumulate near 
the center of the scale, the “central tendency 
effect” (Hollingworth, 1909). This effect 
was reflected in asymmetrical gradients 
around stimulus values which were non- 
centrally placed in the series of generalization 
test stimuli. 


Because of the unusual nature of 
Philip’s (1952) procedure, the signifi- 
cance of his finding with regard to 
generalization as studied by other 
methods may be questioned. The 
purpose of the present study was to 
assess the generality of Philip’s find- 
ing, using a method for obtaining 
generalization gradients developed by 
Kalish (1958). The Kalish procedure 
is more typical of generalization 
studies in that Ss are first exposed to a 
single stimulus value and are sub- 
sequently tested for their ability to 
select the original from a randomly
presented series of stimuli. This is in
contrast to the Philip procedure in
which absolute judgments of predominating
color are made without
previous exposure to some standard
stimulus value. It was reasoned that
if the “central tendency effect” were
shown to distort measures involving
retention as well as absolute judgment,
its relevance for studies of
generalization would be more convincingly
demonstrated.

¹ This research was supported in part by
National Institutes of Health Grant M-4203.


METHOD 


Subjects.—The Ss were 50 undergraduate
men taken from introductory psychology
courses at Kent State University. All Ss
had normal color vision, as determined with
the Dvorine (1944) color perception test.

Apparatus.—The study employed a Skin- 
ner-type key pecking apparatus, modified for 
use with human Ss. The box was approxi- 
mately 15 in. long, 11 in. high, and 14 in. 
wide and was painted flat black. The front 
wall of the box was made of transparent 
Plexiglas so that Ss could clearly view the 
pecking key. The S sat in a chair approximately
2½ ft. from the key, which was a
circular plastic disc 1 in. in diameter. Illumination
was provided by a Cambridge Thermionic
Corporation monochromator, Model
B, Series 1066, equipped with an Olympus, 
Model 201250 6-v., 5-amp. light source. The 
patch of color on the key was approximately 
4 mL. in luminance. The only other light 
in the room was a 7.5-w. “night light” on E's 
side of a black cloth screen separating S 
from E. 

A telegraph key was used to measure S’s 
responses. The telegraph key was placed 
on the table next to the Skinner box, within 
easy reach of S's right hand. It was wired 
so that its release would illuminate a signal 
light on E’s side of the screen, thus signifying 
a response. The box was equipped with 
an electrically operated shutter which inter- 
rupted the monochromator beam when E 
threw a switch.


[Figure: responses (vertical axis) by wave length in mµ, 485 to 565 (horizontal axis); the original stimulus is marked]

Fig. 1. Generalization gradients of the
five experimental groups. (The gradient of
Group 1 is at the top of the figure with that
of Group 2 directly below it, etc. Note that
the value of the original stimulus is the same 
for all groups.) 


Procedure.—The Ss were divided unsystematically
into five groups of 10 Ss each.
Each S received the same instructions. They
were as follows: 


This is an experiment in color perception.
At the beginning of the experiment a
specific color will be presented through the
small hole in front of you. Try to keep this
color in mind because you will be asked to
identify it later. After 1 minute this
color will be turned off and you will place
your finger and press down on the telegraph
key in front of you. I will give the
signal “ready” and a few seconds later a
color will again be presented. You must
decide whether this is the original color
shown you at the start of the experiment.
If it is, lift your hand as rapidly as you can
from the key. If it is not, keep pressing
the key.

I will say the word “ready” whenever I
am about to present a color and you should
be pressing the key at that time. We are
going to try some practice trials. Now we
are going to run through a series exactly
as we would do if this were the real experiment.


At this point the instructions were interrupted
and a stimulus of 600 mµ was presented
for 1 min. Then S was presented with 610
mµ, 590 mµ, 600 mµ, 620 mµ, and 580 mµ.
Each test stimulus was presented for 5 sec.
with 5 sec. intervening between presentations.
If S appeared to have understood
the instructions, they were continued as
follows:


Now we are going to begin the experiment.
Remember, try to keep the original
color in mind and respond as rapidly as you
can, lifting your finger only when the
original color appears. Do not be disturbed,
however, if you should respond to
other colors.


The Ss in all five groups were presented
with the same original stimulus, 525 mµ
(a middle-green), for 60 sec. The groups
differed only with regard to the series of
stimuli employed in testing for generalization.
In Group 1 the test stimuli were 485 mµ
through 525 mµ, in 10-mµ steps. For Group
2 the range covered was 495 mµ-535 mµ, for
Group 3, 505 mµ-545 mµ, for Group 4,
515 mµ-555 mµ, and for Group 5,
525 mµ-565 mµ. For each S the five test stimuli were
randomized within a series and 12 different
series were presented. The number of responses
made to the different test stimuli
constituted a generalization gradient. 


RESULTS AND DISCUSSION


In Fig. 1 are presented the generali- 
zation gradients of the five groups 
of Ss. It should be remembered that 
all groups were exposed to the same 
standard stimulus. In spite of this, 
the five gradients differ strikingly 
in a manner consistent with the findings
of Philip (1952). The tendency
to respond to stimuli closer to the
center of the test series is so strong
that the peak of the generalization
gradient tends to be displaced from
the value of the original stimulus.
Thus, a change of, for example, 10 mµ
from the original stimulus does not 
produce a fixed generalization decre- 
ment, but may result in no change in 
response strength or even in an incre- 
ment depending on the location within 
the range of test stimuli. It should 
be noted that only with Group 3, 
where the test stimuli were symmetri- 
cally distributed around 525 ma, did 
the peak of the gradient fall clearly 
at that value! 

The difference in gradient shape was
shown to be statistically reliable in
the following manner: A simple analy-
sis of variance was performed to test
for differences in the mean number
of responses given to 515 mμ in Groups
1, 2, 3, and 4. The result was an F
of 9.06 (df = 3/36, P < .01). A
parallel analysis was performed for
mean responses to 535 mμ in Groups
2, 3, 4, and 5. The result was an F
of 8.41 (df = 3/36, P < .01).
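The degrees of freedom reported here follow from comparing four groups of 10 Ss each: 4 - 1 = 3 between groups and 40 - 4 = 36 within groups. A minimal Python sketch of a simple (one-way) analysis of variance is given below. The function name is illustrative, and the demonstration scores are hypothetical placeholders, not the study's data.

```python
def one_way_anova(groups):
    """Return (F, df_between, df_within) for a list of groups of scores."""
    k = len(groups)
    n_total = sum(len(g) for g in groups)
    grand_mean = sum(sum(g) for g in groups) / n_total
    # Between-groups sum of squares: group means around the grand mean.
    ss_between = sum(len(g) * (sum(g) / len(g) - grand_mean) ** 2 for g in groups)
    # Within-groups sum of squares: scores around their own group mean.
    ss_within = sum(sum((x - sum(g) / len(g)) ** 2 for x in g) for g in groups)
    df_between = k - 1
    df_within = n_total - k
    f_ratio = (ss_between / df_between) / (ss_within / df_within)
    return f_ratio, df_between, df_within

# Hypothetical response counts (NOT the study's data): 4 groups of 10 Ss.
demo = [[(i + g) % 5 + 2 * g for i in range(10)] for g in range(4)]
f_ratio, df_b, df_w = one_way_anova(demo)
print(df_b, df_w)  # 3 36
```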


The purpose of this study was to 
determine whether the effect of “cen- 
tral tendency” on the generalization 
gradient is peculiar to the absolute 
judgment situation employed by Philip. 
It is safe to conclude that it is not. 
Indeed, the distortion of the gradient 
proved even greater in the present experi- 
mental situation than under the condi- 
tions of Philip’s experiment. 

These findings may be interpreted 
with reference to Helson's (1947) theory 
of adaptation level. It may be argued 
that the series of test stimuli provides 
a frame of reference against which the 
memory trace of the original stimulus is 
judged. When test stimuli are presented 
which fall asymmetrically around the 
original stimulus, a change in the frame 
of reference may be assumed to result, 
culminating in a heightened tendency 
to respond to stimuli nearer to the center 
of the test series, thereby distorting the 
resulting generalization gradient. 

The significance of the “central tend-
ency effect” for studies of generalization
employing conditioning techniques re-
mains an open question. We would
guess, however, that the background
of test stimuli would exert far less influ-
ence in that situation than in the present
one. In this study, S's experience with
the original stimulus was limited to one
60-sec. exposure. The resulting limited
familiarity with the stimulus makes
extensive retroactive interference by
the test series which follows more likely.
In a conditioning situation, however,
S has the opportunity through repeated 
exposure to become much more familiar 
with the value of the CS. Greater 
familiarity with the stimulus should 
reduce the effect of the test situation 
which is later employed. Some direct 
evidence on this issue has been reported. 
Guttman (1956) discussed some pilot 
work with pigeons and an operant condi- 
tioning technique in which the asym- 
metry of the distribution of test stimuli 
appeared to have no effect on the re- 
sulting generalization gradient. More 
recent work from the Kent State labora- 
tory has tended to corroborate his find- 
ing. Of course the species-related and 
procedure-related differences between the 
pigeon and human studies are so great 
as to make these findings merely sug- 
gestive. Research is needed to assess 
the role of familiarity with the CS in 
determining the influence of the “central
tendency effect” on generalization.


SUMMARY 


Five groups of 10 Ss each viewed a mono-
chromatic light of 525 mμ (a middle-green) for
60 sec., and then were presented 12 different
random series of wave lengths under instruc-
tions to respond only to the original color.
The number of responses made to the different
test stimuli constituted a gradient of generali-
zation. Group 1 was tested with the series
485-525 mμ, in 10-mμ steps; Group 2, 495-
535 mμ; Group 3, 505-545 mμ; Group 4,
515-555 mμ; and Group 5, 525-565 mμ.
Only Group 3, with a central value of 525 mμ,
produced a generalization gradient with a
definite peak at 525 mμ; in all other cases the
peak of responding was displaced toward the
center of the series of test stimuli. The extent


80 DAVID R. THOMAS AND CHARLES G. JONES 


of this displacement varied directly with the 
degree of asymmetry of the test series around 
the value of the original stimulus. These 
results support the assumption that the gen-
eralization test series serves as a frame of
reference against which the memory trace of 
the original stimulus is judged. 


REFERENCES 


Dvorine, I. Dvorine color perception testing
charts. Baltimore, Md.: Waverly, 1944.

Guttman, N. The pigeon and the spectrum
and other perplexities. Psychol. Rep.,
1956, 2, 449-460.

Helson, H. Adaptation level as a frame of
reference for prediction of psychophysical
data. Amer. J. Psychol., 1947, 60, 1-29.

Hollingworth, H. L. The inaccuracy of
movement. Arch. Psychol., N. Y., 1909, 2
(Whole No. 13).

Kalish, H. I. The relationship between
discriminability and generalization: A
re-evaluation. J. exp. Psychol., 1958, 55,
637-644.

Philip, B. R. Effect of length of series upon
generalization and central tendency in the
discrimination of a series of stimuli.
Canad. J. Psychol., 1952, 6, 173-178.


(Received June 28, 1961) 


Journal of Experimental Psychology
1962, Vol. 64, No. 1, 81-86


EFFECT OF REWARD MAGNITUDE, PERCENTAGE OF REIN-
FORCEMENT, AND TRAINING METHOD
ON ACQUISITION AND REVERSAL
IN A T MAZE¹


WINFRED F. HILL 
Northwestern University 
JOHN W. COTTON 
University of California, Santa Barbara 
AND KEITH N. CLAYTON 
Vanderbilt University 


The effect of reward magnitude on 
extinction is an unresolved question. 
Zeaman (1949) and Metzger, Cotton, 
and Lewis (1957) have found faster 
running early in extinction for animals 
receiving the larger acquisition re- 
ward, but convergence of the curves 
as extinction progressed. This is 
consistent with the assumption (Hull, 
1952; Spence, 1956) that the incentive 
motivation factor (K) adjusts rapidly 
but not immediately to a change in 
reward magnitude. On the other 
hand, Hulse (1958) and Armus (1959) 
have found faster running over a 
number of extinction trials for animals 
receiving the smaller reward. This 
might be predicted from the depres- 
sion effect found in some studies of 
magnitude change during acquisition 
(Spence, 1956), since no reward is a 
greater reduction from large reward 
than from small, but the generality 
of this effect is open to question. All 
of the above findings were with 100% 
reinforcement in acquisition. Hulse’s 
study further complicated the picture 
by showing an interaction between 
magnitude and percentage of reward, 
with large reward producing greater 


1 This study was carried out at North- 
western University and was supported by 
Grant G8706 from the National Science 
Foundation. 


resistance to extinction than small 
if rewards were intermittent during 
acquisition. 

The present study translates the 
question of the effect of reward mag- 
nitude on extinction into a special case 
of extinction—discrimination reversal. 
In particular, it tests whether the 
interaction between magnitude and 
percentage of reward found by Hulse 
in straight-alley extinction will also 
appear in reversal of a T maze habit. 

While making this test, it is possible 
in the same experiment to study the 
effect of reward magnitude on acquisi- 
tion. A number of prior experiments 
suggest that larger rewards lead to 
faster learning of a discrimination if 
the noncorrection method is used but 
not if the correction method is used, 
either in a single-choice situation or 
in a multiple-unit maze. McKelvey 
(1956) investigated this apparent 
interaction of magnitude and method 
directly and found no effect of magni- 
tude on correct choices with either 
method. However, he manipulated 
magnitude by varying the time that 
rats were permitted to eat rather than 
the amount of food available. Since 
rats learn to speed up their eating 
over a period of deprivation, it is 
possible that his “large reward” and 
“small reward” groups may have
been eating nearly equal amounts at
the end of acquisition. This possi- 
bility is strengthened by the fact 
that an initial difference in speed 
between the two groups disappeared 
by the end of acquisition. The 
acquisition phase of the present 
experiment incorporates the same 
basic design as McKelvey’s, but with 
reward magnitude defined by quan- 
tity rather than by eating time. 


METHOD 


Subjects.—The Ss were 96 female albino
rats of the Sprague-Dawley strain purchased
from Holtzman Rat Company, Madison, 
Wisconsin. Eight additional Ss were dis- 
carded because of failure to run, refusal to eat 
the reward pellets, or E's error. The Ss were 
between 72 and 86 days old at the beginning 
of training. 

Apparatus.—The Ss were trained in an
enclosed, single-unit T maze. This consisted
of a start box 10 in. long and 6 in. wide, a
stem 48 in. long and 4 in. wide, arms 7 in.
long and 4 in. wide, and goal boxes 13 in. 
long and 6 in. wide. An alcove for food 
opened off each goal box in such a way that 
the reward was not visible from the entrance 
to the goal box. The entire maze was 8} in. 
high, with sides and floor of plywood and lids
of Plexiglas. The inside was painted flat
black. A guillotine door separated the start 
box from the stem, and sliding doors sepa- 
rated the stem from the arms. A Standard 
Electric timer was started by S's weight on a 
treadle just beyond the start box and stopped 
by S's weight on either of two treadles just 
beyond the guillotine doors. 

Design.—All Ss received 48 acquisition
trials followed by 24 reversal trials. The
design was a factorial combination of four
dichotomous variables: (a) percentage rein-
forcement of the correct side during acquisi-
tion (100% vs. 50% random), (b) magnitude
of reward during acquisition (four pellets
vs. one), (c) acquisition training method
(correction vs. forced-trial noncorrection),
and (d) magnitude of reward during reversal.
Thus there were 8 treatment cells of 12 Ss 
in acquisition and 16 cells of 6 Ss in reversal. 
Half of the Ss in each cell were initially trained 
to the right side, half to the left. 
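The cell counts stated above can be checked by enumerating the factorial design. The following is a minimal Python sketch; the dictionary keys and level labels are illustrative names, not terms from the original report.

```python
from itertools import product

N_SUBJECTS = 96

# Three dichotomous acquisition variables and one reversal variable.
acquisition_factors = {
    "percentage": ("100%", "50%"),
    "acq_reward": ("4 pellets", "1 pellet"),
    "method": ("correction", "noncorrection"),
}
reversal_factor = {"rev_reward": ("4 pellets", "1 pellet")}

# Every combination of levels defines one treatment cell.
acq_cells = list(product(*acquisition_factors.values()))
rev_cells = list(product(*acquisition_factors.values(), *reversal_factor.values()))

per_acq_cell = N_SUBJECTS // len(acq_cells)
per_rev_cell = N_SUBJECTS // len(rev_cells)

# Within-cells degrees of freedom: subjects minus number of cells.
acq_error_df = N_SUBJECTS - len(acq_cells)
rev_error_df = N_SUBJECTS - len(rev_cells)

print(len(acq_cells), per_acq_cell, acq_error_df)
print(len(rev_cells), per_rev_cell, rev_error_df)
```

With 96 Ss this gives 8 cells of 12 in acquisition and 16 cells of 6 in reversal, leaving 88 and 80 within-cells degrees of freedom respectively.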

It was desirable to hold both number of
acquisition trials and proportional number
of acquisition reinforcements constant for
both magnitudes and both probabilities of 
reward. For noncorrection Ss, this required 
the use of forced trials. Hence, noncorrection 
Ss received a combination of 16 free and 32 
forced trials in acquisition, so arranged that 
half of their acquisition responses were to 
each side. A given ordinal-numbered trial 
was either free or forced for all noncorrection 
Ss, but each S in a given cell had a different 
sequence of forced right and left turns. 
All trials by the correction method were free. 
Reversal training for all Ss was by the non- 
correction method with all trials free and with 
100% reinforcement. 

The above procedure had the effect that a
correction S received twice as many rein-
forcements in acquisition as a noncorrection
S in the corresponding group. However, 
since method was of interest in connection 
with acquisition rather than with reversal, 
this difference was not crucial. Experiments 
on the role of incorrect responses in discrimi- 
nation learning suggest that this procedure 
should make correction and noncorrection 
groups more nearly comparable than if num- 
ber of reinforcements were held constant and 
number of incorrect responses allowed to vary. 
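The two-to-one ratio of reinforcements follows from the trial arrangement. A minimal Python sketch of the arithmetic is given below, under two assumptions drawn from the description above: every correction trial ends in the correct arm, and exactly half of a noncorrection S's 48 responses are to the correct side. The function name is illustrative.

```python
ACQ_TRIALS = 48  # acquisition trials per S

def expected_reinforcements(method, percent_reinforced):
    """Reinforcements over acquisition for one S.

    Assumes correction Ss reach the correct arm on every trial and
    noncorrection Ss respond to the correct side on half of all trials.
    """
    if method == "correction":
        correct_side_trials = ACQ_TRIALS
    elif method == "noncorrection":
        correct_side_trials = ACQ_TRIALS // 2
    else:
        raise ValueError(f"unknown method: {method}")
    return correct_side_trials * percent_reinforced // 100

for method in ("correction", "noncorrection"):
    for pct in (100, 50):
        print(method, pct, expected_reinforcements(method, pct))
```

Under these assumptions a 50% correction S and a 100% noncorrection S receive the same number of reinforcements.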

Procedure.—Seven days before the begin- 
ning of training, Ss were placed on a feeding 
schedule of 10 gm. of ground Purina chow a 
day, which continued throughout the experi- 
ment. On 6 of these 7 preliminary days, each 
S received 3 min. of handling and was allowed 
to eat four of the reward pellets. Water was 
always present in the living cages and the 
carrying cages throughout the experiment. 

In both acquisition and reversal, Ss were
given six trials a day. Thus there were 8
days of acquisition and 4 of reversal, succes-
sive except for 1 nontraining day between
Days 6 and 7 of acquisition. The reward
pellets were 45-mg. Noyes pellets. At the
start of each trial S was placed in the start
box. When S was oriented toward the door,
E opened it, permitting S to enter the stem.
In the noncorrection procedure, after S
entered either arm the sliding door was
closed and S was confined for approximately
15 sec. (If it was a forced trial, the door to
the other arm was already closed.) In the
correction procedure, if S entered the incorrect
arm the door remained open and S was per-
mitted to retrace, but when S entered the
correct arm the door was closed and S was
confined for 15 sec. In both conditions S
remained in a carrying cage for about 30 sec.
between trials. The Ss received their daily
ration about 4 hr. after training.


RESULTS 


Acquisition.—The number of cor-
rect choices was tabulated for each
S in acquisition. Since only the free 
trials were relevant for the noncor- 
rection Ss, only the corresponding 
ordinal-numbered trials were counted 
for the correction Ss. Thus each 
acquisition score represented the num- 
ber of correct turns out of a possible 
16. Three cases of failure to run 
(out of 1536 trials) were counted as 
errors. 

The effects of the three acquisition 
variables are shown in the three 
pairs of curves in Fig. 1, each curve 
based on 48 rats. A triple-classifica- 
tion analysis of variance, with 
df = 1/88 for each F ratio, indicated 
that all three main effects were 


[Figure 1. Vertical axis: probability of a correct response.]

Fig. 1. Acquisition for all Ss, classified
according to the three acquisition variables.


TABLE 1
NUMBER OF CORRECT CHOICES IN REVERSAL

% Reinf.   Acq. Reward   Method   Rev. Reward    Mean     SD
  100           1           C          1         14.83   6.58
  100           1           C          4         19.67   1.36
  100           1           N          1         10.17   1.94
  100           1           N          4         14.00   1.90
  100           4           C          1         17.50   2.35
  100           4           C          4         17.83   1.75
  100           4           N          1          8.50   1.87
  100           4           N          4         14.33   3.26
   50           1           C          1         17.17   4.02
   50           1           C          4         18.17   2.40
   50           1           N          1         12.17   3.97
   50           1           N          4         18.50   2.07
   50           4           C          1          7.67   4.08
   50           4           C          4         10.00   5.29
   50           4           N          1          7.83   5.34
   50           4           N          4

Note.—N = 6 each group.


significant. Acquisition was faster 
for four pellets than for one (F=6.71, 
P = .05), for 100% reinforcement 
than for 50% (F = 16.86, P = .001), 
and for forced noncorrection method 
than for correction (F=7.38, P=.01). 
None of the interactions was signifi- 
cant, all four Fs being less than unity. 
The Hartley test did not indicate any 
heterogeneity of variance (Fmax. of 
3.64). Although there was a marked 
negative skew in the distribution of 
scores, it is unlikely (Boneau, 1960) 
that this had any substantial effect 
on the Fs. 

Reversal.—The number of correct 
choices on the 24 reversal trials was 
tabulated for each S. Table 1 pre- 
sents the means and SDs of these 
scores for the 16 groups. These 
distributions were more nearly normal 
than those for acquisition, and the 
apparent heterogeneity of variance 
was not significant by Bartlett's test 
(B = 19.76, df = 15). 

Table 2 summarizes the quadruple- 
classification analysis of variance of 
these data. Although all four main 
effects were significant when tested 



TABLE 2
ANALYSIS OF VARIANCE OF NUMBER OF
CORRECT CHOICES IN REVERSAL

Source                           df      MS         F
Percentage reinforcement (%)      1     84.376     6.20*
Training method (M)               1    273.376    20.07***
Acquisition reward (KA)           1    360.376    26.46***
Reversal reward (KR)              1    266.667    19.59***
% × M                             1    126.040     9.70**
% × KA                            1    330.040    24.23***
% × KR                            1      2.667      —
M × KA                            1      3.374      —
M × KR                            1     37.499     2.75
KA × KR                           1     10.667      —
% × M × KA                        1     18.377     1.35
% × M × KR                        1       .167      —
% × KA × KR                       1      0          —
M × KA × KR                       1       .667      —
% × M × KA × KR                   1     53.998     3.06
Within cells(a)                  80     13.620

(a) Used as error term for all F tests.
* P < .05.
** P < .01.
*** P < .001.


against the within-cells variance, the 
three that involved acquisition vari- 
ables were overshadowed by inter- 
actions. Figure 2 shows the course of 
reversal learning classified in three 
different ways so as to clarify these 
interactions. The effect of acquisition 
variables on reversal may be sum- 
marized by saying that 100% cor- 
rection Ss reversed fastest and 50% 
large-reward Ss reversed most slowly. 
Reversal, like acquisition, was faster 
for the larger current reward. 
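Because every effect in Table 2 has 1 df and is tested against the within-cells variance, each F is simply the effect mean square divided by 13.620. The Python sketch below recomputes F for a few rows as a consistency check. The row labels are paraphrased, the function name is illustrative, and agreement is expected only to within rounding of the tabled values.

```python
MS_WITHIN = 13.620  # within-cells mean square, 80 df (Table 2)

# (mean square, F as given in Table 2) for selected rows
rows = {
    "Percentage reinforcement": (84.376, 6.20),
    "Training method": (273.376, 20.07),
    "Acquisition reward": (360.376, 26.46),
    "Reversal reward": (266.667, 19.59),
    "Percentage x acquisition reward": (330.040, 24.23),
    "Method x reversal reward": (37.499, 2.75),
}

def f_ratio(ms_effect, ms_error=MS_WITHIN):
    """F for a 1-df effect tested against the within-cells mean square."""
    return ms_effect / ms_error

for source, (ms, tabled_f) in rows.items():
    print(f"{source}: recomputed F = {f_ratio(ms):.2f}, tabled F = {tabled_f}")
```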

The number of errors prior to the 
first correct response in reversal was 
also tabulated and subjected to an 
analysis of variance on the three 
acquisition variables. This measure 
gave substantially the same results 
as the total correct reversal responses.
The main effects of percentage and
method fell short of significance, but
the effect of acquisition magnitude
remained significant at the .001 level.
The interactions of percentage with
reward size and percentage with
method manifested the same patterns
as before and were both significant
at the .01 level.


Speeds.—Time scores were con- 
verted to reciprocals and analyses 
of variance computed. In a triple- 
classification analysis for the last four 
trials of acquisition, with 1/88 df 
for each F, the correction method 
gave faster running at the .001 level 
(F = 131.43) and larger reward gave 
faster running for correction-method 
Ss only (F for magnitude-method in-
teraction = 4.70, P = .05). No other
F was significant. In a quadruple-clas-
sification analysis for all 24 reversal
trials, with 1/80 df for each F, Ss re- 
ceiving the larger current reward ran 
faster (F = 10.16, P = .01) and those 
trained with the correction method 
ran faster (F = 8.90, P = .01). No 
other F was significant. 


[Figure 2. Vertical axis: probability of a correct response; horizontal axis: trials.]

Fig. 2. Reversal learning for all Ss, classi-
fied in three different ways according to
reversal variable (top) and acquisition vari-
ables (middle and bottom).


DISCUSSION


Acquisition.—The superiority of the
forced-trial, noncorrection method, in 
spite of the smaller number of reinforce- 
ments that it provided, argues for the 
importance in learning of making non- 
rewarded incorrect responses. Not only 
did the correction Ss have less experience 
with the incorrect side, they also were 
able to obtain reward, although delayed, 
even when they made the wrong choice. 
The fact that the noncorrection Ss were 
forced half the time to the wrong side 
makes it difficult to compare these re- 
sults with those of other studies com- 
paring the correction and noncorrection 
methods. However, in emphasizing the 
importance of incorrect responses, this 
study agrees with others that have found 
better learning of a discrimination with 
a combination of rewarded and non- 
rewarded trials than with the same 
number of trials all rewarded. 

No support was found for the hypothe- 
sis that greater reward leads to faster 
spatial discrimination learning with the 
noncorrection but not with the correc- 
tion method. Larger reward gave faster 
learning throughout, and there was 
no interaction between magnitude and 
method. The possibility that the present 
finding is due to the use of forcing in the 
noncorrection method cannot be ruled 
out. However, since this experiment 
was begun, another finding disconfirming 
the hypothesis has been reported by 
Lawson, Cross, and Tambe (1959). 
Thus it appears that some other factor 
must be found to explain the discrepant 
prior findings on the relation of dis- 
crimination learning to reward magni- 
tude. 

The more rapid acquisition with 100% 
than with 50% reinforcement is com- 
parable to the common, though by no 
means universal, finding with acquisi- 
tion of a simple running response. 

Reversal.—The finding that a large 
reward 50% of the time produced greater 
resistance to reversal than any of the 
other three combinations of magnitude 
and probability is similar to Hulse’s 
(1958) finding on resistance to extinction 


in a straight alley. However, the lack 
of any difference among the other three 
groups differs from Hulse’s findings. 
The effect found here is consistent both 
with a partial reinforcement extinction 
effect, but only for the large reward 
condition, and with a greater persistence 
of more strongly rewarded responses, 
but only for the 50% condition, and 
hence is not completely consistent with 
either of these principles. 

Training method had no appreciable 
effect on reversal when total number of 
reinforcements was held constant by 
comparing 50% correction with 100% 
noncorrection Ss. The interaction that 
was found between method and per- 
centage may reflect in part a tendency 
to prefer the less often experienced side 
(Denny, 1957). Since the 100% correc- 
tion Ss made the largest number of 
correct turns in acquisition (the non- 
correction Ss being forced equally often 
to the two sides), this tendency should 
be greatest for them and should lead to 
more rapid extinction of the old response 
and acquisition of the new response, as 
was found. The partial reinforcement 
effect might then be invoked to explain 
why the 50% correction Ss were so 
markedly slower in reversing than the 
100% correction Ss. According to this 
combination of factors, however, the 
50% noncorrection group should have 
reversed most slowly, which was not 
the case. Thus this interaction also 
remains at least partly unexplained. 

The faster reversal with larger reversal 
reward agrees with the finding of faster 
acquisition with larger reward. Since 
the reversal training method of free, 
noncorrection trials was different from 
either of the acquisition methods, the 
superiority of the larger reward in this 
condition increases the generality of the 
finding that larger reward leads to faster 
discrimination learning. There was no 
evidence of generalization decrement 
resulting from the change in reward 
magnitude between acquisition and ex- 
tinction. The lack of interaction be- 
tween reversal reward magnitude and 
any of the acquisition variables argues 
for the independence of acquisition and
reversal variables in their effect on
reversal. 

Speeds.—Since the recorded running 
speeds include time in the stem plus 
time in the choice area, the speed data 
are difficult to interpret. The most 
striking finding is the superiority of the 
correction method. McKelvey (1956) 
also found greater speed and less ac- 
curacy with the correction method, and 
in his study this might be attributed to 
the greater consistency of reward with 
correction method. This explanation 
seems inadequate in the present study, 
however, since 100% reinforcement did 
not give significantly faster running than 
50%. It is also striking that the superi- 
ority of the correction group continues 
through reversal, even though all Ss 
were on noncorrection method during 
reversal. Since the noncorrection Ss 
received primarily forced trials in acqui- 
sition, it seems likely that forcing to 
the incorrect side, rather than non- 
correction as such, was the crucial factor. 
However, why the difference should 
remain after all animals were changed 
to free trials is problematic.

In general the effect of magnitude is 
greater on choices than on speeds. In 
particular, noncorrection Ss made more 
correct choices in acquisition, but did 
not run faster, for large than for small 
reward. This argues against Pubols’ 
(1961) suggestion that the effect of 
magnitude on choices is mediated by the 
effect on speeds. 


SUMMARY 


Acquisition and reversal in a T maze were 
studied for 96 rats as a function of four vari- 
ables combined factorially: (a) 100% vs. 50% 
random reinforcement in acquisition, (b) 
one vs. four reward pellets in acquisition, 
(c) correction vs. forced-trial noncorrection 
method in acquisition, and (d) one vs. four 
reward pellets in reversal. All Ss received 
100% reinforcement and free-trial noncorrec- 
tion method in reversal. 

Acquisition of the correct response was 
faster for large reward, 100% reinforcement 
and forced noncorrection method, with no
interactions. Reversal was faster for large
reversal reward, faster after 100% reinforce-
ment with correction method than any other
combination of percentage with method, and
slower after 50% reinforcement with large 
acquisition reward than any other combina- 
tion of percentage with acquisition magnitude. 
Acquisition by the correction method gave 
faster running during both acquisition and 
reversal. Running was faster with larger
current reward both during acquisition by 
correction method and during reversal. 


REFERENCES 


Armus, H. L. Effect of magnitude of rein-
forcement on acquisition and extinction
of a running response. J. exp. Psychol.,
1959, 58, 61-63.

Boneau, C. A. The effects of violations of
assumptions underlying the t test. Psy-
chol. Bull., 1960, 57, 49-64.

Denny, M. R. Learning through stimulus
satiation. J. exp. Psychol., 1957, 54, 62-64.

Hull, C. L. A behavior system. New Haven:
Yale Univer. Press, 1952.

Hulse, S. H., Jr. Amount and percentage
of reinforcement and duration of goal
confinement in conditioning and extinction.
J. exp. Psychol., 1958, 56, 48-57.

Lawson, R., Cross, H. A., & Tambe, J. T.
Effects of large and small rewards on maze
performance after different prior experi-
ences with reward amounts. J. comp.
physiol. Psychol., 1959, 52, 717-720.

McKelvey, R. K. The relationship between
training methods and reward variables
in brightness discrimination learning. J.
comp. physiol. Psychol., 1956, 49, 485-491.

Metzger, R., Cotton, J. W., & Lewis, D. J.
Effect of reinforcement magnitude and of
order of presentation of different magni-
tudes on runway behavior. J. comp.
physiol. Psychol., 1957, 50, 184-188.

Pubols, B. H. The acquisition and reversal
of a position habit as a function of incen-
tive magnitude. J. comp. physiol. Psychol.,
1961, 54, 94-97.

Spence, K. W. Behavior theory and condi-
tioning. New Haven: Yale Univer. Press,
1956.

Zeaman, D. Response latency as a function
of the amount of reinforcement. J. exp.
Psychol., 1949, 39, 466-483.


(Received June 30, 1961) 


Journal of Experimental Psychology
1962, Vol. 64, No. 1


PAIRED-ASSOCIATE LEARNING UNDER SIMULTANEOUS 
REPETITION AND NONREPETITION CONDITIONS¹


WILLIAM F. BATTIG 


University of Virginia 


Several recent investigators (e.g., 
Clark, Lansford, & Dallenbach, 1960; 
Estes, 1960; Estes, Hopkins, & Croth- 
ers, 1960; Rock, 1957; Rock & Heimer, 
1959) have obtained evidence leading 
them to conclude that associative 
learning is an all-or-none rather than 
a gradual incremental process, and 
that repetition serves only to provide 
additional opportunities for such all- 
or-none associations to be learned. 
However, none of these studies have 
been sufficiently free of defects in 
experimental design and procedure to 
preclude alternative interpretations, 
discussed at length elsewhere (Post- 
man, 1962; Underwood, 1961; Under- 
wood, Rehula, & Keppel, 1962), so 
that the theoretical issue of all-or- 
none vs. gradual association forma- 
tion remains a matter of considerable 
dispute. 

The present paper reports a series 
of experiments designed to eliminate 
or minimize the influence of several 
inadequately controlled variables in 
the procedure originally introduced 
by Rock (1957), wherein performance 
of a repetition group receiving the 
same list of paired-associate items on 
each trial did not differ from a 
modified nonrepetition group, for 
whom all pairs not correctly responded 
to on any trial were removed from the 
list and replaced by new pairs on the 


1 Experiments I and II were reported at the
April 1959 meetings of the Eastern Psycho-
logical Association, Atlantic City, N. J.
Exp. III and IV were supported by a contract
with the United States Office of Education, 
Department of Health, Education, and Wel- 
fare. The author is indebted to Douglas 
Nelson for assistance with Exp. IV. 


following trial. Since groups learning 
under the repetition and nonrepeti- 
tion procedures may also differ in such 
factors as instructions and approach 
to the task (Brackett, 1961), or 
amount of interference produced by 
other pairs in the list (Brown, 1961; 
Clark et al., 1960), a within-Ss design 
was employed so that each S learned 
simultaneously under both conditions 
and therefore served as his own 
control. To test more adequately 
the role of repetition in association 
formation as distinguished from other 
processes involved in paired-associate 
learning (e.g., Underwood & Schulz, 
1960), an attempt was made to de- 
velop materials which provided a 
relatively pure and substantial case 
of association formation. These ma- 
terials were also highly homogeneous 
and carefully calibrated so as to be 
equivalent in difficulty, in order to 
minimize the effects of selective 
elimination of the more difficult in- 
correct pairs on each trial, which has 
been shown to result in a significantly 
easier list under the nonrepetition 
condition (Underwood et al., 1962; 
Williams, 1961). 


EXPERIMENTS I AND II 


Method 

The method and procedure for both Exp. 
I and II closely followed Rock (1957), except 
that the list for each S consisted of two equal- 
sized subsets of pairs representing the repeti- 
tion (R) and nonrepetition (NR) conditions. 
All pairs of the R subset were presented on 
every trial throughout the experiment, where- 
as only those pairs responded to correctly 
on the preceding trial were retained in the 
NR subset, all incorrect pairs being removed 
from the list and replaced on the next trial
by new pairs. The 15 Ss in each experiment
were given typical paired-associate instruc- 
tions with an added statement that new pairs 
might be introduced during the experiment, 
although the procedure effectively prevented 
S from distinguishing between the R and NR 
subsets, or from detecting the basis for intro- 
ducing new NR pairs. 
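The trial-to-trial bookkeeping for the two subsets can be sketched as follows. This is a minimal illustration in Python, not the original procedure as run. The function name, the pair labels, and the assumption that replacements are drawn from the front of an unused pool (which must hold at least as many pairs as are replaced) are all illustrative.

```python
def next_trial_list(r_pairs, nr_pairs, correct, replacement_pool):
    """Build the R and NR subsets for the next trial.

    r_pairs: pairs in the repetition subset (always retained)
    nr_pairs: pairs currently in the nonrepetition subset
    correct: set of pairs answered correctly on the trial just finished
    replacement_pool: unused pairs, consumed from the front
    """
    pool = list(replacement_pool)
    new_nr = []
    for pair in nr_pairs:
        if pair in correct:
            new_nr.append(pair)          # retained: answered correctly
        else:
            new_nr.append(pool.pop(0))   # replaced by a pair S has not seen
    return list(r_pairs), new_nr, pool

# Hypothetical labels, for illustration only.
r, nr, pool = next_trial_list(
    r_pairs=["r1", "r2"],
    nr_pairs=["n1", "n2"],
    correct={"r1", "n2"},
    replacement_pool=["p1", "p2"],
)
print(r, nr, pool)
```

Note that an incorrect R pair ("r2" in the example) stays in the list, whereas an incorrect NR pair ("n1") is dropped and its place taken by a new pair.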

All learning materials were typed on 3 × 5
in. cards and presented manually through a 
card-exposure device. Each trial began with 
the successive presentation of all pairs in the 
list for 3 sec. each with a 5-sec. interpair inter- 
val, followed by all stimulus terms alone in a 
different order at a 5-sec. rate while S at- 
tempted to recall the associated response, 
with a 30-sec. intertrial interval. To further 
guard against differentiation of the R and 
NR subsets, both were distributed evenly 
throughout the list on each trial. 

Experiments I and II differed only in the 
kind of learning material employed. In 
Exp. I, 84 pairs of nonsense syllables of 47-
53% association value (Glaze, 1928) were
selected so as to be maximally homogeneous
in judged ease of learning according to a
previous scaling procedure (Battig, 1959),
and divided into 14 subsets of 6 pairs, matched 
as to mean and SD of these scale values. 
The initial list of 12 pairs for each S consisted 
of two of these equivalent 6-pair subsets 
representing the R and NR conditions, re- 
placement pairs for the latter being selected 
from another equivalent subset until ex- 
hausted, then from another subset, and so on 
until the final trial. Each of the 14 subsets 
was used equally often as the R and initial 
NR subset and at each point in the sequence 
of replacement pairs for the various Ss. 
Learning was carried out to a criterion of 1 
errorless trial or a maximum of 10 trials. 

Pairs of common 4-letter words as stimuli
and 2-digit number responses were used in
Exp. II, wherein each S learned to a criterion
of one errorless trial an 18-pair list consisting
of two 9-pair subsets of R and NR pairs. A
total of 84 such word-number pairs were con-
structed so as to minimize
associations or similarities, each being used
equally often in the R and initial NR subset
and at various points in the sequence of NR
replacements.

Each S in both Exp. I and II was given a 
recognition test immediately after the last 
trial, consisting of the individual presentation
of 10 incorrect NR pairs which had been replaced,
intermixed randomly with 10 additional
pairs from the pool of 84 which S had
never seen, with S indicating in each case


WILLIAM F. BATTIG 


whether or not he thought he had seen the 
pair previously during the experiment. 


Results 


Means and SDs for the repetition 
(R) and nonrepetition (NR) condi- 
tions are presented separately in 
Table 1 for total errors over the six 
trials on which data were available 
from all 15 Ss of Exp. I, and for total 
errors and trials to the criterion of one 
errorless trial in Exp. II. Revealed
herein is a marked and statistically
significant difference in favor of Cond.
R in Exp. I (t = 6.44, df = 14),
whereas the slight superiority of
Cond. NR for both measures in Exp.
II fell far short of significance (both
ts < 1). However, the recognition
test showed significantly above-chance
recognition of incorrect NR pairs
which was actually superior in Exp. II
(79.7% correct, t = 7.76) to Exp. I
(70% correct, t = 7.75), demonstrating
that something had been learned
about the replaced incorrect pairs 
even in Exp. II where their removal 
did not retard NR learning. 
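
Because each S contributed a score under both conditions, the t values above are presumably within-Ss (paired) comparisons with df = N - 1. A minimal sketch of that computation follows; the function name `paired_t` and the four pairs of scores are hypothetical and are not data from these experiments.

```python
import math
import statistics

def paired_t(r_scores, nr_scores):
    """Return (t, df) for a within-Ss comparison of NR minus R scores."""
    diffs = [nr - r for r, nr in zip(r_scores, nr_scores)]
    n = len(diffs)
    mean_diff = statistics.mean(diffs)
    sd_diff = statistics.stdev(diffs)          # sample SD, n - 1 in the denominator
    t = mean_diff / (sd_diff / math.sqrt(n))   # mean difference over its standard error
    return t, n - 1

# Hypothetical error totals for four Ss (illustration only).
r_errors = [3, 6, 4, 5]
nr_errors = [5, 7, 6, 8]
t, df = paired_t(r_errors, nr_errors)
```

With 15 Ss the same computation gives df = 14, as reported for Exp. I. The sketch fails with a division by zero if every S shows the identical difference.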


The results of Exp. I, wherein every 
1 of the 15 Ss made more NR than R 
errors, may be due to facilitation by 
repetition of the substantial response 
learning required for the nonsense- 
syllable materials, and therefore are 
inadequate to disprove an all-or-none 
theory of association formation as distinguished
from paired-associate learning
(e.g., Underwood et al., 1962). Although
such response learning was minimized
for the word-number pairs of Exp. II, so
also was the amount of association formation
significantly reduced, resulting in
such rapid learning (63% correct responses
on Trial 2) that the performance
measures become quite insensitive to any
effects of repetition. Since these materials
also differed considerably in difficulty,
thereby providing an important
advantage for Cond. NR, the results of
Exp. II cannot be interpreted as unequivocally
supporting an all-or-none
theory of association formation.


PAIRED-ASSOCIATE LEARNING 89 


EXPERIMENT III 


To overcome the major deficiencies 
of Exp. I and II, an attempt was made 
to develop for use in Exp. III materials
which required a substantial
amount of association formation while 
relatively free from other nonassocia- 
tive processes typically involved in 
paired-associate learning. This was 
accomplished by (a) constructing 
pairs with maximally different and 
unrelated stimulus and response mem- 
bers; (b) using stimuli of low meaning- 
fulness and familiarity which were 
highly discriminable from each other, 
since previous evidence indicates that 
discriminability represents a major 
factor in learning of the stimulus 
(Battig, Williams, & Williams, 1962), 
whereas meaningfulness and famili- 
arity are of lesser importance (Under- 
wood & Schulz, 1960); (c) using 
highly meaningful and familiar re- 
sponses to minimize response learning. 


Method 


Each of 72 nonsense shapes of high dis- 
criminability and low association value was 
paired with a different two-digit number 
between 12 and 98 (excluding double num- 
bers and numbers ending in zero), forming 72 
pairs with minimal shape-number similarity 
according to the judgments of 10 preliminary 
Ss. Previous measures had been obtained of 
(a) association value and discriminability 
of each shape from 122 Ss; (b) association 
value of each number from 28 Ss; (c) rated 
“ease of learning” of each shape-number pair 
by 66 Ss; (d) actual learning difficulty in a 
separate study reported elsewhere, which 
describes the materials in more detail (Battig 
& Brackett, 1961). These 72 pairs were 
divided into 12 subsets of 6 pairs each which 
were approximately equivalent with respect 
to each of these four indices, each being used 
with approximately equal frequency as the R 
and initial NR subset and throughout the 
sequence of NR replacement pairs. 

All other aspects of method and procedure
were identical to Exp. I with the following 
minor exceptions: (a) all materials were pho- 
tographed and presented for learning and at- 
tempted recall by means of an automatic 
slide projector; (b) a 5-sec. rate (4-sec. ex- 


TABLE 1


PERFORMANCE MEASURES FOR REPETITION
AND NONREPETITION CONDITIONS OF
EXP. I-III


                   Repetition       Nonrepetition
Exp.   Measure     Mean     SD      Mean     SD

I      Errors      22.13    5.40    28.93    5.25
II     Errors      13.27    6.95    12.40    5.87
       Trials       4.47    1.41     4.20    1.17
III    Errors      16.00    4.87    20.07    7.04
       Trials       5.80    1.17     6.67    1.85


Note. Errors and trials are to a criterion of one
errorless trial, except for Exp. I which is based on Trials
1-6 for all Ss.


posure, 1-sec. interslide interval) was used 
for presentation of pairs for learning as well 
as for stimuli alone during attempted recall, 
with a 30-sec. interval between trials and 
between presentation and recall series within 
each trial; (c) learning was continued to a 
criterion of two successive errorless trials 
(although none of the 15 Ss ever made an 
error on the trial following the first errorless 
trial); (d) following the postexperimental 
recognition test for incorrect NR pairs, each 
S was shown again each of the 12 pairs he 
had learned and asked to describe verbally 
how he had learned them. 


Results 


As shown in Table 1, performance 
was superior on the R subset according 
to both error and trial measures, although
significantly so only for errors
(t = 2.83, P < .02) but not for trials
(t = 1.85, .05 < P < .10). Further
error analysis revealed the difference 
to be primarily due to significantly 
more response omissions under Cond. 
NR (t= 3.06, P < .01), which ac- 
counted for 29% of the NR errors as 
compared with 18% for Cond. R. 
The two conditions did not differ 
significantly either in partially correct 
responses (t = 1.25), intralist intrusions
(t = 1.34), or extralist intrusions
(t = 1.05). Recognition test
results were similar to Exp. I and II, 
yielding 71.7% correct identifications 




[Figure 1: mean correct responses (vertical axis) by trial (horizontal axis), with separate curves for the repetition and nonrepetition conditions of Exp. IV.]


Fig. 1. Mean correct responses per trial
for the modified repetition and nonrepetition
conditions of Exp. IV on Trials 1-7.


(t = 9.12). Further analysis revealed
recognition errors to consist primarily 
of reports of incorrect NR pairs as 
not seen previously (85%) rather 
than of the unseen pairs as previously 
seen (15%). 

Based on postexperimental ques- 
tioning, the learned pairs were cate- 
gorized on the basis of S's ability to 
verbalize a mediated basis for asso- 
ciating the shape and number to- 
gether, yielding insignificantly more 
mediated associations for NR (24%) 
than R pairs (20%). No basis for 
association could be verbalized for 
67% and 63% of the R and NR pairs, 
respectively, the remaining 13% in 
each case representing a “doubtful” 
category. These results indicate that 
the present materials had been reason- 
ably successful in providing a rela- 
tively pure and substantial case of 
association formation, and were suf- 
ficiently homogeneous to minimize if 
not eliminate the problem of differ- 


ences in pair difficulty in favor of 
Cond. NR. 


EXPERIMENT IV 


Although the differences in favor 
of R over NR performance in Exp. III 
were significant statistically, they 
were not impressively large in magni- 
tude, due at least in part to the 


relatively small number of pairs 
learned by each S, and the pre- 
dominance of already learned pairs 
on all but the first few trials as a 
consequence of the rapid learning 
of the shape-number pairs. In order 
to overcome these deficiencies and 
provide a more sensitive test of the 
role of repetition in this task, the 
procedure was modified in Exp. IV 
so that all pairs learned on each trial 
were eliminated from the list and 
replaced by new pairs. 


Method 


As in Exp. III, each S was presented with 
a list of 12 shape-number pairs on each trial, 
divided into two equivalent subsets of 6 
pairs representing repetition (R) and nonrepetition
(NR) conditions. However, all
NR pairs were presented only for a single
trial, being replaced after each trial by another
of the equivalent subsets of 6 pairs which
S had not seen previously. Pairs of the R
subset were removed from the list and replaced
by new pairs only if they had been
responded to correctly on the preceding trial,
so that this subset on any given trial typically
included a combination of new pairs with
previously presented incorrect pairs. Trials
were continued in this manner until the pool
of 72 pairs was exhausted, which required
seven to nine trials depending on the number of
correct responses to R pairs for the various Ss.
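
The modified list-construction rule of Exp. IV can be sketched as follows. This is a hedged illustration only: the function name `next_exp4_list`, the integer pair IDs, and the single queue of unused pairs are hypothetical, and the matching of equivalent subsets is ignored.

```python
from collections import deque

def next_exp4_list(r_pairs, r_correct, pool, nr_size=6):
    """Build the next trial's R and NR subsets under the Exp. IV rule."""
    # R subset: pairs answered correctly leave the list; missed pairs are repeated.
    new_r = [pool.popleft() if pair in r_correct else pair for pair in r_pairs]
    # NR subset: every pair is new on every trial.
    new_nr = [pool.popleft() for _ in range(nr_size)]
    return new_r, new_nr

# Hypothetical pair IDs: 1-12 were used on Trial 1, leaving 13-72 unused.
pool = deque(range(13, 73))
r, nr = next_exp4_list([1, 2, 3, 4, 5, 6], r_correct={2, 6}, pool=pool)
```

Each trial after the first therefore consumes six new pairs for the NR subset plus one new pair for every correct R response, which is why the number of trials needed to exhaust the pool varied across Ss. The sketch raises `IndexError` once the pool is empty.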

The only other differences in method and 
procedure from Exp. III consisted of (a) 
appropriate modifications in instructions to 
cover the changed conditions; (b) elimination 
of the postexperimental recognition test and 
attempted verbalization of correct associa- 
tions; (c) use of 20 paid Ss representing a 
wider range of previous experience and
sophistication about paired-associate learning 
experiments than in Exp. I-III, each of which 
had used 15 volunteers from undergraduate 
psychology courses.


Results 


The mean number of correct responses
per trial for pairs of the R and
NR subsets is presented in Fig. 1,
which indicates a consistent and 
increasing superiority of Cond. R to 
NR over the seven trials on which 
data were available from all Ss. 




Total correct responses for Cond. R 
(Mean = 17.85, SD = 4.24) were significantly
greater (t = 7.06, df = 19)
than for Cond. NR (Mean = 11.65, 
SD = 4.96). Except for 2 Ss who 
performed identically under both con- 
ditions, all Ss made more correct 
responses on the R subset. The slopes 
of the R and NR curves of Fig. 1 were 
found to differ significantly by trend 
analysis of variance (F = 10.35, 
df = 1/38), due to a highly significant 
improvement over trials for Cond. R 
(F = 29.92, df = 1/19) whereas the 
slight increase in correct responses 
under Cond. NR fell far short of 
significance (F <1). In agreement 
with Exp. III, error analysis revealed 
significantly more omissions (t=2.86, 
P<.01) for Cond. NR (Mean = 11.10) 
than Cond. R (Mean = 7.50), but 
the excess of partially correct re- 
sponses and intrusions in Cond. NR 
(Mean = 24.35) over Cond. R 
(Mean = 21.75) was not significant 
(t = 1.58, P > .10). 

Although the greater increase in 
correct responses over trials under 
Cond. R undoubtedly reflects the 
increased proportion of frequently 
presented incorrect pairs on later 
trials, a more direct and convincing 
demonstration of the superiority of 
performance on repeated pairs comes 
from a comparison of proportions of 
correct responses for R pairs varying 


TABLE 2


PROPORTIONS OF CORRECT RESPONSES FOR
REPETITION PAIRS AS A FUNCTION
OF NUMBER OF PRIOR
PRESENTATIONS


Measure              Number of Prior Presentations
                     (column labels illegible in source)

Total number         420     265     135     66      56
Correct responses    [illegible in source]
Total proportion     .281    [remaining values illegible in source]
Mean proportion      .270    .424    .496    .581    .539


in number of prior presentations. 
Table 2 presents the total number of 
pairs presented once, twice, etc. 
within the R subset summed over all 
Ss, along with the total number and 
proportion of these presentations which 
yielded correct responses, and the 
mean proportion of correct responses 
averaged over Ss.² Revealed herein
is a consistent increase with number 
of prior presentations in both total 
and mean proportion of correct re- 
sponses, the rate of increase being 
slightly less for total proportion due 
to the relatively greater contribution 
of slower learners to this figure 
as number of presentations increases.
The proportions of first, second, and 
third presentations responded to cor- 
rectly by each S were subjected to 
trend analysis of variance, which 
showed the increase with number of 
presentations to be highly significant 
(F = 24.56, df = 1/19). 
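
The distinction between the total proportion (pooled over all presentations) and the mean proportion (averaged over Ss) can be illustrated with a short sketch. The function name `total_and_mean_proportion` and the counts are hypothetical and are not taken from Table 2.

```python
def total_and_mean_proportion(counts):
    """counts: list of (n_correct, n_presented) tuples, one per S."""
    total_correct = sum(c for c, _ in counts)
    total_presented = sum(n for _, n in counts)
    total_prop = total_correct / total_presented             # pooled over presentations
    mean_prop = sum(c / n for c, n in counts) / len(counts)  # each S weighted equally
    return total_prop, mean_prop

# Hypothetical counts: a fast learner with few repeated pairs, a slow learner with many.
counts = [(3, 4), (3, 12)]
total_prop, mean_prop = total_and_mean_proportion(counts)
```

Here the pooled figure (.375) falls below the mean of the two individual proportions (.500) because the slow learner supplies three quarters of the presentations, which is the weighting effect described above.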


DISCUSSION 


The present results demonstrate con- 
clusively the facilitation of paired-asso- 
ciate learning by repetition of incor- 
rect pairs under conditions where (a) 
other variables which may differ for 
R and NR conditions have been elimi- 
nated through the use of a within-Ss 
design; (b) careful precautions have been 
taken to minimize the effects of differ- 
ences between pairs in difficulty; (c) 
nonassociative factors in paired-asso- 
ciate learning, such as response learning 


2 A somewhat more stable estimate of per- 
formance on once-presented pairs would 
probably be provided by the inclusion of pairs 
of the NR subset, all of which were presented 
only once. However, performance on NR 
pairs (p = .247) was somewhat below that 
for R pairs presented for the first time 
(p = .281), probably due to the differential 
distribution of these pairs over trials under 
the two conditions. The decision to base 
the present comparison solely on R pairs 
therefore represents, if anything, an overly 
conservative estimate of the magnitude of 
the increase in proportion of correct responses 
with number of previous presentations. 




and stimulus discrimination, have been 
largely eliminated from the task. Only 
in Exp. II, which was clearly inadequate 
due to the extremely rapid learning of 
the word-number pairs and the lack of 
control for differences in pair difficulty, 
were the results not in direct conflict 
with an all-or-none theory, and even 
here the results of the recognition test 
showed that something short of a correct 
association had been learned about the 
eliminated incorrect NR pairs.

In comparison with previous studies 
using the Rock (1957) procedure, Exp.
III and IV would appear to be clearly 
superior with respect to the elimination 
of uncontrolled variables and biases 
favoring either the R or NR condition. 
However, despite the extensive efforts 
to equate the shape-number pairs in 
average difficulty, these pairs probably 
were not equally difficult for the indi- 
vidual S. Furthermore, besides being 
more difficult, the increasing superiority 
of performance on more frequently 
presented R pairs in Exp. IV may have
obtained in spite of the relatively large
contribution of slow learners to measures
based on these pairs. Since either or
both of these sources of bias would
clearly favor NR performance, the conclusiveness
of the present results in
demonstrating the positive role of repetition
can only be enhanced thereby.

Only if the position is taken that some
process other than association formation
[...] theory of association formation. Any
such argument, of course, reduces the
[...] meaninglessness unless
[...] process can
[...] precise operational specification.
While verbal definitions of association
formation as distinguished from
paired-associate [...]
up,” or “associating together” the stimulus
and response members of each pair.
Procedurally, efforts to provide the
necessary pure case of association formation
have concentrated on the elimination
of response learning, reflecting the
influence of Underwood's conception of
paired-associate learning as a two-stage
process consisting of response-learning
and associative phases (Underwood et al., 
1962; Underwood, Runquist, & Schulz, 
1959; Underwood & Schulz, 1960). 
There can be little question but that the 
present results are directly applicable to 
this definition of association formation as 
paired-associate learning in the absence 
of response learning. 

However, although the present stimu- 
lus materials were designed to be maxi- 
mally discriminable, thereby eliminating 
another nonassociative process suggested 
to be involved in paired-associate learning
(Battig et al., 1962; Gibson, 1940),
they differed from the letters or words 
typically employed as stimuli in related 
studies in their low meaningfulness and 
familiarity. Inasmuch as stimulus familiarity
has little or no effect on paired-associate
learning while stimulus meaningfulness
affects primarily the association-formation
phase (Underwood &
Schulz, 1960), the present shape-number 
materials would therefore appear to be 
maximally appropriate and sensitive as 
a test of the role of repetition in associa- 
tion formation, since they do not sub- 
stantially reduce, nor shortcut through 
already existing mediated associations, 
the amount of association formation 
required of S. 

Nevertheless, although the present 
materials have eliminated those non- 
associative processes previously identi- 
fied to be important in paired-associate 
learning, the possibility remains that 
other processes besides association forma- 
tion are still involved in learning the 
shape-number pairs. However, unless 
or until such processes are identified
and given adequate operational specifi- 
cation, it can be concluded that associa- 
tion formation as presently defined is not 
an all-or-none process, but instead builds 
up gradually in strength through repetition.
Moreover, in view of the rapid
learning even under the present
conditions, which were carefully designed 


to maximize the required amount of 




association formation, it would appear 
that this process may constitute such a 
small and insignificant part of the learn- 
ing involved in the typical paired-asso- 
ciate task, that questions concerning 
its all-or-none or gradual nature are 
likely to be of little consequence for the 
general understanding of factors im- 
portant in paired-associate learning. 


SUMMARY 


The effect of repetition of previously incor- 
rect pairs in paired-associate learning was 
evaluated in four separate experiments in 
which each S learned a single paired-associate 
list consisting of two equivalent subsets of 
pairs. In Exp. I-III, each of 15 Ss learned 
under conditions where the repetition (R) 
subset consisted of the same pairs repeated 
on all trials, while pairs of the nonrepetition 
(NR) subset were retained in the list only if 
responded to correctly and were otherwise 
removed from the list and replaced by new 
pairs on the next trial. Lists of 12 nonsense- 
syllable pairs, 18 word-number pairs, and 12 
pairs of nonsense shapes and numbers were 
used in Exp. I-III, respectively. Experi- 
ment IV (N = 20) used the same shape- 
number pairs as Exp. III, but under condi- 
tions where all pairs responded to correctly 
were immediately removed from the list, 
so that the NR subset consisted of a new set 
of pairs on each trial, while the R subset 
included both new pairs and previously pre- 
sented incorrect pairs. Significant differ- 
ences in favor of Cond. R were obtained in all 
cases except for Exp. II, which was attributed 
to the greater ease of learning and rather 
wide range of difficulty of the word-number 
pairs. Particularly in view of the significant 
positive effects of repetition in Exp. III and 
IV, using materials requiring a relatively pure 
and substantial case of association formation 
which were also carefully calibrated to mini- 
mize the effects of differences in pair difficulty, 
it was concluded that the present results are 
directly contradictory to an all-or-none theory 
of association formation in paired-associate 
learning. 


REFERENCES 


BATTIG, W. F. Scaled difficulty of nonsense-syllable
pairs consisting of syllables of
equal association value. Psychol. Rep.,
1959, 5, 126.

BATTIG, W. F., & BRACKETT, H. R. Comparison
of anticipation and recall methods
of paired-associate learning. Psychol. Rep.,
1961, 9, 59-65.

BATTIG, W. F., WILLIAMS, J. M., & WILLIAMS,
J. G. Transfer from verbal-discrimination
to paired-associate learning.
J. exp. Psychol., 1962, 63, 258-268.

BRACKETT, H. R. The effect of pretraining
and knowledge of results on the acquisition
of paired associates. Unpublished master's
thesis, University of Virginia, 1961.

BROWN, S. C. Interpair interference in
paired-associate learning. Unpublished
master's thesis, University of Virginia,
1961.

CLARK, L. L., LANSFORD, T. G., & DALLENBACH,
K. M. Repetition and associative
learning. Amer. J. Psychol., 1960, 73,
22-40.

ESTES, W. K. Learning theory and the new
“mental chemistry.” Psychol. Rev., 1960,
67, 207-223.

ESTES, W. K., HOPKINS, B. L., & CROTHERS,
E. J. All-or-none and conservation effects
in the learning and retention of paired
associates. J. exp. Psychol., 1960, 60,
329-339.

GIBSON, E. J. A systematic application of
the concepts of generalization and differentiation
to verbal learning. Psychol. Rev.,
1940, 47, 196-229.

GLAZE, J. A. The association value of nonsense
syllables. J. genet. Psychol., 1928, 35,
255-267.

POSTMAN, L. Repetition and paired-associate
learning. Amer. J. Psychol., 1962,
75, in press.

ROCK, I. The role of repetition in associative
learning. Amer. J. Psychol., 1957, 70,
186-193.

ROCK, I., & HEIMER, W. Further evidence
of one-trial associative learning. Amer. J.
Psychol., 1959, 72, 1-16.

UNDERWOOD, B. J. One-trial learning.
Invited address at Midwestern Psychological
Association, Chicago, May 1961.

UNDERWOOD, B. J., REHULA, R., & KEPPEL,
G. Item selection in paired-associate
learning. Amer. J. Psychol., 1962, 75,
in press.

UNDERWOOD, B. J., RUNQUIST, W. N., &
SCHULZ, R. W. Response learning in
paired-associate lists as a function of intralist
similarity. J. exp. Psychol., 1959, 58,
70-78.

UNDERWOOD, B. J., & SCHULZ, R. W. Meaningfulness
and verbal learning. Chicago:
Lippincott, 1960.

WILLIAMS, J. P. Supplementary report: A
selection artifact in Rock's study of the role
of repetition. J. exp. Psychol., 1961, 62,
627-628.


(Received July 3, 1961) 


Journal of Experimental Psychology
1962, Vol. 64, No. 1, 94


SUPPLEMENTARY REPORT: EFFECTS OF STIMULUS ASSOCIATION VALUE 
AND EXPOSURE DURATION ON R-S LEARNING¹


NED CASSEM AND DONALD H. KAUSLER
St. Louis University 


Jantz and Underwood (1958) demonstrated 
a gross positive relationship between stimulus 
association value (AV) and the amount of 
R-S learning. However, they found the same 
reversal between 0% and 33% Glaze values 
that had been reported earlier by Postman, 
Adams, and Phillips (1955). As noted by 
Jantz and Underwood, the reversal most
likely represented a sampling artifact in that
only two syllables were included in their list
at each AV level. The present study attempted
to provide a more precise determination
of the relationship between the AV of
stimuli and R-S learning by means of a more
adequate sampling of stimulus items.
In addition, the present study investigated 
effects of exposure duration during both 
S-R learning and R-S recall on the amount 
of R-S recall. If, as proposed by Feldman
and Underwood (1957), R-S learning is a
variant of incidental learning, in which S
learns associations for which he has no learning
set during a period in which he is set to
learn other associations, then the amount
of R-S or incidental learning should increase
with increased exposure duration (Kausler &
Trapp, 1961; Rosenberg, 1959).
Method.—Eighty seminarians were randomly
assigned to the 16 groups of a 2 × 2 × 4
factorial design, representing exposure duration
during S-R learning (Ty), exposure duration
during R-S recall (Tr), and AV. Each
S had six trials on an eight-item list with
nonsense syllables as stimuli and the same
words employed by Jantz and Underwood as
responses. Four sets of eight syllables were
selected from the 0, 33, 67, and 100% Glaze
lists, with the additional criterion that the
[...]gs were used to control for associative
effects. Both Ty and Tr were either 2 sec. or
4 sec. In other respects the experiment replicated
that of Jantz and Underwood (1958).
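
The assignment of Ss to the cells of the 2 × 2 × 4 design can be sketched as below. The sketch assumes equal cell sizes (5 Ss per group), which the text does not state but which is consistent with the error df of 64 reported in the results; the function name `assign_to_cells`, the subject numbering, and the seed are hypothetical.

```python
import itertools
import random

def assign_to_cells(subject_ids, levels, seed=0):
    """Randomly assign an equal number of Ss to every cell of a factorial design."""
    cells = list(itertools.product(*levels))
    per_cell, remainder = divmod(len(subject_ids), len(cells))
    if remainder:
        raise ValueError("Ss cannot be divided equally among cells")
    shuffled = list(subject_ids)
    random.Random(seed).shuffle(shuffled)       # seeded shuffle for reproducibility
    return {cell: shuffled[i * per_cell:(i + 1) * per_cell]
            for i, cell in enumerate(cells)}

# Factor levels: learning exposure (sec.), recall exposure (sec.), AV (%).
levels = [(2, 4), (2, 4), (0, 33, 67, 100)]
groups = assign_to_cells(range(1, 81), levels)
error_df = 80 - len(groups)                     # within-cells df: N minus number of cells
```

With 80 Ss and 16 cells the within-cells df is 64, and the AV factor with four levels has 3 df, matching the df = 3/64 and df = 1/64 given for the F tests.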

Results.—As found in the earlier study,
total syllables Correct during the recall trial 
displayed significant heterogeneity of vari- 


1 This study is based on a thesis submitted by the
first author to the Graduate School, St. Louis University,
in partial fulfillment of the requirements for the Master of Arts degree.
Portions of this paper were presented at the 1961
meeting of the Midwestern Psychological Association.




ance between groups. The total letters cor- 
rect, a criterion used by Feldman and Under- 
wood (1957), indicated homogeneity of 
variance. Consequently, the latter measure 
served in the analysis of variance of treatment 
effects reported here. However, comparable 
results were obtained throughout for the 
heterogeneous total syllable scores. 

The overall means and SDs for AV were
10.30 and 4.77, 14.70 and 5.29, 16.60 and 4.16,
and 19.40 and 3.82 for 0, 33, 67, and 100%,
respectively. A trend analysis for these
means revealed a highly significant linear
trend (F = 22.34, P < .001). The overall
means and SDs for Ty were 12.52 and 5.17
for 2 sec. and 18.20 and 3.92 for 4 sec. The
corresponding Tr statistics were 13.90 and
4.54 for 2 sec. and 16.83 and 4.55 for 4 sec.
The main effects for AV, Ty, and Tr were all
significant (F = 10.85, df = 3/64, P < .001;
F = 26.98, df = 1/64, P < .001; F = 7.14,
df = 1/64, P < .01). Of the interactions,
only the Ty × Tr approached significance
(F = 2.85, df = 1/64, P < .10). This trend
suggested that Tr had little effect on the
2-sec. Ty groups but did on the 4-sec. Ty
groups. That is, the 4-sec. recall condition
was most effective in combination with the
4-sec. S-R learning condition.

The present results appear to verify the 
assumption made by Jantz and Underwood 
(1958) that their reversal for 0% and 33% 
represented a sampling artifact. The results 
also indicate that R-S learning is sensitive 
to exposure duration during both the learning 
and recall periods.


REFERENCES 


[Author illegible in source.] A re-evaluation of the meaningfulness
of all possible CVC trigrams. Psychol. [remainder illegible in source]

[Entry illegible in source] [...]tion, Little Rock, Arkansas, April [...]

POSTMAN, L., ADAMS, P. A., & PHILLIPS, L. W. Studies
in incidental learning: II. The [...]
method of testing. J. exp. Psychol.,
1955, [...].

ROSENBERG, S. Exposure int[...]ing.
Psychol. Rep., 1959, [...].


(Received May 20, 1961) 




SUPPLEMENTARY REPORT: EFFECTS OF INSTRUCTIONS ON EXTINCTION 
AND RECOVERY OF A CONDITIONED AVOIDANCE RESPONSE 


K. E. MOYER 
Carnegie Institute of Technology 


Lindley and Moyer (1961) found that, in 
the case of a classically conditioned finger 
withdrawal response, informing Ss just prior 
to extinction training that the UCS would no 
longer be delivered produced rapid extinction.
The present study replicated that
study, but with a conditioned avoidance 
procedure, and in addition tested an implica- 
tion of the drive level (D) interpretation of 
their result; viz., that the appropriate instruc- 
tions could raise D level and lead to an in- 
crease in conditioned responding. 

Method.—The same experimental room,
apparatus, intertrial stimulus intervals, method
of adjusting shock level, and pretests with
the tone alone were used (Lindley & Moyer, 
1961). The CS was a .5-sec. tone which preceded
a 1.5-sec. electric shock without overlap.
The finger could only move up when the CS 
and UCS were presented. No shock was 
delivered if S responded to the CS and kept 
his finger raised $ in. or more during the UCS 
interval. If S did not avoid the shock, 
moving the finger up $ in. escaped the shock. 

The booth in which S sat was dark. The 
initial instructions were similar to the previous 
instructions except that the avoidance rather 
than the classical CR procedure was explained. 
The S was told that we wanted to condition 


A factorial design involving number of
acquisition conditioning trials (to four CRs
in five consecutive trials, or that plus 20 trials),
instructions prior to extinction (Neutral 
or Inhibitory), and instructions after 20 
extinction trials (No Instructions or Resume 
UCS) was used. The Neutral instructions 
prior to extinction were: “Be sure to let your 
finger jump up to the tone when it feels like 
it”; the Inhibitory instructions were: “There 
will be no more shock presented from now on. 
I want you to try to prevent your finger from 
moving when the tone is presented.” After 
20 extinction trials half the Ss were given an
additional 5 extinction trials without any 
instructions (No Instructions) ; the remaining 
Ss were told: “From now on the tone will be 
followed by shock on most of the trials. 
Remember to let your finger jump up to the 


1 We would like to thank James H. Korn for collecting
the data for this study.


AND


RICHARD H. LINDLEY¹
Trinity University, Texas


tone if it feels like it” (Resume UCS). The
extinction procedure was continued for 5
trials (i.e., no shock was delivered). The
normal intertrial interval was used between
Trials 20 and 21 for the No Instructions
groups; this interval was longer for the Re- 
sume UCS groups owing to the reading of 
the instructions. 

The 56 Ss (both men and women) from
Carnegie Institute of Technology who reached
criterion within 35 trials were assigned to
the eight groups (N per group = 7) by a
randomized block procedure. Twenty-seven
Ss were rejected owing to failure to reach 
criterion within 35 trials, apparatus failure, 
or unwillingness to be shocked. 

Results.—The mean number of trials to
reach the criterion of four CRs in five con- 
secutive trials was 10.98, SD = 7.16. There 
were no significant differences among the 
groups in reaching this criterion (F = 0.30). 

In extinction, a CR was defined as any 
response that occurred during the .5-sec. 


[Figure 1: percentage of CRs (vertical axis) by blocks of five trials (horizontal axis) during extinction and recovery. Legend: criterion 4/5 CRs; 4/5 CRs + 20 trials; Inhibitory extinction instructions; Neutral extinction instructions.]


Fig. 1. Percentage of CRs for blocks of five trials
during extinction and recovery training and percentage
of CRs on the first postcriterial trial for the
4/5 + 20 trial groups.




duration of the CS. The variances of the 
number of CRs in extinction were highly
heterogeneous (χ² = 62.53, df = 3, P < .001).
Owing to the heterogeneity of variance, the
data of the groups that received neutral
instructions and of the groups that received
inhibitory instructions were analyzed separately
and the χ² median test was used at
times instead of the more usual t or F tests.

A χ² median test showed that the inhibitory
instructions significantly reduced the number
of CRs elicited in the 20 extinction trials
as compared to the neutral instructions
(χ² = 23.17, df = 1, P < .001). Figure 1
shows the extinction and recovery data for
all groups and also the percentage of CRs on
the first trial after Ss reached the 4/5 criterion
in the two groups with the 4/5 criterion plus
20 trials. The groups with the additional 20
conditioning trials gave more responses in
extinction than the groups that merely met
the 4/5 criterion (for the two groups with Neutral
instructions, t = 1.27, df = 26, P > .05;
for the groups with Inhibitory instructions,
t = 2.18, df = 26, P < .05).
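
A χ² median test of the kind referred to above can be sketched as follows. The report does not say how scores tied at the median were treated or whether a continuity correction was applied, so the sketch assumes ties are counted as not above the median and applies no correction; the function name `median_test_chi2` and the scores are hypothetical.

```python
import statistics

def median_test_chi2(group_a, group_b):
    """Uncorrected 2 x 2 chi-square for a median test on two independent groups."""
    cut = statistics.median(list(group_a) + list(group_b))  # pooled median
    a = sum(x > cut for x in group_a)   # group A scores above the pooled median
    b = len(group_a) - a                # group A scores at or below it
    c = sum(x > cut for x in group_b)
    d = len(group_b) - c
    n = a + b + c + d
    denom = (a + b) * (c + d) * (a + c) * (b + d)
    return n * (a * d - b * c) ** 2 / denom

# Hypothetical numbers of CRs in extinction for two small groups.
neutral = [3, 7, 8, 10]
inhibitory = [0, 1, 2, 9]
chi2 = median_test_chi2(neutral, inhibitory)
```

The resulting statistic is referred to the χ² distribution with df = 1. The sketch divides by zero if all scores fall on one side of the pooled median.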

The instructions designed to raise D level 
after 20 extinction trials had a significant 
effect on the groups that received Inhibitory 
instructions in extinction (median test
χ² = 8.58, df = 1, P < .01). The same instructions
had no demonstrable effect on the
groups that received Neutral instructions
in extinction (χ² = 1.29, df = 1, P > .05).

As in classical conditioning, inhibitory 
instructions dramatically reduced the number 
of CRs during extinction after avoidance 
conditioning, especially after the larger 
number of conditioning trials. This fits in 
with the notion that a habit factor is built 
up through reinforcement and is independent 
of a performance or drive factor which can be 
manipulated by instructions. The results 
also indicate that, for Ss who have been given 
inhibitory instructions, the appropriate in- 
structions can increase the number of CRs 
after extinction, presumably by raising D 
level. The latter result was not found for Ss 
who have been given neutral instructions in 
extinction, perhaps due to the already high 
level of responding of these groups after the 
20 extinction trials.


REFERENCE 
LINDLEY, R. H., & MOYER, K. E. Effects of instruc-
tions on the extinction of a conditioned finger-with-
drawal response. J. exp. Psychol., 1961, 61, 82-88.


(Received May 18, 1961) 




Journal of 


Experimental Psychology 


Vol. 64, No. 2 August 1962


SECONDARY REINFORCEMENT IN RATS AS A FUNCTION 
OF INFORMATION VALUE AND RELIABILITY 
OF THE STIMULUS¹


M. DAVID EGGER AND NEAL E. MILLER


Yale University 


Although secondary reinforcement has been of major importance to
behavior theory, especially in explanations of complex learning
phenomena (e.g., Hull, 1943; Miller, 1951; Skinner, 1938), little is
known about the conditions for its occurrence in any but the simplest
situations. The first hypothesis explored in the experiments reported
here is that in a situation in which there is more than one stimulus
predicting primary reinforcement, e.g., food, the more informative
stimulus will be the more effective secondary reinforcer. Further it is
asserted that a necessary condition for establishing any stimulus as a
secondary reinforcer is that the stimulus provide information about the
occurrence of primary reinforcement; a redundant predictor of primary
reinforcement should not acquire secondary reinforcement strength.

A possible situation in which to test this hypothesis is the following:
a short stimulus always precedes the delivery of food. But it is made
essentially redundant by being overlapped by a longer stimulus of
slightly earlier onset which is also invariably followed by food. This
situation is summarized in Fig. 1. The longer stimulus is labeled S₁ and
the shorter, S₂. For an S trained with this series of stimulus events,
S₂ is a reliable, but redundant, i.e., noninformative, predictor of
food. Hence, according to our hypothesis, S₁ should be an effective
secondary reinforcer; S₂ should acquire little or no secondary
reinforcing strength, even though it is closer in time to the occurrence
of food, and therefore in a more favorable position than is S₁ on the
gradient of delay of reinforcement.

There is a way, however, to make S₂ informative. If S₁ occurs a number
of times alone, unaccompanied by food, interspersed with occurrences of
the stimulus sequence shown at the bottom of Fig. 1, then S₂, when it
occurs, is no longer redundant; for now S₂ is the only reliable
predictor of food. Thus, it is predicted that for a group of rats who
receive the stimulus sequence depicted in Fig. 1 interspersed with
occurrences of S₁ alone, S₂ will be a considerably more effective
secondary reinforcer than for the group of rats who receive only the
stimulus sequence depicted in Fig. 1.

[Figure 1: upper portion, schematic course of the learnable drive; lower portion, stimulus sequence ending in food. Left panel: S₂ redundant. Right panel: S₂ informative.]

Fig. 1. Schematic representation of the theoretical analysis of the two
Main Experiment groups according to a strict interpretation of the
drive-reduction hypothesis.

¹ This study was supported by funds from Grant MY647 from the National
Institute of Mental Health, United States Public Health Service. We wish
to thank Elizabeth Sherwood for her assistance in running the animals.
A portion of the data reported in this paper was presented in a
Presidential Address to the American Psychological Association.

It should be noted that both groups
will receive exactly the same number
of pairings of S₂ with food and in
exactly the same immediate stimulus
context, so that if a difference were
found between the groups in the
secondary reinforcing value of S₂, it
could not be due to simple patterning,
stimulus-generalization decrement, or
differences in association with food.

Our predicted results would be
compatible with a strict interpretation
of the drive-reduction hypothesis of
reinforcement (Miller, 1959). Such
a theoretical analysis is represented
schematically in the upper portion of
Fig. 1. [...] to the drive-[...]
drive-reducing response. The left
side of Fig. 1 illustrates that if most
of the learnable drive already has been
reduced by S₁, little drive-reduction
remains to be conditioned to S₂.




On the other hand, if S₁ sometimes
fails to predict food, some of the condi-
tioned drive-reduction to it should
extinguish. Hence, as is depicted on
the right side of Fig. 1, more of the
drive-reduction should occur to, and
be conditioned to, S₂.

From Fig. 1, one can also see that 
the drive-reduction analysis also de- 
mands that the secondary reinforcing 
value of S₁ should be greater when
it is a reliable predictor (making S₂
redundant) than when it is an unre-
liable predictor (making S₂ informa-
tive). Thus we are led to our second
hypothesis, namely, that in a situa- 
tion in which a predictor of primary 
reinforcement exists which is both 
reliable and informative, this predic- 
tor should become a more effective 
secondary reinforcer than an unreli- 
able predictor. Note that here we 
predict the opposite of a partial-rein- 
forcement effect, which would be 
expected to increase the resistance 
to extinction of the unreliable pre- 
dictor, that is, the stimulus which had 
been paired with food only part of 
the time. In any prolonged test 
for secondary reinforcement, this in- 
creased resistance to extinction should 
show up as a greater total secondary-
reinforcing effect.


MAIN EXPERIMENT 
Method 


Subjects.—The Ss were 88 male rats of the
Sprague-Dawley strain who [...]
equipment failures, the data from 4 Ss were
lost, and the data from another 4, selected at
random, [...]




diameter rods running parallel to the side 
containing the Plexiglas door. Each box was 
enclosed in a large, light-proof, sound-dead- 
ened crate into which a stream of air was 
piped for ventilation and masking noise. 
Inside each of the Skinner boxes were two 
lights, one located 2 in. above the food cup, 
another located in the middle of the long back 
wall, opposite the Plexiglas door. The food 
cup was in the center of the front, 8-in. wall; 
the bar, a bent steel strip 1} in. wide, pro- 
truded } in. into the inner chamber of the 
box. The entire bar assembly was removable 
and, when withdrawn, its opening was sealed 
with a metal panel. The bar was located to 
the right of and slightly above the food cup. 
A downward force of at least 12 gm. on the 
bar activated a microswitch normally con- 
nected in the circuit of a Gerbrands feeder 
which delivered a standard .045-gm. Noyes 
pellet into the food cup. A loudspeaker was 
located 3 in. behind and slightly to the left 
of the front wall of the Skinner box. Both 
flashing lights (12 per sec.) and tones (750 
cps) were used as stimuli. 

Procedure.—All training sessions lasted
25 min. per day. During the first three 
sessions, Ss were magazine-trained in the 
absence of the bar. Then the bar was in- 
serted, and, for two sessions, each bar press 
was followed by a pellet of food. A few rats 
who did not spontaneously learn to press were 
given an extra remedial session during which 
bar pressing was “shaped.” Over the next 
four sessions the required ratio of responses 
to reinforcements was gradually increased 
to 4:1. 

Then, for the subsequent five sessions, the 
bar was removed, and Ss were randomly 
assigned to Group A (for whom S₂ was
reliable but redundant) and Group B (for
whom S₂ was reliable and informative).
Group A received the following sequence of 
events during each of its five “stimulus- 
training” sessions: once every 56 sec. on the 
average, a pellet of food was delivered into 
the food cup. The pellet was inevitably 
preceded by 2 sec. of S₁ and 1½ sec. of S₂.
Both stimuli overlapped the delivery of the 
food pellet by } sec., and both terminated 
together. 

Group B also received this stimulus se-
quence immediately preceding the delivery
of the food pellet. But in addition, Group B
Ss received aperiodically, interspersed with
the stimulus-food sequence, 2 sec. of S₁ alone.
The events for Group B occurred on the
average of once every 30 sec.

For half the Ss in each group, S₁ was a
flashing light and S₂ was a tone, and for the
other half, the conditions were reversed: S₁
was a tone and S₂ was a flashing light.

During 5 days of such training, each
group received 135 pairings of S₁ and S₂ with
food, and Group B received in addition about
110 occurrences of S₁ alone. Thus for both
groups S₂ was followed 100% of the time by
food, while S₁ was followed by food 100% of
the time for Group A, but only 55% of the
time for Group B.
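
The percentages above follow directly from the reported counts. The sketch below is only an illustration of that arithmetic; the function and variable names are ours, and the count of 110 is the approximate figure given in the text.

```python
def proportion_followed_by_food(paired_with_food: int, alone_without_food: int) -> float:
    """Proportion of a stimulus's occurrences that were followed by food."""
    total_occurrences = paired_with_food + alone_without_food
    return paired_with_food / total_occurrences


PAIRINGS = 135   # pairings of S1 and S2 with food, both groups
S1_ALONE = 110   # approximate number of S1-alone occurrences, Group B only

# S2 never occurred without food in either group.
p_s2 = proportion_followed_by_food(PAIRINGS, 0)
# S1 never occurred alone in Group A, but did in Group B.
p_s1_group_a = proportion_followed_by_food(PAIRINGS, 0)
p_s1_group_b = proportion_followed_by_food(PAIRINGS, S1_ALONE)

print(f"S2, both groups: {p_s2:.0%}")
print(f"S1, Group A:     {p_s1_group_a:.0%}")
print(f"S1, Group B:     {p_s1_group_b:.0%}")
```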

The above description of training applies 
to all but 16 Ss, 8 Group B and 8 Group A. 
For these Ss, training was exactly as described 
above except that the stimulus-food pairings 
occurred for both groups on the average 
of once every 75 sec. instead of 56 sec., and 
Group B received a stimulus event on the 
average of once every 15 sec. instead of 30 sec., 
so that S₁ was followed by food only 20%
of the time for Group B. These 16 Ss were
given seven 25-min. “stimulus-training”
sessions. The data from these Ss were
analyzed separately and not included in the 
overall analysis of variance. 
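
The same proportions can be approximated from the average intervals alone. The following is a rough sketch under the simplifying assumption that events occur exactly at their stated average spacing; the helper names are hypothetical. Under that idealization the main procedure yields slightly different figures from the reported 55%, 135 pairings, and about 110 occurrences of S₁ alone, while the 75-sec./15-sec. procedure yields the reported 20%.

```python
SESSION_SEC = 25 * 60  # each stimulus-training session lasted 25 min.


def fraction_followed_by_food(food_interval_sec: float, event_interval_sec: float) -> float:
    """Share of Group B stimulus events that end in food, from average intervals."""
    food_rate = 1.0 / food_interval_sec     # stimulus-food pairings per second
    event_rate = 1.0 / event_interval_sec   # all stimulus events per second
    return food_rate / event_rate


def expected_counts(food_interval_sec: float, event_interval_sec: float, sessions: int):
    """Idealized (pairings, S1-alone) counts over all training sessions."""
    total_sec = SESSION_SEC * sessions
    pairings = total_sec / food_interval_sec
    events = total_sec / event_interval_sec
    return pairings, events - pairings


main_fraction = fraction_followed_by_food(56, 30)      # main procedure
special_fraction = fraction_followed_by_food(75, 15)   # the 16 separately analyzed Ss
main_pairings, main_s1_alone = expected_counts(56, 30, sessions=5)

print(f"main procedure:    {main_fraction:.1%} of S1 events followed by food")
print(f"special procedure: {special_fraction:.1%} of S1 events followed by food")
print(f"idealized counts:  {main_pairings:.0f} pairings, {main_s1_alone:.0f} S1 alone")
```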

Testing.—On the day following the final
stimulus-training session, Ss were tested as
follows: the bar was reinserted and Test 
Session 1 began with each S pressing for food 
pellets on a fixed ratio of 3:1. The retraining 
presses continued until S had received 30 
pellets. At this point the bar was discon- 
nected and 10 min. of extinction ensued. 

At the end of the 10 min., the bar was 
reconnected, not to the feeder, but to a timer 
which delivered on the same 3:1 schedule 1
sec. of whatever stimulus was being tested
for secondary reinforcing strength. The test 
session continued until 25 min. had elapsed 
since the beginning of the extinction period, 
or until 10 min. after the first occurrence of a 
stimulus, whichever was longer. 
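
The stopping rule for a test session can be written compactly. This is a minimal sketch, assuming time is measured in minutes from the start of the extinction period; the function name is ours.

```python
def session_end_minute(first_stimulus_minute: float) -> float:
    """Minute (from the start of extinction) at which the test session ends.

    The session runs until 25 min. have elapsed since extinction began, or
    until 10 min. after the first stimulus was earned, whichever is longer.
    """
    return max(25.0, first_stimulus_minute + 10.0)


# An S that earns its first stimulus soon after the 10-min. extinction period
# stops at 25 min.; an S that earns it late is still given a full 10 min.
print(session_end_minute(10.5))
print(session_end_minute(18.0))
```

On this reading, an S that earned a stimulus immediately after extinction could press for stimuli for at most 15 min. in a session.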

In the foregoing procedure, relearning 
following experimental extinction was used 
as the measure of secondary reinforcing 
strength on the assumption that it would be 
more rapid and less variable than would de 
novo learning of the skill of pressing the bar. 
A preliminary study had validated this 
technique showing that in such a test more 
bar presses would occur when followed by a 
stimulus previously associated with food than 
when the stimulus had not been associated 
with food. 

After an interval of 1 day, Test Session 2 
was conducted, identical to the first, except 
that this time the stimulus delivered following 
the 10-min. extinction period was the opposite 
from that tested in Test Session 1; for half 




of the Ss, S₂ was tested in Test Session 1 and
S₁ was tested in Session 2; for the other half
of the Ss, trained and tested subsequent to
the first half, the stimuli were tested in the
opposite order.

For Ss tested first with S₂ and then with
S₁, Test Session 3 followed another inter-
vening day, this time with Ss pressing for
S₂ again. Throughout the course of the 10-
min. extinction and ensuing “pressing for
stimuli” period, the cumulative total number
of bar presses for each S was recorded each
minute.

Response measures—The total number of 
bar presses in a 10-min. period following the 
first occurrence of the stimulus was the meas-
ure of secondary reinforcing strength. Since
there were significant between-S and within-
S correlations (r_b = .53; r_w = .34) of this
measure with the total number of bar presses
in the 10-min. extinction periods, this total
number of bar presses in extinction was used
as a control variable in analyses of covariance.
(It should be noted that most of the bar
presses during extinction occurred within
the first 2-4 min. of the 10-min. extinction
period.)
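
To illustrate how a control variable enters an analysis of covariance, the sketch below adjusts group means on a test measure for group differences on a covariate, using the pooled within-group regression slope. The scores are invented for illustration only and are not data from this experiment; the function names are ours.

```python
def mean(values):
    return sum(values) / len(values)


def pooled_within_slope(groups):
    """Pooled within-group regression slope of y on x.

    `groups` maps a label to a list of (x, y) pairs, where x is the covariate
    (e.g., presses in extinction) and y is the measure being adjusted.
    """
    sxy = sxx = 0.0
    for pairs in groups.values():
        x_bar = mean([x for x, _ in pairs])
        y_bar = mean([y for _, y in pairs])
        sxy += sum((x - x_bar) * (y - y_bar) for x, y in pairs)
        sxx += sum((x - x_bar) ** 2 for x, _ in pairs)
    return sxy / sxx


def adjusted_means(groups):
    """Group means of y adjusted to the grand mean of the covariate x."""
    slope = pooled_within_slope(groups)
    grand_x = mean([x for pairs in groups.values() for x, _ in pairs])
    return {
        label: mean([y for _, y in pairs]) - slope * (mean([x for x, _ in pairs]) - grand_x)
        for label, pairs in groups.items()
    }


# Hypothetical (extinction presses, stimulus presses) scores for two groups.
hypothetical = {
    "group_1": [(90, 60), (110, 70), (130, 80)],
    "group_2": [(100, 85), (120, 95), (140, 105)],
}
slope = pooled_within_slope(hypothetical)
adjusted = adjusted_means(hypothetical)
print(slope, adjusted)
```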

Furthermore, since it was found that in no 
case would analyses based only on data from 
Test Session 1 have led to any substantially 
different conclusions from those reported 
below, the means and results of analyses 
reported (unless otherwise noted) are based 
on combined data from Test Sessions 1 and 2, 


Results 


Overall analysis.—Neither of the
hypotheses being tested depended 


TABLE 1

MEAN RESPONSES DURING 10 MIN. OF EXTINCTION AND 10 MIN. OF
“PRESSING FOR STIMULI”

                   S₁                     S₂
Group       Ext.     Pressing      Ext.          Pressing       S₁ + S₂
A           110.8    115.1         101.9         65.8           [illegible]
B           112.1     76.1         [illegible]   [illegible]    [illegible]
Mean                  95.6                       [illegible]

Note.—Test Sessions 1 and 2 combined.


upon the significance of the main 
effects of the overall analysis, but 
instead upon comparisons between the 
means shown in specific subcells of 
Table 1. The marginal entries in 
Table 1 give the overall means for 
Groups A and B (rows), and for S₁
and S₂ (columns). The overall mean
for each group is based on data from
32 Ss each tested with S₁ and with
S₂; the overall mean for each stimulus
position is based on data from all
64 Ss. As seen from an inspection
of Tables 1 and 2, Group A responded
significantly more than Group B and
the position of S₁ was reliably more
effective than that of S₂.

It should be noted that although 
the groups were identically treated 
in all other respects, the 32 Ss tested
with S₁ first and S₂ second were run
subsequent to the 32 Ss tested with
S₂ first and with S₁ second. No
significant differences between these
groups existed in the control variable,
total presses in 10 min. of extinction.
Nor did an analysis of covariance 
reveal any significant effects of order 
of testing (O), or of the interaction 
of order of testing with experimental 
group (G), or with stimulus position 
(P) (see Table 2). 

Across all groups, the Ss responded 
more for the flashing lights than for the 
tones (F = 8.45; df = 1/55; P < .01,
analysis of covariance).

Examination of the minute-by- 
minute response totals during the 
“pressing for stimuli” period revealed 
that the differences between groups 
tested at 10 min. had generally begun 
to appear after 3-5 min., and con- 
tinued to increase out to 15 min., 
which was the longest period any S 
was permitted to bar press for stimuli 
during a given test session.

As expected from our hypotheses, 
the (P x G) interaction was highly 
significant (F = 17.71; df = 1/55; 




TABLE 2

SUMMARY OF ANALYSIS OF VARIANCE AND COVARIANCE: TEST SESSIONS 1 AND 2 COMBINED

                             Analysis of Variance         Analysis of Covariance
Source                       df       MS          F       df       MS          F

Between Ss
  Experimental Group (G)      1     3,916.12     2.36      1     6,062.17     4.98*
  Modality of S₁ (M)          1     7,938.00     4.79*     1     4,792.37     3.93
  Order of S₁, S₂ (O)         1       435.13               1       709.44
  G × M                       1     2,907.03     1.75      1       573.36
  G × O                       1     2,329.03     1.41      1        50.93
  M × O                       1         9.0[?]             1        37.41
  G × M × O                   1     1,624.50               1     1,382.64     1.14
  Error (b)                  56     1,657.19              55     1,218.04   (r_b = .53)

Within Ss
  Stimulus Position (P)       1    14,663.28    10.83**    1    12,168.39     9.97**
  P × G                       1    24,864.50    18.36***   1    21,613.81    17.71***
  P × M = St                  1    15,664.50    11.56**    1    10,316.32     8.45**
  P × O = T                   1    52,650.13    38.87***   1     1,594.90     1.31
  P × G × M                   1        87.78               1       546.78
  P × M × O                   1        35.13               1        16.45
  P × G × O                   1     5,330.28     3.94      1     3,168.95     2.60
  P × G × M × O               1     5,781.27     4.27*     1     6,720.79     5.51*
  Error (w)                  56     1,354.57              55     1,220.64   (r_w = .34)

Note.—St = modality of stimulus tested; T = test session (1 or 2).
*P < .05.
**P < .01.
***P < .001.


P < .001, analysis of covariance).
Hence, we were justified in making 
within experimental group and stim- 
ulus position comparisons. 
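
Each F in Table 2 is the effect mean square divided by the appropriate error mean square, so the covariance F ratios can be checked from the printed mean squares. The sketch below does this for a few effects; the names are ours, and the pairing of each effect with Error (b) or Error (w) follows the layout of the table.

```python
# Error mean squares printed in the covariance columns of Table 2.
MS_ERROR_BETWEEN = 1218.04  # Error (b), df = 55
MS_ERROR_WITHIN = 1220.64   # Error (w), df = 55


def f_ratio(ms_effect: float, ms_error: float) -> float:
    """F ratio for one effect: its mean square over the error mean square."""
    return ms_effect / ms_error


covariance_f = {
    "G": f_ratio(6062.17, MS_ERROR_BETWEEN),      # between-Ss effect
    "P": f_ratio(12168.39, MS_ERROR_WITHIN),      # within-Ss effects
    "P x G": f_ratio(21613.81, MS_ERROR_WITHIN),
    "P x M": f_ratio(10316.32, MS_ERROR_WITHIN),
}

for effect, f_value in covariance_f.items():
    print(f"{effect}: F = {f_value:.2f}")
```

The P × G ratio computed this way agrees with the F = 17.71 quoted in the text.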

S₂: Group B vs. Group A.—On the
basis of our first hypothesis, we ex-
pected that Group B Ss, for whom S₂
was informative, should press more
for S₂ than Group A Ss, for whom S₂
was redundant. The difference be-
tween the group means on the sec-
ondary reinforcing measure was in
the predicted direction and significant
beyond the .05 level (F = 4.03;
df = 1/56). (The means are given
in Table 1.) However, the effect was
not statistically reliable in an analysis
of covariance.

As mentioned above, 16 Ss, 8 each
in Groups A and B, were trained with
the number of occurrences of S₁ alone
for Group B increased so that 80%
of the stimulus events for Group B
were unaccompanied occurrences of
S₁. For these Ss, tested with S₂ in Test
Session 1, the means on the secondary
reinforcing measure were in the pre-
dicted direction, 97.5 vs. 88.0, but the
difference was short of statistical
significance. However, when these
data were analyzed in an analysis of
covariance and combined by means
of a critical ratio test with the data
discussed above, the predicted effect
was significant beyond the .05 level.
(CR = 1.97 if the data from these
16 Ss are combined with those from
the 64 Ss tested with S₂ in Test
Session 1 or Test Session 2; CR = 2.02
if the data are combined with those
from the 32 Ss tested with S₂ in Test
Session 1 only.)

S₁: Group A vs. Group B.—Our
second hypothesis predicted that S₁
would be a more effective secondary
reinforcer for Group A, for whom it




was reliable and informative, than
for Group B, for whom it was un-
reliable. This prediction was borne
out by the data beyond the .001 level
(F = [illegible]; df = 1/55; analysis of
covariance).

Group A: S₁ vs. S₂.—As predicted
from our first hypothesis, S₁ was a
much more effective secondary rein-
forcer than S₂ for Group A. The
difference between the means for
these two stimulus positions, 115.1
vs. 65.8, was significant beyond the
.001 level (F = 26.35; df = 1/27;
analysis of covariance).


CONTROL EXPERIMENTS 


Pseudoconditioned and unconditioned con-
trol.—Fourteen Ss, male albino rats, handled
exactly as in the Main Experiment, were
trained in groups of 7 Ss each with stimulus
sequences identical to those of Groups A and
B, except that the stimuli were never paired
with the occurrence of food, which was de-
livered at least 10 sec. after the occurrence
of the stimuli. The two different patterns of
stimuli used in training had no effect upon
the pseudoconditioned rate of bar pressing.
The mean for the 14 Ss with both test sessions
combined was 64.3. These 14 Ss bar pressed
for the stimuli significantly less in both Test
Session 1 (t = 3.41; df = 28; P < .005) and
Test Session 2 (t = 2.72; df = 28; P < .02)
than did the 16 Group A Ss bar pressing for
the informative stimulus (S₁) in each of the
Main Experiment test sessions. Hence, in a
group predicted to show a large secondary
reinforcing effect, we did indeed find such an
effect produced by our training procedure.
Eight Ss were exposed to the stimuli dur-
ing training exactly as described above,
except that the food pellets were eliminated
entirely. The unconditioned rate of pressing
for the stimuli was comparable to that of the
pseudoconditioned group (M = 73.4).

The mean for the total group of pseudo-
conditioned and unconditioned Ss with both
test sessions combined was 67.6, indicating
that the secondary reinforcing value of the
redundant stimulus for Group A of the Main
Experiment (M = 65.8), once the uncondi-
tioned rate of pressing for stimuli is taken into
account, was small, if not zero, as we pre-
dicted from our first hypothesis. The
estimates of the pseudoconditioned and un-
conditioned scores may be somewhat high,
however, since these Ss tended to have higher
10-min. extinction scores than did Ss of the
Main Experiment.
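
The combined control mean is the mean of the two control groups weighted by their sizes. A brief check of that arithmetic, with names of our choosing:

```python
def weighted_mean(groups):
    """Mean over all Ss, given (n, group_mean) pairs."""
    total_n = sum(n for n, _ in groups)
    return sum(n * group_mean for n, group_mean in groups) / total_n


control_groups = [
    (14, 64.3),  # pseudoconditioned Ss
    (8, 73.4),   # unconditioned Ss
]
combined_mean = weighted_mean(control_groups)
print(f"combined control mean = {combined_mean:.1f}")
```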

Activation control.—To test whether the 
effects studied in the Main Experiment were 
related to secondary reinforcement or only 
to a possible activation effect of a stimulus 
formerly associated with food (Wyckoff, 
Sidowski, & Chambliss, 1958), 10 additional 
Ss were trained exactly as in the Main Experi- 
ment, 5 as in Group A and 5 as in Group B. 
However, during the testing of these Ss, the 
bar remained nonfunctional once it was 
disconnected from the feeder. Each S was 
tested at the same time as an identically 
trained S used in the Main Experiment. 
The yoked Activation Control S received only 
the stimuli earned by his Main Experiment 
partner. If the Main Experiment S pressed
for a stimulus within 7½ sec. of a yoked Activa-
tion Control S's response, the stimulus for
the Activation Control S was delayed so that
it was not delivered until 7½ sec. after his
response. Hence spurious pairings of stimuli 
and pressing could not occur. 

Thus, for these 10 Ss, any pressing which 
occurred during the retraining test period 
could have been due only to the activation 
effects of the stimuli plus remaining operant 
level; the possibility of secondary reinforce- 
ment was eliminated. 

In Test Session 1, all 10 of the Activation 
Control Ss pressed less than did their second-
ary-reinforced partners (P < .002, binomial
test, two-tailed). In Test Session 2, 9 out of 
10 pressed less than did their yoked partners 
(P < .02, binomial test, two-tailed). Hence, 
we are quite certain that in the Main Ex- 
periment we were indeed studying secondary 
reinforcement. 
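
The binomial test on yoked pairs can be sketched as an exact sign test with chance probability one-half per pair. The code below is an illustration, not the authors' computation: obtaining the two-tailed value by doubling the one-tailed probability is our assumption. With that convention, 10 of 10 gives a value just under .002, and 9 of 10 gives a value of roughly .02.

```python
from math import comb


def sign_test_two_tailed(successes: int, n: int) -> float:
    """Exact two-tailed binomial (sign) test with p = 1/2.

    The two-tailed value is taken as twice the one-tailed probability of a
    split at least as extreme as the one observed (capped at 1).
    """
    k = max(successes, n - successes)
    one_tail = sum(comb(n, i) for i in range(k, n + 1)) / 2 ** n
    return min(1.0, 2 * one_tail)


p_session_1 = sign_test_two_tailed(10, 10)  # all 10 controls pressed less
p_session_2 = sign_test_two_tailed(9, 10)   # 9 of 10 controls pressed less
print(f"Test Session 1: P = {p_session_1:.4f}")
print(f"Test Session 2: P = {p_session_2:.4f}")
```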

Partial reinforcement effect control.—In the
Main Experiment we had found that in the
presence of a reliable predictor (S₂), training
with partial reinforcement of S₁ produced
less total pressing for S₁ as a secondary rein-
forcer than did 100% reinforcement. This
confirmed our hypothesis but was opposite
to the effect of increased resistance to extinc-
tion usually found with partial reinforcement.
In order to see whether the presence of the
reliable predictor was indeed the crucial
factor, we ran two special control groups of 8
Ss each, one with the usual partial reinforce-
ment procedure and one with 100% rein-
forcement. These groups were identical in
all respects to those of the Main Experiment,
except that the reliable predictor, S₂, was
omitted. When these groups were tested,




the partial reinforcement group tended to
press more for the stimuli than did the con-
tinuous reinforcement group (though the
difference between the group means, 128.6
vs. 115.6, was not statistically significant).
However, the difference between these two
groups was in the opposite direction and
significantly different (F = 5.71; df = 1/35;
P < .025) from the difference found between
Test Session 1 means of the 32 Ss of the Main
Experiment tested with S₁ during Test
Session 1. Thus it appears that the presence
of S₂, the reliable predictor of food, did play
the crucial role in determining the direction
of the results obtained in our tests of the
secondary reinforcing value of S₁.


Discussion 


Our situation differed from those in 
which the effect of partial reinforcement 
on the establishment of secondary rein- 
forcement has been studied (e.g., Klein, 
1959; Zimmerman, 1957, 1959) in that 
during training all our Ss had a reliable 
predictor of food. The seemingly crucial 
importance of the presence or absence 
of a reliable predictor during training 
may help to explain the apparently 
conflicting results obtained from single- 
group vs. separate-group experimental 
designs in determining the effects of 
partial reinforcement on the strength 
of a secondary reinforcer (e.g., D'Amato, 
Lachman, & Kivy, 1958). It may be 
that partial reinforcement will increase 
resistance to extinction of a secondary 
reinforcer only if training occurs in the 
absence of a reliable predictor. 

It should be noted that our formula- 
tion of the conditions necessary for the 
establishment of a secondary reinforcer 
is compatible with the well-known “dis- 
criminative stimulus hypothesis” of sec- 
ondary reinforcement (Keller & Schoen- 
feld, 1950; Schoenfeld, Antonitis, & 
Bersh, 1950). Furthermore, our results 
with respect to S₂: Group B vs. Group
A could perhaps be considered analogous
to those reported by Notterman (1951) 
in studies using rats as Ss in both a Skin- 
ner box and a straight alley. 


SUMMARY 


Albino rats (N = 88, male) were trained
to press a bar for food, then divided randomly
into two groups and trained as follows for
135 trials in the same Skinner boxes with the
bars removed: two stimuli, when paired,
ended together and always preceded food.
For Group A, the second, shorter stimulus
(S₂) was always redundant because the first
stimulus (S₁) had already given reliable
information that food was to come. But for
Group B, S₂ was informative, because for
them S₁ also occurred sometimes alone
without food.

After the training sessions, the bars were 
reinserted, bar pressing was retrained with 
food pellets, extinguished, and then retrained 
again, this time using 1 sec. of one of the 
training stimuli as a secondary reinforcer 
in place of the food. The total number of 
bar presses in 10 min. following the first 
occurrence of the reinforcing 
stimulus was used as the measure of secondary 
reinforcing strength. The testing procedure 
was repeated after 48 hr. using the other 
training stimulus as secondary reinforcer, 
so that all Ss were tested with both stimuli 
in a balanced sequence. 

Control experiments were run to provide 
baseline levels for pseudoconditioned and
unconditioned rates of pressing, and for any 
activating effect of the stimuli. 

As predicted, S₂ was a stronger secondary
reinforcer when it was informative than when
it was redundant; S₁ was a more effective
secondary reinforcer than S₂ in that group for
which S₂ was a redundant predictor of primary
reinforcement. In addition, S₁ was a more
effective secondary reinforcer when it had
been a reliable predictor of food.


REFERENCES 


D'AMATO, M. R., LACHMAN, R., & KIVY, P.
Secondary reinforcement as affected by
reward schedule and the testing situation.
J. comp. physiol. Psychol., 1958, 51, 737-
741.

HULL, C. L. Principles of behavior. New
York: Appleton-Century, 1943.

KELLER, F. S., & SCHOENFELD, W. N.
Principles of psychology. New York:
Appleton-Century-Crofts, 1950.

KLEIN, R. M. Intermittent primary rein-
forcement as a parameter of secondary
reinforcement. J. exp. Psychol., 1959, 58,
423-427.

MILLER, N. E. Learnable drives and re-
wards. In S. S. Stevens (Ed.), Handbook
of experimental psychology. New York:
Wiley, 1951. Pp. 435-472.

MILLER, N. E. Liberalization of basic S-R
concepts: Extensions to conflict behavior,




motivation, and social learning. In S. 
Koch (Ed.), Psychology: A study of a 
science. Vol. 2. New York: McGraw- 
Hill, 1959. Pp. 196-292. 

MILLER, N. E. Analytical studies of drive
and reward. Amer. Psychologist, 1961, 16,
739-754.

NOTTERMAN, J. M. A study of some relations
among aperiodic reinforcement, discrimina-
tion training, and secondary reinforcement.
J. exp. Psychol., 1951, 41, 161-169.

SCHOENFELD, W. N., ANTONITIS, J. J., &
BERSH, P. J. A preliminary study of
training conditions necessary for secondary
reinforcement. J. exp. Psychol., 1950, 40,
40-45.

SKINNER, B. F. The behavior of organisms.
New York: Appleton-Century, 1938.

WYCKOFF, L. B., SIDOWSKI, J., & CHAMBLISS,
D. J. An experimental study of the rela-
tionship between secondary reinforcing
and cue effects of a stimulus. J. comp.
physiol. Psychol., 1958, 51, 103-109.

ZIMMERMAN, D. W. Durable secondary 
reinforcement: Method and theory. Psy- 
chol. Rev., 1957, 64, 373-383. 

ZIMMERMAN, D. W. Sustained performance 
in rats based on secondary reinforcement. 
J. comp. physiol. Psychol., 1959, 52, 353- 
358. 


(Received July 10, 1961) 




Journal of Experimental Psychology
1962, Vol. 64, No. 2, 105-109 


INTERACTIONS AMONG THE SOMESTHETIC SENSES IN 
JUDGMENTS OF SUBJECTIVE MAGNITUDE¹


F. NOWELL JONES, DAVID SINGER, AND PAUL A. TWELKER 


University of California, Los Angeles 


That perceptual blends occur among 
touch, warmth, and cold is well known 
(e.g., Boring, 1942). Interest in these 
perceptual phenomena has been en- 
hanced recently by the revival of the 
suggestion that somesthetic senses 
of the skin are in fact not separate, 
but depend upon the same receptor 
and afferent mechanisms (Sinclair, 
1955; Weddell, Palmer, & Pallie, 
1955). Neurophysiological investi- 
gations support the view that some 
overlapping of mechanisms does occur, 
especially between touch and cold 
(Hensel & Boman, 1960). It is 
also true, for example, that the abso- 
lute thresholds for vibration may be 
altered by cooling or warming the 
skin (Weitz, 1941), although the 
physiological basis of the alterations 
is not clear. 

Whether or not one can make judg- 
ments about one kind of stimulus 
dimension uncontaminated by the 
presence of a different kind of stimu- 
lus is another question. That the 
absolute threshold for a given mo- 
dality can be changed by altering 
the condition of the skin or by im- 
proved coupling of the stimulator to 
the skin is not the case in point. 
Prolonged cooling or warming of the 
skin may change the threshold to 
touch by altering either the sensitivity 
of the receptors or by changing the 
biophysical characteristics of the skin. 
Changing the pressure of an applied 
cold or warm stimulator may alter 


1 This research was supported by Contract 
DA-49-007-MD-1001 between The Surgeon 
General of the United States Army and the 
University of California. 


the temperature thresholds by chang- 
ing the conditions of coupling to the 
skin. Our concern is rather with the 
question of S’s ability to make judg- 
ments on the basis of information 
resulting from one variety of adequate 
stimulation in the presence of ade- 
quate stimulation of another kind. 
If he were able to make independent 
judgments, it would lend some sup- 
port to the position of those who 
regard touch, cold, and warmth to be 
essentially independent channels of 
information. 


METHOD 


Apparatus.—Since we proposed to present
mechanical stimuli simultaneously with either 
cold or warmth, it was necessary to devise 
an apparatus which permitted controlled 
variation of these using the same stimulator 
for all. For this reason the apparatus finally 
devised used a temperature stimulator which 
could be either cooled or warmed, and which 
was attached to a device providing mechanical 
displacement for touch stimulation. The 
temperature stimulator is described in detail 
elsewhere (Jones, Twelker, & Singer, 1962), 
and consisted of a semiconductor thermo- 
couple junction which could be cooled or 
warmed by the application of a direct current 
of proper orientation. The time constant, 
approximately 2 sec., was satisfactory for 
our purposes, as was the visual control of 
stimulus temperature by means of an im- 
bedded thermocouple. The effective junction 
consisted of a copper disc 7.1 mm. in diameter 
which was used as the stimulus tip throughout 
the experiment. With this rather large 
stimulator it was not necessary to search 
for “temperature spots” to insure temperature 
stimulation. Mechanical displacement was
achieved by means of a specially wound
500-ohm loudspeaker motor² whose movement
was controlled by an imposed current.
Precise control of the amount, rate, and form
of displacement was provided by feedback
from the stimulator itself. Observed on an
oscilloscope, the displacement of the stimula-
tor was a straight line, with very sharp bends
at the beginning and end of movement. There
was no evidence of overshoot or ringing at
the termination of displacement.

² The loudspeaker motor was made by
Stephens Tru-Sonic, Incorporated, Culver
City, California.

Procedure.—There were four experimental
conditions: in Cond. P-W pressure was judged
with concomitant warmth; in Cond. P-C
pressure was judged with concomitant cold;
in Cond. W-P warmth was judged with concomitant
pressure; in Cond. C-P cold was
judged with concomitant pressure. In
each case, the primary stimulus dimension
had five levels and the concomitant dimension
had four. Pressure was defined as extent of
stimulator movement, or depth of intrusion,
this being the stimulus parameter previously
selected as most useful (Jones, 1960). Temperature
stimuli were defined as above or below
skin temperature, which, needless to
say, made it necessary to measure the skin
temperature for every S and adjust the stimulus
series.³ The Ss differed from each other
by as much as 2° C. For Cond. P-W and P-C
the five pressure levels were 1, 2, 3, 4, and 5
mm. of rectilinear stimulus movement at a
rate of 2 mm./sec. The four concomitant
temperature stimuli were 0, 2, 5, and 9° C.
above or below the measured skin temperature,
respectively. For Cond. W-P and C-P,
there were five temperature stimuli either
above or below skin temperature by 0, 1, 3, 6,
and 9° C., with four equally spaced concomitant
pressure stimuli from 1 to 5 mm.

For any of the pairs of stimulus dimensions
there were 5 × 4 possible combinations of
stimulus conditions. Since ordinal effects
were expected, a 20 × 20 analysis of variance
table was constructed according to the suggestions
of Williams (1949). The design
permitted the slightly confounded appraisal
of first-order ordinal effects. Each row of
the table represented 1 S, so that there was
a total of 20 Ss. The same Ss served under
all four conditions, but at four separate
sittings. The orders of conditions were
arranged as follows: for Ss 1, 5, etc., P-C, C-P,
W-P, P-W; for Ss 2, 6, etc., W-P, P-W, P-C,
C-P; for Ss 3, 7, etc., P-W, W-P, C-P, P-C;
for Ss 4, 8, etc., C-P, P-C, P-W, W-P.
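The 20 × 20 arrangement described above can be illustrated with a short sketch. It uses the standard cyclic construction for an even number of treatments, in which every treatment follows every other treatment exactly once across the rows. Whether this is the particular square the authors drew from Williams (1949) is not stated in the text, so the function `williams_square`, the numbering of the 20 combinations, and the assignment of rows to Ss are all illustrative assumptions. The stimulus values are those given for Cond. P-W and P-C.

```python
from collections import Counter
from itertools import product


def williams_square(n):
    """Cyclic Latin square balanced for first-order carryover (n must be even).

    Assumed construction: first row 0, 1, n-1, 2, n-2, ...; each later row
    adds 1 (mod n) to the row above it.
    """
    if n % 2 != 0:
        raise ValueError("this single-square construction needs an even n")
    first_row = [0]
    low, high = 1, n - 1
    for position in range(1, n):
        if position % 2 == 1:
            first_row.append(low)
            low += 1
        else:
            first_row.append(high)
            high -= 1
    return [[(t + shift) % n for t in first_row] for shift in range(n)]


# 5 pressure levels x 4 concomitant temperature levels = 20 combinations
pressures_mm = [1, 2, 3, 4, 5]
temperature_offsets_c = [0, 2, 5, 9]
combinations = list(product(pressures_mm, temperature_offsets_c))

# One row per S, one column per serial position (hypothetical assignment)
square = williams_square(len(combinations))

# Count how often each ordered pair of treatments occurs in adjacent positions
pair_counts = Counter(
    (row[i], row[i + 1]) for row in square for i in range(len(row) - 1)
)

# Presentation order for the first row, as (mm, degrees C) pairs
first_subject_order = [combinations[t] for t in square[0]]
```

In this sketch each of the 380 ordered pairs of distinct combinations occurs exactly once in adjacent positions, which is the balance property that allows first-order ordinal effects to be appraised.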


the absolute temperature differential, the 
actual skin temperature must be taken into 
account and is the logical zero point on the 
stimulus scale.


The stimuli were judged by the method of 
magnitude estimation (Stevens, 1957), since 
this had proved to be a useful method in 
previous work on pressure (Jones, 1960). 
In Cond. P-W and P-C, Ss were instructed 
to judge on the basis of pressure or touch, 
while in Cond. C-P and W-P they were 
instructed to judge on the basis of coldness 
and warmth, respectively. For pressure, 
a standard stimulus movement of 3 mm. was 
designated as 10. In the case of temperature,
a 3° C. deviation from skin temperature
with a stimulus movement of 1 mm. was 
designated as 10. In every case the standard 
was presented three times at the beginning 
of the experimental session, and not after- 
wards repeated. 
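Magnitude estimates are commonly summarized by a power function, estimate = k × stimulus^n, which becomes a straight line after a logarithmic transformation. The sketch below fits that line by ordinary least squares. The function name `fit_power_law` and the data are hypothetical: the estimates are generated from an arbitrary exponent of 1.5 and scaled so that the 3-mm. standard receives the value 10. They are not results from this experiment.

```python
import math


def fit_power_law(stimuli, estimates):
    """Least-squares fit of log10(estimate) = log10(k) + n * log10(stimulus)."""
    xs = [math.log10(s) for s in stimuli]
    ys = [math.log10(e) for e in estimates]
    count = len(xs)
    mean_x = sum(xs) / count
    mean_y = sum(ys) / count
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    exponent = sxy / sxx
    k = 10 ** (mean_y - exponent * mean_x)
    return exponent, k


# Hypothetical data: five pressure levels (mm.), standard of 3 mm. called 10
pressures_mm = [1, 2, 3, 4, 5]
made_up_estimates = [10 * (p / 3) ** 1.5 for p in pressures_mm]

exponent, k = fit_power_law(pressures_mm, made_up_estimates)
predicted_at_standard = k * 3 ** exponent
```

Differences among Ss in the fitted exponent correspond to differences in slope on the logarithmic scale.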

Three special problems of stimulus control
require comment. First, with temperature
stimulation there is a tendency for the skin
temperature to drift as the result of stimulation.
This was avoided by returning the
stimulator to the previously measured skin
temperature after each stimulation. Second,
the timing of combined pressure and temperature
stimuli is complicated by the shorter time
constant of the skin for mechanical as compared
to thermal stimuli. In the present
experiment, the concomitant temperature
change was initiated approximately 5 sec.
before the mechanical stimulus. Third, the
ambient temperature remained within 1° F.
of 72° F. throughout the experiment.

Subjects.—The Ss were drawn from one 
beginning and one advanced class in psychol- 
ogy. They were not experienced in psycho- 
physical judgment, but very few had any 
difficulty. A few Ss were eliminated because 
of failure to understand the instructions or 
because of difficulties with the apparatus. 


RESULTS 


After a logarithmic transformation, 
the data for each of the four condi- 
tions were analyzed for ordinal ef- 
fects according to the routine given 
by Cochran and Cox (1957, pp.
135-138). There was no significant 
first-order effect for any condition, 
hence no correction for order was 
made. Tables 1 and 2 give the analy- 
ses of variance for the four conditions. 
As might be expected (Jones et al., 
1961), there is a highly significant S 
effect, using a conservative error term 




INTERACTIONS AMONG SOMESTHETIC SENSES 107 


(triple interaction), in every condi- 
tion, and the S X Primary Modality 
interactions are significant, indicating 
differences in slope. The primary 
effect of the dimension to be judged 
is highly significant in every case, 
again as might be expected. In no 
case is the effect of the concomitant 
stimulus modality significant. In 
only one case, Cond. C-P, is the inter- 
action between modalities significant. 
Inspection of the Cold X Pressure 
table indicates that this interaction 
results from a slight tendency for 
the intermediate but not extreme cold 
stimuli to be judged greater when 
accompanied by a greater pressure. 
An analysis was performed to dis- 
cover whether or not there was an 
overall effect of cold or warmth on 
judgments of pressure, since this could 
not be determined from the analysis 
of each condition separately. The 
result shows that there is no signifi- 
cant effect, pressures being judged 
the same whether accompanied by 
cold or warmth (F = .10, df = 1/19). 
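The F ratios in Table 1 can be checked against the tabled mean squares. The sketch below does this for the concomitant-warmth columns. The text names the triple interaction as the error term for the S effect only; the other pairings of effect and error term are an inference from the printed ratios, not something the authors state. The dictionary names are illustrative.

```python
# Design sizes given in the text: 20 Ss, 5 primary levels, 4 concomitant levels
n_subjects, n_primary, n_concomitant = 20, 5, 4

df = {
    "Ss": n_subjects - 1,
    "W": n_concomitant - 1,
    "P": n_primary - 1,
}
df["Ss x W"] = df["Ss"] * df["W"]
df["Ss x P"] = df["Ss"] * df["P"]
df["P x W"] = df["P"] * df["W"]
df["P x W x Ss"] = df["P"] * df["W"] * df["Ss"]

# Mean squares from Table 1, concomitant-warmth columns
mean_squares = {
    "Ss": 0.48,
    "W": 0.02,
    "P": 3.63,
    "Ss x W": 0.03,
    "Ss x P": 0.08,
    "P x W": 0.02,
    "P x W x Ss": 0.02,
}

# Assumed error terms: each main effect against its interaction with Ss,
# everything else against the triple interaction
error_term = {
    "Ss": "P x W x Ss",
    "W": "Ss x W",
    "P": "Ss x P",
    "Ss x W": "P x W x Ss",
    "Ss x P": "P x W x Ss",
    "P x W": "P x W x Ss",
}

f_ratios = {
    source: mean_squares[source] / mean_squares[error]
    for source, error in error_term.items()
}
```

Under these assumptions the computed ratios agree with the tabled values to within the rounding of the two-decimal mean squares (for example, 3.63/.08 is about 45.4 against a tabled 45.5).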


TABLE 1

ANALYSIS OF VARIANCE OF MAGNITUDE ESTIMATES OF PRESSURE MADE WITH
CONCOMITANT WARMTH OR COLD

                               Concomitant        Concomitant
                               Warmth (W)         Cold (C)
Source                df       MS      F          MS      F

Ss                    19      .48    24.0**      .59    19.3**
Warmth (or Cold)       3      .02      .66       .04     1.3
Pressure (P)           4     3.63    45.5**     4.56    91.2**
Ss X W (or C)         57      .03     1.5*       .03     1.0
Ss X P                76      .08     4.0**      .05     1.6**
P X W (or C)          12      .02     1.0        .04     1.3
P X W (or C) X Ss    228      .02                .03

* P = .05.
** P = .01.


TABLE 2

ANALYSIS OF VARIANCE OF MAGNITUDE ESTIMATES OF WARMTH AND COLD MADE
WITH CONCOMITANT PRESSURE

Source

Ss
Pressure (P)
Warmth (W) or Cold (C)
W (or C) X P
Ss X W (or C) X P

** P < .01.


DISCUSSION

That the somesthetic senses somehow
interact to provide perceptual patterns
such as “wet,” “oily,” and the like is a
commonplace observation which was at 
one time, at least, the subject of careful 
study (cf. references in Boring, 1942, 
pp. 521-522). Also indicative of inter- 
action is the tendency for cold objects 
to be judged heavier than warm ones 
(Weber, 1905). From neurophysiological 
work, including the recording of action 
potentials from human nerve fibers 
(Hensel & Boman, 1960), we know that 
cold stimulation and pressure stimula- 
tion affect, in part, the same fibers, and 
we also know that there is some over- 
lapping of representation in the cerebral 
cortex (Landgren, 1957). Even without 
accepting in detail the arguments of 
Weddell et al. (1955) and Sinclair (1955), 
among others, that there is no real differ- 
entiation of the skin senses, we should 
still expect, on the basis of both the 
perceptual and neurophysiological litera- 
ture, some significant degree of interac- 
tion. We have found very little, if any. 
Pressure can be judged independently 
of concomitant temperature stimula- 
tion, and cold and warmth can be judged 
independently of concomitant pressure 
stimulation, with the possible exception 
of a small interaction of cold and pressure. 

Psychologically, that is, as far as the 
total system response is concerned, 
there is very little to say about the 
results beyond their demonstration and 
the suggestion that further work may 
refine the results presented here, especially
in regard to the effects of changing
the temporal relationships among the
stimuli. The neurophysiological impli- 
cations, however, require comment. In 
the first place, the possibility of inde- 
pendence of magnitude judgments re- 
quires that there be differentiation among 
receptors. As Brindley (1957) has 
pointed out for the visual system, the 
nervous system cannot create informa- 
tion out of an undifferentiated peripheral 
response, no matter how complex the 
subsequent transformations. Support is 
lent, therefore, to the position that sepa- 
rate receptors are involved in the re- 
sponse to different varieties of stimula- 
tion. In the second place, it appears 
that the judgments of magnitude are 
made on the basis of some steady state in 
the case of temperature, rather than 
upon the initial burst of impulses obtain- 
able not only from cold receptors but 
from presumed pressure receptors asso- 
ciated with large axones upon the 
application of cold (Hensel & Zotterman, 
1951). It is possible that different 
temporal relationships among the stimuli 
would lead to the discovery of interac- 
tions that our particular choice of timing 
has not revealed, but for the present such 
a suggestion is purely speculative. A 
third, and final point is that the over- 
lapping in cortical representation of the 
various skin modalities, even though 
involving convergence on the same cor- 
tical neuron in some instances, somehow 
does not interfere with modality-specific 
judgments. Furthermore, the facilita- 
tory effect of pressure stimulation sug- 
gested by Landgren (1957) has no 
psychological counterpart in our results.
Consideration of the latencies involved
would tentatively suggest that the nonspecific
cells found in the cortex are
related to the arousal system rather
than to a specific sensory pathway, and 
would not be expected to be directly 
involved in judgments of magnitude. 

We may conclude, therefore, that our
results are compatible with the existence
of separable mechanisms of response to
pressure and thermal stimulation. Although
we cannot argue for the acceptance
of any particular neurological
mechanism as underlying our results, 
the ideas advanced here are parsimonious 
and in harmony with the weight of 
neurophysiological evidence. In any 
case, no matter what the underlying 
mechanism, we have shown that pres- 
sure, cold, and warmth can be responded 
to selectively in the presence of con- 
comitant stimulation. 


SUMMARY 


Twenty Ss gave magnitude estimates of
pressure stimuli in the presence of concomitant
cold or warm stimuli, and magnitude
estimates of cold and of warmth in the
presence of concomitant pressure stimuli.
It was found that judgments of magnitude can
be made independently of concomitant
stimulation in another modality. It was
suggested that this result is consistent with
the assumption of separable neurological
mechanisms for the skin senses under consideration.


REFERENCES 


Boring, E. G. Sensation and perception in
the history of experimental psychology.
New York: Appleton-Century, 1942.

Brindley, G. S. Two theorems in colour
vision. Quart. J. exp. Psychol., 1957, 9,
101-104.

Cochran, W. G., & Cox, G. M. Experimental
designs. (2nd ed.) New York:
Wiley, 1957.

Hensel, H., & Boman, K. K. A. Afferent
impulses in cutaneous sensory nerves in
human subjects. J. Neurophysiol., 1960,
23, 564-578.

Hensel, H., & Zotterman, Y. The response
of mechanoreceptors to thermal stimulation.
J. Physiol., 1951, 115, 16-24.

Jones, F. N. Some subjective magnitude
functions for touch. In G. R. Hawkes
(Ed.), Symposium on cutaneous sensitivity.
Ft. Knox, Ky.: United States Army
Medical Research Laboratory, 1960. (No.
424)

Jones, F. N., Twelker, P. A., & Singer,
D. A waterless thermal somesthetic
stimulator. Amer. J. Psychol., 1962,
75, 147-149.

Landgren, S. Convergence of tactile,
thermal, and gustatory impulses on single
cortical cells. Acta physiol. Scand., Stockh.,
1957, 40, 210-221.

Sinclair, D. C. Cutaneous sensation and
the doctrine of specific nerve energy.
Brain, 1955, 78, 584-614.

Stevens, S. S. On the psychophysical law.
Psychol. Rev., 1957, 64, 155-181.

Weber, E. H. Der Tastsinn und das Gemeingefühl.
Leipzig: Engelmann, 1905.

Weddell, G., Palmer, E., & Pallie, W.
Nerve endings in mammalian skin. Biol.
Rev., 1955, 30, 159-195.

Weitz, J. Vibratory sensitivity as a function
of skin temperature. J. exp. Psychol., 1941,
28, 21-36.

Williams, E. J. Experimental designs
balanced for the estimation of residual
effects of treatments. Aust. J. scient.
Res. Ser. A, 1949, 2, 149-168.

(Received June 6, 1961)


Journal of Experimental Psychology 
1962, Vol. 64, No. 2, 110-116


POSTCONDITIONING DELAY AND INTENSITY OF SHOCK AS 
FACTORS IN THE MEASUREMENT OF 
ACQUIRED FEAR¹

WALLACE R. McALLISTER AND DOROTHY E. McALLISTER 


Syracuse University 


That the presentation of a stimulus 
previously paired with electric shock 
can serve as the basis for the learning 
of another response is a well-estab- 
lished finding. The most convincing 
evidence is provided by a two-stage 
experiment. First, a neutral CS is 
paired with inescapable electric shock 
(UCS). Then the formerly neutral 
stimulus is presented without the 
shock, and Ss are allowed to make 
another (indicator) response which 
leads to the termination of the CS. 
Examples of such experiments are 
those of Brown and Jacobs (1949) 
and Kalish (1954). A variation of
these procedures was used in the 
original study of Miller (1948). Theo- 
retically, it is assumed that fear, as a 
response, is classically conditioned to 
the neutral CS in the first stage of the 
experiment. In the second stage, 
fear elicited by the CS serves as a 
motivator and its decrease, occurring 
when the CS is terminated, acts as a 
reinforcer for the learning of the 
indicator response.

Despite the number of experiments
which have supported the theory,
difficulty in obtaining evidence for
the acquired drive of fear with the
above procedures has been reported
(Brown & Jacobs, 1949; Solomon &
Brush, 1956, p. 221). Two preliminary
experiments conducted in this
laboratory, in which hurdle jumping
was used as the indicator response,
likewise yielded negative results. As
a first step in attempting to account
for these results, a study designed to
determine the optimal shock intensity
for conditioning was begun. The
initial results indicated that, regardless
of the shock level employed, the
hurdle-jumping response was not
learned. Subsequent investigation
revealed, however, that learning became
evident when additional trials
were given on the following day.
Therefore, the study was redesigned
and conducted as reported in Exp. I.
The purpose of Exp. II was to explicate
the finding of Exp. I that
learning occurred only on the second
day of training.


National Institute of Mental Health, Public
Health Service. The authors are indebted
to Ronald A. Housman and Joseph F. Legg


EXPERIMENT I 


The purpose of this study was to 
investigate the effect on hurdle- 
jumping performance of the intensity 
of shock used during fear conditioning. 


Method 


Subjects.—The Ss were 100 naive, female,
hooded rats from the colony maintained by 
the Psychology Department at Syracuse 
University. Nine additional Ss were dis- 
carded, 8 for apparatus failure and 1 because 
of extreme difficulty in handling during hurdle 
jumping. The Ss were randomly paired and 
five pairs were then assigned at random to 
each of 10 groups. Their ages ranged from 
103 to 258 days at the start of the experiment. 
The distribution of ages was approximately 
the same for the groups, with means varying 
between 143 and 168 days. 




MEASUREMENT OF ACQUIRED FEAR 


Apparatus.—Two shock boxes and a hurdle-jumping
apparatus were used. The hurdle-jumping
apparatus consisted of two boxes,
03 in. long X 44 in. wide X 5 in. high (in- 
terior dimensions), separated by a j-in. 
partition containing a guillotine door, 24 in. 
wide X 3 in. high, which rested on a hurdle 
2 in. high. The grid box, painted white, had 
a floor made of y-in. brass welding rods 
spaced 1% in. apart. The safe box, painted 
gray, had a plywood floor constructed so as 
to serve as a floor switch. A .01-sec. Standard
Electric timer was started with the opening 
of the guillotine door and was stopped by 
depression of the floor. 

The grid and safe boxes each had another 
box, 13} in. high, hinged to its top which 
contained light sources and acted as a cover. 
The bottom of each upper box was covered 
with hardware cloth, 1 in. above which was 
inserted a pane of opal glass. A 74-w. lamp 
located in each of the upper boxes and a 40-w. 
lamp located in the box above the grid
provided the intertrial and CS illuminations. 

So that Ss could be run in pairs, two 
separate shock boxes, wired in parallel, were 
used for the fear-conditioning phase of the 
experiment. They were constructed to appear 
identical with the grid box of the hurdle- 
jumping apparatus. Hunter interval timers 
were used to control the presentations of the 
CS and the UCS; a Haydon timer controlled 
the duration of the intertrial interval. Ex- 
cept for the use of a variac, monitored by a 
voltmeter, to control the level of shock to the 
grids, the circuit employed was that described 
by Wyckoff and Page (1954). A 100,000-ohm 
resistor was in series with each S. The grids
were energized successively at a rate of two 
impulses, of about 13-msec. duration each, 
per grid per sec. 

Design and procedure.—The procedures
for each S required 4 days. The first 2 days 
were devoted to handling and to exploration 
of the hurdle-jumping apparatus. On each
of these days each S was handled for two 10- 
min. sessions and was allowed to explore each 
side of the hurdle-jumping apparatus for 10 
min. with the guillotine door closed. During 
handling, E was seated in front of a table,
34% X 174 in., enclosed on three sides with 
curtains. Handling consisted of alternately 
picking up and stroking S and placing her 
on the table to explore. The sequence of 
treatment for 1 S of the pair was handling, 
exploration of the grid box, handling, and 
exploration of the safe box, for 10 min. each. 
Since Ss in a pair were treated concurrently, 
the sequence for the other member began 
with exploration of the grid box. During
the first 5 min. of handling on the first day
and the last 5 min. on the second day, S
remained on the table and was not touched
so that an activity measure could be obtained.
These data will not be discussed in this paper.

On the third day, Ss in each of five groups 
were given 35 forward-conditioning (FC) 
trials. The CS was an increase in illumina- 
tion of 6-sec. duration from 7 to 115 ft-c, 
measured with a Weston illumination meter, 
Model 756. The UCS was a shock of 2-sec. 
duration which was presented 4 sec. after 
the onset of the CS. The intensity of shock 
delivered to the groups was either 30, 40, 50, 
60, or 100 v. Five other groups were given 
35 backward-conditioning (BC) trials, one 
group at each shock level. For these groups 
15 sec. intervened between the offset of the 
shock and the onset of the CS. For BC Ss, 
trials were started 10 sec. after placement in 
the shock boxes. To equalize the amount 
of time spent in them, FC Ss were placed 
in the boxes 605 sec. before their first trial. 
A 2-min. intertrial interval was used for all 
groups. 
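The within-trial timing of the two conditioning procedures can be laid out as a small sketch. The durations are those given above (6-sec. CS, 2-sec. shock, shock onset 4 sec. after CS onset for FC, 15 sec. between shock offset and CS onset for BC). The `Trial` class and the function names are illustrative, and times are measured from the first event of the trial.

```python
from dataclasses import dataclass

CS_DURATION = 6.0      # sec., increase in illumination
SHOCK_DURATION = 2.0   # sec.


@dataclass(frozen=True)
class Trial:
    cs_on: float
    cs_off: float
    shock_on: float
    shock_off: float


def forward_trial(start=0.0, cs_to_shock=4.0):
    """FC: CS onset first, shock onset 4 sec. later."""
    shock_on = start + cs_to_shock
    return Trial(
        cs_on=start,
        cs_off=start + CS_DURATION,
        shock_on=shock_on,
        shock_off=shock_on + SHOCK_DURATION,
    )


def backward_trial(start=0.0, shock_to_cs=15.0):
    """BC: shock first, CS onset 15 sec. after shock offset."""
    shock_off = start + SHOCK_DURATION
    cs_on = shock_off + shock_to_cs
    return Trial(
        cs_on=cs_on,
        cs_off=cs_on + CS_DURATION,
        shock_on=start,
        shock_off=shock_off,
    )


fc = forward_trial()
bc = backward_trial()
```

With these values the CS and shock end together on an FC trial, while on a BC trial the CS does not begin until 17 sec. after shock onset.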

Ten seconds after the last conditioning 
trial, Ss were removed to separate holding 
boxes. Approximately 2 min. later 25 hurdle- 
jumping trials were begun. On each trial 
S was placed in the grid box facing the guillo- 
tine door which was raised after 10 sec. The 
CS was presented simultaneously with the 
raising of the door and was terminated by 
depression of the floor switch when S crossed 
the hurdle into the safe box. No shock was 
administered during this phase of the experi- 
ment. When S jumped, the door was closed, 
and after 10 sec. S was returned to the holding 
box. If no jump occurred within 60 sec., S 
was removed to the holding box and a latency 
of 60 sec. recorded. The Ss were run alter- 
nately with a minimum intertrial interval 
of 30 sec. After the trials, Ss were weighed. 

On the fourth day, 25 additional hurdle- 
jumping trials were given. On this day, if S 
failed to jump within 60 sec. on 10 consecutive 
trials, training was terminated and 60 sec. 
was recorded for each of the remaining trials.
Throughout this paper, the 2 hurdle-jumping 
days will be referred to as Day 1 and Day 2. 
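The scoring rules just described (a 60-sec. ceiling on latency, the Day 2 termination rule, and the reciprocal-of-latency measure averaged over blocks of five trials) can be sketched as follows. The function names are hypothetical, and the sketch assumes the reciprocal is taken of latency in seconds before averaging within a block.

```python
MAX_LATENCY = 60.0  # sec.; recorded when no jump occurred


def complete_day2(latencies, n_trials=25, run=10):
    """Apply the Day 2 rule: after `run` consecutive failures to jump,
    training stops and 60 sec. is recorded for every remaining trial."""
    recorded = []
    consecutive_failures = 0
    for latency in latencies:
        latency = min(latency, MAX_LATENCY)
        recorded.append(latency)
        consecutive_failures = (
            consecutive_failures + 1 if latency >= MAX_LATENCY else 0
        )
        if consecutive_failures == run:
            break
    recorded.extend([MAX_LATENCY] * (n_trials - len(recorded)))
    return recorded


def block_means(latencies, block=5):
    """Mean reciprocal of latency for successive blocks of `block` trials."""
    reciprocals = [1.0 / latency for latency in latencies]
    return [
        sum(reciprocals[i:i + block]) / block
        for i in range(0, len(reciprocals), block)
    ]


# Hypothetical S: one jump at 5 sec., then ten consecutive failures
example = complete_day2([5.0] + [75.0] * 10 + [3.0] * 14)
example_blocks = block_means(example)
```

Because failures enter as 1/60, a block made up entirely of failures has a mean reciprocal of about .017 rather than zero.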

Food and water were available at all times 
in the home cages. 


Results 


In Fig. 1 the means of the reciprocals
of latency of hurdle jumping in
five-trial blocks are plotted for each
group. No evidence of learning appears
until Day 2 of hurdle jumping
when, in general, the performances of
those groups conditioned with the
higher levels of shock improve. Except
for the unusual performances of
both 50-v. groups, for which no satisfactory
explanation can be offered,
there is an increasing monotonic
relationship between mean performance
on Day 2 and intensity of shock
for both FC and BC groups. The BC
groups (except the one with 50 v.)
performed as well as their FC counterparts.


Fig. 1. Mean reciprocal of latency of hurdle jumping as a function of blocks of five trials
following forward and backward conditioning at each shock level. (Curves: 30, 40, 50, 60,
and 100 volts; abscissa: blocks of five trials; ordinate: mean reciprocal of latency.)

Because the assumption of homogeneity
of variance was untenable, a
trend analysis of variance was not
used. To evaluate the changes in
performance over the hurdle-jumping
trials, a t test for related measures
was computed for each group using
the means of reciprocals of latency
on Trial Blocks 1 and 10. Using a
5% coefficient of risk, adopted for all
statistical tests reported in this paper,
the 50- and 60-v. FC groups showed
a significant gain in performance,
while that for the 100-v. FC group approached
significance (t = 2.86, 3.18,
and 1.93, df = 9, respectively). Of
the BC groups, only the 100-v. group
showed a significant gain (t = 2.80,
df = 9).
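A t test for related measures on 10 Ss has 9 df, and the reported values can be compared with the tabled critical value. The sketch below shows the computation on hypothetical difference scores and then classifies the four reported t values. It assumes a two-tailed test at the 5% level (critical t of 2.262 for 9 df), which is consistent with 1.93 being described as approaching significance. The function name `paired_t` and the dictionary labels are illustrative.

```python
import math
from statistics import mean, stdev


def paired_t(first, last):
    """t for related measures: mean difference over its standard error."""
    diffs = [b - a for a, b in zip(first, last)]
    n = len(diffs)
    t = mean(diffs) / (stdev(diffs) / math.sqrt(n))
    return t, n - 1


# Hypothetical scores for 10 Ss on Trial Blocks 1 and 10 (not data from the paper)
block_1 = [0.0] * 10
block_10 = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0, 8.0, 9.0, 10.0]
t_example, df_example = paired_t(block_1, block_10)

# Reported values, all with df = 9
T_CRIT_05_DF9 = 2.262  # two-tailed, assumed
reported_t = {"FC 50 v.": 2.86, "FC 60 v.": 3.18, "FC 100 v.": 1.93, "BC 100 v.": 2.80}
significant = {group: t > T_CRIT_05_DF9 for group, t in reported_t.items()}
```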

To determine the effects of intensity 
of shock and type of conditioning, the 
performance measures on Trial Block 
10 were transformed, because of the 
failure to meet the assumption of 
homogeneity of variance, and then 
subjected to a factorial analysis of 
variance. The measures for this 
block of trials were ranked over all 
groups and then converted to mean 
deviations using an extension of
Table XX from Fisher and Yates
(1957) provided by Porter (1958). 
Since the interaction between in- 
tensity of shock and type of condi- 
tioning was significant (F = 3.31, 
df = 4/90), simple analyses of vari- 
ance were used to evaluate the dif- 
ferences among the means of the 
shock groups for each of the condi- 
tioning procedures. For the FC 
groups and for the BC groups, the 
means differed significantly (F = 5.22
and 2.76, df = 4/45, respectively).
A comparison was then made be- 
tween the individual group means 
within each of the conditioning pro- 
cedures. For FC, the 50-, 60-, and 
100-v. groups were significantly su- 
perior to the 30- and 40-v. groups. 
For BC, the 100-v. group was sig- 
nificantly superior to the 30- and 
50-v. groups. No other difference 
was significant. 
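The rank transformation described above replaces each score by the normal deviate expected for its rank, after which the usual analysis of variance is applied. The authors read these values from an extended table; the sketch below substitutes Blom's approximation, (r - 3/8)/(n + 1/4), which gives values close to, but not identical with, tabled expected normal scores. The handling of ties by average rank is also an assumption, and `normal_scores` is a hypothetical name.

```python
from statistics import NormalDist


def normal_scores(values):
    """Rank the values (ties get the average rank) and convert each rank r
    to an approximate expected normal deviate by Blom's formula."""
    n = len(values)
    order = sorted(range(n), key=lambda i: values[i])
    ranks = [0.0] * n
    i = 0
    while i < n:
        j = i
        while j + 1 < n and values[order[j + 1]] == values[order[i]]:
            j += 1
        average_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = average_rank
        i = j + 1
    standard_normal = NormalDist()
    return [standard_normal.inv_cdf((r - 0.375) / (n + 0.25)) for r in ranks]


# Hypothetical block-10 scores for three Ss
scores = normal_scores([0.30, 0.10, 0.20])
```

The transformed scores are symmetric about zero and preserve the ordering of the original measures, so group differences in rank are retained while the variances are brought onto a common scale.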

A further analysis was made be- 
tween FC and BC groups for each 
level of shock. The 50-v. FC group
was significantly superior to the 50-v. 
BC group. Since none of the other 
differences was significant and since 
both 50-v. groups performed in an 
unusual manner, it is difficult to 
interpret this finding. 


Discussion 

The general tendency for learning to 
improve with increases in the intensity 
of shock used during fear conditioning is 
in agreement with previous findings 
(Goldstein, 1960; Miller, 1951). Such 
a result would be expected since, pre- 
sumably, the amount of fear conditioned 
is related to the level of shock which, 
thereby, would determine the degree of 
learning of the indicator response. 

Two results are not consistent with 
those usually obtained. (a) Learning 
of the indicator response was not evi- 
denced until late in training (on Day 2). 
This finding will be discussed in connection
with the results of Exp. II. (b)
Learning of the indicator response occurred
following BC procedures. Although
Goldstein (1960) has recently
reported similar findings, other investi- 
gators (Kalish, 1954; Porter, 1958) have 
found no learning in BC groups. Most 
likely such learning is based on fear 
conditioned to the apparatus cues which, 
when strong shock is used, is not extin- 
guished during the intertrial intervals. 
It should be noted that a period per- 
mitting extinction of fear to the appa- 
ratus cues was introduced between 
conditioning and hurdle jumping in the 
experiments of Kalish and Porter but 
not in that of Goldstein or in the present
study.


EXPERIMENT II


The finding of Exp. I that learning 
of the hurdle-jumping response did 
not occur until Day 2 can be taken to 
indicate that either the elapse of a 
minimal time interval following con- 
ditioning or the administration of a 
minimal number of hurdle-jumping 
trials is a necessary condition for 
learning. The present experiment 
was designed to allow a choice be- 
tween these two alternatives. For 
some Ss there was a 1-day delay be- 
tween conditioning and hurdle jump- 
ing while for other Ss there was no 
delay. If the number of trials is the 
important variable, no difference would
be expected between the delay and 
no-delay conditions. If, on the 
other hand, the postconditioning delay 
interval is crucial, learning should 
occur on the first day of hurdle 
jumping following the delay. 


Method 


Subjects.—The Ss were 40 naive, female,
hooded rats ranging between 98 and 111 days 
of age at the beginning of the experiment. 
The source of Ss and the method of pairing 
and assigning Ss to groups were the same as in 
Exp. I. 

Apparatus.—The same apparatus was
used as in the previous experiment. 

Design and procedure.—Four groups of 10
Ss each, two FC and two BC, were used. All
procedures were the same as in Exp. I except
that the shock level was 70 v. for all groups
and that for two groups (FC-D and BC-D)
there was a delay of approximately 22½ hr.
between the completion of conditioning and
the beginning of hurdle jumping. For the
other two groups (FC-ND and BC-ND),
there was no delay beyond the minimal time
required to prepare the apparatus (approximately
2 min.).


Results 


The means of the reciprocals of
latency of hurdle jumping in blocks
of five trials for each of the groups
are presented in Fig. 2. The superiority
of performance of the FC-D
group as compared with the other
three groups is clearly evident on Day
1 and is maintained on Day 2. The
BC groups and the FC-ND group
appear to perform similarly and at
about the same level throughout
training. For the statistical analysis
of the differences between the groups,
the reciprocals of latency on Trial
Block 5 were transformed in the manner
described in Exp. I and then
subjected to a factorial analysis of
variance with Delay (D or ND) and
Type of Conditioning (FC or BC)
as the factors. Since the interaction
was significant (F = 9.03, df = 1/36),
the simple effects were analyzed.
For FC the D group performed better
than the ND group (t = 4.45, df = 36)
while for BC the groups did not differ.
For the D condition, the FC group
performed significantly better than
the BC group (t = 3.95, df = 36)
while for the ND condition the groups
did not differ. Thus, under the conditions
of this experiment, the Delay
variable was effective only following
FC while the Conditioning variable
was effective only when a delay was
used.


Fig. 2. Mean reciprocal of latency of hurdle jumping as a function of blocks of five trials
following forward conditioning (FC) and backward conditioning (BC) with a delay (D) or
with no delay (ND) between conditioning and hurdle jumping.


The changes in performance from
Trial Blocks 1 to 5 were evaluated
with t tests for related measures.
Only Group FC-D showed a signifi- 
cant improvement in performance 
(t = 4.36, df = 9). The failure to 
find any evidence for learning in 
Group BC-D probably can be at- 
tributed to the level of shock used 
since in later experiments, when 
higher levels of shock were employed, 
groups trained under that condition 
did learn. No evidence of learning 
was shown by the ND groups on either 
day. The lack of learning on Day 2 
in Group FC-ND is inconsistent with 
the results of Exp. I in which Group 
FC with a comparable level of shock 
(60 v.) did learn. From Fig. 1 and 2, 
however, it can be seen that the 
performances of these two FC groups 
are similar through the seventh block 
of trials. Only on the last three 
blocks of trials is the performance of the
Exp. I group noticeably higher. Since 
the absolute difference in performance 
is not great, it is probably most 
parsimonious to attribute the dis- 
crepancy to sampling error. The 
results for the BC groups with com- 
parable shock levels from the two 
experiments are consistent since learn- 
ing was not shown in either case. 


Discussion 


In this experiment, learning of the 
hurdle-jumping response occurred early 
in training only with an adequate post- 
conditioning delay. Thus, the failure 
to obtain any evidence of learning on 
Day 1 in Exp. I can be attributed to the 
use of an inadequate delay rather than 
to the lack of a sufficient number of 
hurdle-jumping trials. The results are 
consistent with those of other investi- 
gators (Brown & Jacobs, 1949; Kalish, 
1954; Kent, Wagner, & Gannon, 1960; 
Porter, 1958) who have reported rapid 
learning of the hurdle-jumping response. 
A similar delay was employed in these 


studies following all, or the major portion 
of, the conditioning trials, but its use 
seems to have been fortuitous, and 
the relevance of this temporal variable 
in this type of experiment seems not to 
have been recognized. The existing 
data do not allow a determination of the 
critical length of delay. 

No evidence is provided by the present 
data concerning the nature of the events 
occurring during the postconditioning 
delay which leads to its effect on hurdle- 
jumping performance. Several possible 
interpretations can, however, be men- 
tioned. Two of these assume that the 
strength of the fear response varies in 
time. One example is the incubation of 
fear hypothesis (Bindra & Cameron, 
1953, p. 197). Another, based on the 
findings of Perkins and Weyant (1958), 
assumes that the strength of the fear 
response to generalized stimuli increases 
during the postconditioning delay. Since 
conditioning and hurdle jumping were 
carried out in different, although similar, 
apparatuses, stimulus generalization might 
be an important factor. Other inter- 
pretations rely on the variation in time 
of the strength of some factor which 
interferes with the operation of the fear 
response. For instance, it may be
assumed that following conditioning a 
strong, general emotional state, which 
dissipates with time, is present as an 
aftereffect of shock (Amsel & Maltzman, 
1950). In this case, immediately fol- 
lowing conditioning, hurdle jumping 
might not result in a sufficient decrease 
in total emotionality to be reinforced. 
On the other hand, it could be assumed 
that the cues resulting from this general 
emotionality elicit some response in- 
compatible with hurdle jumping, such 
as crouching, which would interfere with 
the learning until the emotionality was 
dissipated. At this time there are no 
firm grounds for choosing between these
alternatives.

SUMMARY 


Two experiments concerned with the 
classical conditioning of fear in rats and
the measurement of its effect through the 
learning of another response (hurdle jumping)
were conducted. In Exp. I the effect of 
intensity of shock (30, 40, 50, 60, or 100 v.) 
used during conditioning was investigated. 
At each shock level one group was given for- 
ward conditioning (light-shock) and one, 
backward conditioning (shock-light). For 
all groups hurdle-jumping trials in which S 
could escape the light by jumping a hurdle 
were administered immediately following 
conditioning and were continued on the next 
day. Evidence of learning was obtained 
following both forward and backward condi- 
tioning but only on the second hurdle-jumping 
day. Performance, in general, was better 
following conditioning with the higher shock 
levels. The results of Exp. II indicated that 
learning does occur on the first day of hurdle 
jumping when a postconditioning delay of 1 
day is used.


REFERENCES 


AMSEL, A., & MALTZMAN, I. The effect upon
generalized drive strength of emotionality
as inferred from the level of consummatory
response. J. exp. Psychol., 1950, 40,
563-569.

BINDRA, D., & CAMERON, L. Changes in
experimentally produced anxiety with
the passage of time: Incubation effect.
J. exp. Psychol., 1953, 45, 197-203.

BROWN, J. S., & JACOBS, A. The role of fear
in the motivation and acquisition of responses.
J. exp. Psychol., 1949, 39, 747-

FISHER, R. A., & YATES, F. Statistical tables
for biological, agricultural and medical
research. London: Oliver & Boyd, 1957.

GOLDSTEIN, M. L. Acquired drive strength
as a joint function of shock intensity and
number of acquisition trials. J. exp.
Psychol., 1960, 60, 349-358.

KALISH, H. I. Strength of fear as a function
of the number of acquisition and extinction
trials. J. exp. Psychol., 1954, 47, 1-9.

KENT, N. D., WAGNER, M. K., & GANNON,
D. R. Effect of unconditioned response
restriction on subsequent acquisition of a
habit motivated by “fear.” Psychol. Rep.,
1960, 6, 335-338.

MILLER, N. E. Studies of fear as an acquirable
drive: I. Fear as motivation and fear-reduction
as reinforcement in the learning
of new responses. J. exp. Psychol., 1948,
38, 89-101.

MILLER, N. E. Learnable drives and rewards.
In S. S. Stevens (Ed.), Handbook of experimental
psychology. New York: Wiley,
1951. Pp. 435-472.

PERKINS, C. C., JR., & WEYANT, R. G. The
interval between training and test trials
as a determiner of the slope of generalization
gradients. J. comp. physiol. Psychol.,
1958, 51, 596-600.

PORTER, L. G. Generalization of fear and of
the inhibition of fear. Unpublished doctoral
dissertation, Syracuse University,
1958.

SOLOMON, R. L., & BRUSH, E. S. Experimentally
derived conceptions of anxiety
and aversion. In M. R. Jones (Ed.),
Nebraska symposium on motivation: 1956.
Lincoln: Univer. Nebraska Press, 1956.
Pp. 212-305.

WYCKOFF, L. B., & PAGE, H. A. A grid for
administering shock. Amer. J. Psychol.,
1954, 67, 154.


(Received June 22, 1961) 




Journal of Experimental Psychology 
1962, Vol. 64, No. 2, 117-122 


OVERLEARNING AND POSITION REVERSAL ¹


M. R. D'AMATO AND H. JAGODA


New York University 


In a previous paper (D'Amato & 
Jagoda, 1961) it was shown that the 
overlearning effect (the facilitation 
of discrimination reversal by extended 
postcriterion training) did not occur 
in a visual discrimination task if Ss 
were required (via forced trials) to 
have a moderate amount of experience 
with the negative stimulus (S−) during
the postcriterion training. The interpretation
of this result was that continued
experience with S− during
overlearning prevented a reduction 
in avoidance of that stimulus (as 
normally occurs, presumably, during 
extended overtraining), and with 
avoidance of S− maintained at a
relatively high level, facilitation of 
reversal learning was precluded. 

The purposes of the present studies 
were twofold. First, the preceding 
analysis of the overlearning effect is 
not as easily applied to position dis- 
crimination reversal, where there exist 
no positive and negative stimuli as 
such. And yet, the overlearning ef- 
fect has been reported in a position 
discrimination task (Pubols, 1956). 
One aim of the present studies, there- 
fore, was to determine whether, in 
position reversal, a moderate propor- 
tion of forced trials to the incorrect 
side during postcriterion training 
would similarly result in the disap- 
pearance of the overlearning effect. 
Second, in most overlearning studies 
(e.g., Birch, Ison, & Sperling, 1960; 
Bruner, Mandler, O'Dowd, & Wallach,
1958; Pubols, 1956; Reid, 1953)
the overtraining trials were distributed
over an extended period, some
12 to 30 days, while the control Ss
were reversed immediately upon
reaching acquisition criterion. Since
there is some evidence that mere
delay between acquisition and reversal
serves to facilitate reversal learning
(Bunch, 1939; Murofushi, 1957;
Stevenson & Weir, 1959), the evaluation
of this variable by the addition
of an appropriate control group would
seem advisable; such a control group
is incorporated in two of the studies
reported below.

1 This research was supported by Research
Grant M-2051 from the National Institute
of Mental Health, National Institutes of
Health, United States Public Health Service,
and Grant G-14724 from the National Science
Foundation.


EXPERIMENT I
Method 


Subjects and apparatus.—The Ss were 72
experimentally naive albino rats (25 males 
and 47 females), 80 to 130 days of age at the 
start of the study. Four automatic Y maze 
discrimination apparatuses, fully described 
elsewhere (D'Amato & Jagoda, 1960), were 
used. 

Pretraining.—Four days prior to the be- 
ginning of acquisition, Ss were placed on a 
standard 23-hr. water deprivation regimen. 
Each S was placed in a darkened arm of the 
maze and permitted to drink five dipperfuls 
of water on the day preceding the beginning 
of acquisition; three dipperfuls were allowed 
on the subsequent day. 

Position acquisition.—All Ss were trained
on a right position response. Ten trials were
given on Day 1, S being forced on Trial 2 
to the side opposite to that chosen on Trial 1; 
on Trial 7, S was forced to the same side as 
that chosen on Trial 1. On all subsequent 
acquisition days, 20 trials per day were given, 
with 1 out of every 5 trials forced; 2 of the 4 
daily forced trials were to the incorrect 
(left) side. In order to facilitate generaliza- 
tion between free and forced trials, trans- 
parent plastic doors were used. 

On a correct choice the sequence of events 
was as follows. The S was permitted 1.75 sec.
± .25 sec. drinking (reward interval), timed
from S’s first lap of water, after which the 
dipper retracted out of sight. The lights in 
the maze remained on for an additional 8 sec. 
A 45-sec. ± 3 sec. intertrial interval followed,
during which all lights in the maze were 
extinguished. Finally, a flashing light (60 
flashes per min.) of 4.5 sec. ± .5 sec. duration
signaled the beginning of the next trial. On 
an incorrect choice the sequence of events 
was identical, except that the dipper was 
retracted during the reward interval. The 
illumination in the right and left arms was 
identical and, using our standard procedure, 
measured 1.0 ft-c. It should be noted that,
unlike most choice situations, no extramaze 
cues are available in our apparatus; thus, 
the position response, being uncorrelated 
with any cue, is very nearly a “pure” turning 
response.

Fifteen to 30 min. after the day's trials, 
Ss were watered for 1 hr. During acquisition, 
overlearning, and reversal the estimated 
average duration of water deprivation at the 
start of a day’s trials was 22.5 hr. 

As Ss reached acquisition criterion (18 
correct responses out of any 20 successive 
trials) in squads of 4, they were assigned at 
random to one of the following four groups 
(18 Ss per group). Group C was the standard 
control group, Ss of this group being placed 
on reversal the day after reaching criterion. 
Group CD was the added control group that 
had the beginning of its reversal training 
delayed the same length of time (8 days) 
occupied by the postcriterion training given 
to the overlearning groups. During these 
8 days, Group CD was maintained on its 
regular 23-hr. water deprivation schedule. 
Groups EC and EI were experimental groups 
that received the overlearning experience 
described below. 
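
The acquisition criterion (18 correct responses out of any 20 successive trials) amounts to a sliding-window rule. The following Python sketch is an illustration only and is not part of the original procedure; the function name, the example record, and the convention of counting trials up to and including the trial that completes the criterion window are all assumptions. Whether forced trials entered the window is not stated in the text, so the sketch simply scores whatever sequence it is given.

```python
def trials_to_criterion(choices, window=20, required=18):
    """Return the number of trials given when the criterion is first met.

    choices: sequence of booleans, True for a correct response.
    The criterion is met at the first trial that ends a run of `window`
    successive trials containing at least `required` correct responses.
    Returns None if the criterion is never met.
    """
    correct_in_window = 0
    for i, correct in enumerate(choices):
        correct_in_window += int(correct)
        if i >= window:
            # Drop the trial that has just slid out of the window.
            correct_in_window -= int(choices[i - window])
        if i + 1 >= window and correct_in_window >= required:
            return i + 1
    return None


# Hypothetical record: 10 initial errors, then mostly correct responding.
record = [False] * 10 + [True] * 9 + [False] + [True] * 9 + [False] + [True] * 10
print(trials_to_criterion(record))
```

For this hypothetical record the first qualifying window ends on Trial 29, since the 20 trials ending there contain exactly 18 correct responses.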

Overlearning and reversal.—Groups EC and 
EI received 160 overlearning trials, 20 per 
day for 8 days, with 4 (20%) of each day’s 
trials being forced. For Group EC, all forced 
trials were to the correct (right) side; for 
Group EI, all forced trials were to the incorrect
(left) side. Apart from this difference,
the two groups received identical treatment 
during overlearning. 

For all groups reversal training consisted 
of 20 trials per day, all free. Reward, of 
course, now appeared in the left arm of the 
maze. The reversal criterion was the same 
as employed during acquisition. 


Results and Discussion 


Acquisition and overlearning.—The 
mean numbers of free trials to acquisition
criterion were 38.1, 40.3, 39.4,
and 40.1, for Groups C, CD, EC, and 
EI, in that order. Group EC aver- 
aged 5.6% errors on the 128 free 
overlearning trials, while for Group 
EI the corresponding value was 9.9%, 
the difference between the groups 
being nonsignificant. 

Reversal.—Using the individual
(trials to criterion) acquisition scores, 
Ss of each of the four groups were 
divided into three levels (6 Ss per 
level), reflecting the speed with which 
the acquisition criterion was attained. 
A Treatments × Levels analysis of
variance was then applied to the 
trials to reversal criterion data. The 
F for treatments alone proved significant
(F = 21.12; df = 3/60; P < .01).
In the pairwise t tests that were
applied to the group means (which
appear in the first row of Table 1),
Group EI emerged significantly inferior
to all other groups (P < .001
in all comparisons). However, all 
other pairwise differences fell far short 
of significance. Similar analyses based 
on the mean numbers of errors to 
reversal criterion yielded similar re- 
sults. 
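
The degrees of freedom reported for the treatments F (3/60) follow directly from the design: four treatment groups crossed with three acquisition-speed levels, 6 Ss per cell. The short Python sketch below only illustrates that bookkeeping for an equal-cell, two-way between-subjects layout; the function name and the dictionary keys are hypothetical labels, not terms from the report.

```python
def factorial_anova_df(n_treatments, n_levels, n_per_cell):
    """Degrees of freedom for a two-way between-subjects design, equal cells."""
    n_total = n_treatments * n_levels * n_per_cell
    return {
        "treatments": n_treatments - 1,
        "levels": n_levels - 1,
        "interaction": (n_treatments - 1) * (n_levels - 1),
        "within_cells": n_total - n_treatments * n_levels,
        "total": n_total - 1,
    }


# Exp. I: Groups C, CD, EC, EI by three levels, 6 Ss per cell (72 Ss).
df = factorial_anova_df(n_treatments=4, n_levels=3, n_per_cell=6)
print(df["treatments"], df["within_cells"])  # 3 60
```

The component values (3, 2, 6, and 60) sum to the total of 71, as they should for 72 Ss.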


In agreement with previously reported 
results (D'Amato & Jagoda, 1961), the 
data of the present experiment clearly 
show that forced trials to the incorrect 
side during overlearning sharply inter- 
fere with position reversal learning. 
However, it is equally clear that the
overlearning effect has not been obtained. 

One possible explanation for the ab- 
sence of the overlearning effect is sug- 
gested by the results of Group CD, the 
group that had reversal training delayed 
for 8 days. It will be observed from 
Table 1 that this group had the smallest 
mean number of trials to reversal cri- 
terion, though not significantly so. 
Since, as earlier mentioned, many studies 
reporting the overlearning effect have 
distributed the overtraining trials over 
a minimum of 12 days, and since there 
is some evidence that delay between 
acquisition and reversal can facilitate
reversal learning, the possibility arises 
that the overlearning effect is some joint
function of number of overtraining trials
and the time elapsed between achievement
of acquisition and the beginning
of reversal learning. There is a plausible
similarity between the operation of overlearning
and that of interposing a delay
between acquisition and reversal learning.
In the latter, S is removed from
the experimental situation for a period
of time and therefore has no experience
during this delay with either the positive
or the negative stimulus. In the former,
since few errors are normally made during
overtraining, experience with the
negative stimulus (or with the incorrect
response) can be considered to be virtually
terminated during the greater
part of the overlearning phase.

TABLE 1
MEAN NUMBERS OF TRIALS TO REVERSAL CRITERION

                    Group
Exp.     C      CD      EC      EI     N per Group
I       76.4   66.6    85.8   181.9        18
II      91.2   79.4    93.9   177.6        10
III     86.0    —     101.8   157.0        13

Thus, a second experiment was run 
which was closely similar to Exp. I with 
the important difference that 10 rather 
than 20 trials per day were given through 
all phases of the experiment. The 160 
postcriterion trials therefore required 
16 rather than 8 days. 


EXPERIMENT II
Method 


Subjects and apparatus.—Forty experimentally
naive albino Ss (23 males and 17
females) 85 to 100 days of age were used. 
In this study opaque rather than transparent 
doors were used, a change necessitated by 
the requirements of other, concurrently run, 
studies. 

Procedure.—The only differences in the 
procedure of this and the preceding experiment
were that (a) 10 trials per day were given
during acquisition, overtraining, and reversal; 
and (b) the intertrial interval was increased 
from 45 to 60 sec. Two (20%) of the daily 
acquisition and overlearning trials were 
forced. 

The 40 Ss were assigned to Groups C, CD,
EC, and EI, 10 Ss per group, in such a
manner as to equate for acquisition means.


Results and Discussion 


Acquisition and overlearning.—The 
mean numbers of free trials to acquisi- 
tion criterion for Groups C, CD, EC, 
and EI, respectively, were 57.7, 53.6, 
53.1, and 50.1. Of the 128 free post- 
criterion trials Group EC averaged 
8.1% errors and Group EI, 4.8%. 
The difference between the group 
means again proved statistically non- 
significant. 

Reversal.—A simple analysis of vari- 
ance applied to the trials to criterion 
data showed the group means to 
differ significantly (F = 5.45; df = 3/36;
P < .01). The group means are
presented in the second row of Table 
1, and it may be seen there that the 
results of this experiment are in close 
agreement with those of Exp. I. 
Group EI once again reversed sig- 
nificantly more slowly than any other 
group (P < .005 for all pairwise t
tests). Again Group CD had the 
lowest mean number of trials to 
reversal criterion, but not significantly 
so. And once again the overlearning 
effect was not obtained. 


Although the detrimental effect on 
reversal learning of forced overlearning 
trials to the incorrect side has been veri- 
fied, the overlearning effect remains elu- 
sive. Only one reasonable possibility 
occurred to us as a likely explanation, 
namely, that the number of overlearning 
trials, although at least equal to that 
employed in other reported studies, 
may have been insufficient. Thus, Exp. 
III was initiated in which the number of 
overtraining trials was increased from 
160 to 300. 


EXPERIMENT III 
Method 


Subjects and apparatus.—The Ss were 39 
experimentally naive albino rats (18 males 
and 21 females) 90-95 days of age at the start 
of the study. Opaque doors were again used 
in the Y maze apparatus. 

Procedure.—This experiment followed
closely Exp. I, with the following exceptions:
(a) a 57-sec. intertrial interval was employed;
(b) 300 overlearning trials were given (20 
per day); (c) the acquisition and reversal
criteria were made more stringent by requiring
that the last 10 of the criterional trials be
correct.

Since the acquisition-reversal interval was 
approximately the same as employed in the 
previous experiment, Group CD was elimi- 
nated from the present study. Thus, Groups 
C, EC, and EI contained 13 Ss each, assigned
in the usual manner. 


Results and Discussion 


Acquisition and overlearning.—The 
mean numbers of free trials to acquisi- 
tion criterion were 61.7, 62.7, and 
61.7 for Groups C, EC, and EI, in 
that order. On the 240 free overlearning
trials Groups EC and EI
averaged 2.9% and 4.1% errors,
respectively. The difference between 
the group means was not significant. 
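
The numbers of free overlearning trials quoted for the three experiments (128, 128, and 240) can be recovered from the total postcriterion trials, the daily trial count, and the daily number of forced trials. The Python sketch below is illustrative only; the function name is hypothetical, and the value of 4 forced trials per day for Exp. III is an assumption based on the statement that Exp. III closely followed Exp. I.

```python
def overlearning_schedule(total_trials, trials_per_day, forced_per_day):
    """Return (days, free_trials) for an overlearning phase.

    Assumes every day has the same number of trials and of forced trials.
    """
    days, leftover = divmod(total_trials, trials_per_day)
    if leftover:
        raise ValueError("total_trials must be a whole number of days")
    free_trials = total_trials - days * forced_per_day
    return days, free_trials


# Values taken from the Method sections; Exp. III forced trials are assumed.
schedules = {
    "Exp. I": overlearning_schedule(160, 20, 4),
    "Exp. II": overlearning_schedule(160, 10, 2),
    "Exp. III": overlearning_schedule(300, 20, 4),
}
for name, (days, free_trials) in schedules.items():
    print(name, days, free_trials)
```

Under these assumptions the overlearning phases last 8, 16, and 15 days, with 128, 128, and 240 free trials, which agrees with the counts given in the text.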

Reversal.—The mean numbers of
trials to reversal criterion are pre- 
sented in the third row of Table 1. 
The results closely parallel those of 
Exp. I and II. A simple analysis of 
variance applied to the trials to 
criterion data produced an F of 5.42 
(P = .01). Group EI was signifi- 
cantly inferior to Group EC (P <.02) 
and to Group C (P<.005). Once 
again the overlearning effect failed 
to occur. (The Group C vs. Group
EC comparison falls far short of
significance.)


The results of the three experiments 
are consistent in demonstrating that 
severe retardation of reversal learning 
can be induced by a relatively small 
amount of forced incorrect responding. 
The marked inferiority of the EI groups 
in reversing suggests that avoidance of 
the incorrect place, or of the negative 
stimulus, is far from asymptotic at the 
time S achieves criterion and reasonable 
habit mastery. Apart from theoretical 
considerations, one practical implication 
of this fact is that in a choice situation 
habit persistence can possibly be aug- 
mented much more efficiently by forced 
incorrect responding than by the more
natural method of permitting a moderate
amount of free overtraining. 

Our consistent failure to find the over- 
learning effect is difficult to understand 
in view of the relatively large number of 
reported studies in which the overlearn- 
ing effect was obtained. It is not likely 
that the absence of the overlearning 
effect can be attributed to an excessive 
rate of incorrect responding in Group 
EC, as might be deduced from our 
interpretation of the overlearning effect. 
In Exp. III, Ss of Group EC averaged 
only 2.9% errors during overlearning; 
furthermore, analysis of the individual 
error scores showed that speed of reversal 
was not strongly related either to errors 
made during overlearning or to the sum 
of acquisition and overlearning errors. 

A major difference between the present 
experiments and those in which an over- 
learning effect was reported lies in our 
use of forced trials during acquisition 
and overlearning. In Exp. IV the possi- 
ble effect of this variable in the pre- 
ceding studies was evaluated by eliminat- 
ing forced trials during the acquisition 
and overlearning phases. In addition, 
the number of postcriterion trials was 
extended to a maximum of 800. 


EXPERIMENT IV 
Method 


Subjects and apparatus.—The Ss were 36
experimentally naive female albino rats, 
115-150 days of age at the start of the study. 
The Y mazes with opaque doors were again 
used. 

Procedure.—Except for the elimination of
forced trials and the number of postcriterion 
trials employed, the procedure closely paral- 
leled that of Exp. III. The Ss were trained 
to criterion on a right-turning response and 
placed on reversal learning after 0, 200, 400, 
or 800 postcriterion trials (9 Ss per group).
Twenty trials per day were given through 
all phases of the experiment, so that the 800 
group was on overlearning for 40 consecutive
days. All Ss were given a minimum of 7 days
on reversal learning. 


Results 


Acquisition.—The mean numbers
of trials to acquisition criterion for
the groups having 0 (the control
group), 200, 400, and 800 postcriterion
trials were, in order, 45.9, 50.8,
51.6, and 54.6.


Reversal.—The mean numbers of
trials to reversal criterion were, in the
same order, 58.7, 111.7, 91.1, and
103.7. A simple analysis of variance
showed the group means not to differ
significantly (F = 2.18; df = 3/32;
.10 < P < .15). A trend analysis
based on the data of the first 7 reversal
days indicated that the groups
did not differ significantly with re- 
spect to the sum of correct responses 
made over the 7 reversal days. The 
differences between the groups’ linear 
trends, as well as between their 
quadratic trends, were also non- 
significant. 


Discussion 


The results of Exp. IV are in accord 
with those of Exp. I, II, and III and
restrict the range of conditions over 
which the overlearning effect can be 
expected. Although one may maintain 
that the 800 postcriterion trials were 
insufficient to obtain the overlearning 
effect, it should be noted that this number
of overtraining trials is vastly greater
than the numbers used in previous 
studies that have reported the effect. 
As to the question of the disparity be- 
tween the present results and those of 
Pubols (1956), we can only offer the 
suggestion that the difference may be 
related to the fact that extramaze cues, 
apparently abundantly available in the 
latter study, are totally lacking in our 
apparatus. However, this factor could 
not easily explain the appearance of the 
overlearning effect in the “response 
learning” groups of the Brookshire, 
Warren, and Ball (1961) study. 

It is unlikely that the absence of the 
overlearning effect in the present studies
could have been predicted from the
recent suggestion (e.g., Birch et al., 1960)
that the effect may be attributable to a 
nonmonotonic relation holding between 
number of acquisition trials and resistance
to extinction. According to this
interpretation, reversal is faster after
overlearning simply because overtraining
leads to faster extinction of the approach
response or, more generally, of the original
habit. This hypothesis, it may be of
interest to note, was first proposed by
Jackson (1932) in what was most likely 
the first overlearning study. 

Apart from the problem of handling 
the results of the present experiments, 
the hypothesis of nonmonotonicity between
amount of acquisition training
apparently 


ship between acquisition level and re- 
sistance to extinction, 
exists some confirming evidence (Murillo 
& Capaldi, 1961; North & Stimmel, 
1960; Senko, Champ, & Capaldi, 1961). 
The major counterevidence comes from
studies employing the bar pressing response
in a simple instrumental setting
(e.g., Harris & Nygaard, 1961; Margulies,
1961). In our own laboratory, using
a continuous reinforcement schedule, 
we have found resistance to extinction 
of the bar pressing response to increase 
over groups having 278, 995, and 7070 
mean numbers of acquisition responses, 
distributed over 2, 3, and 21 days, 
respectively.²

We are apparently faced with the 
following situation. Where the over- 
learning effect occurs, one can logically 
attribute the more rapid reversal of the 
overtrained Ss to faster extinction of the 
formerly correct response, and there 
even exists some independent evidence 
that in certain situations this may be 
the correct interpretation (Birch et al., 
1960). Where the overlearning effect 
fails to occur, as in the present experi- 
ments, it is of course conceivable that 
the failure is assignable to the absence 
of a nonmonotonic relation between 
amount of acquisition training and 
resistance to extinction, i.e., within the
latter experimental conditions acquisi- 
tion level and resistance to extinction 
enjoy the classical monotonic relation- 
ship. As already indicated the available 
evidence, even within the confines of an
instrumental response, is not unambiguous.
The North and Stimmel (1960)
study is the only one known to the
writers in which extinction of a simple
instrumental response (starting time in a
runway) was found to be nonmonotonically
related to amount of acquisition
training. In the Murillo and Capaldi
(1961) and the Senko, Champ, and
Capaldi (1961) studies S was required to
guess whether or not a piece of cloth
was present in a covered well by responding
“in” or “out.” Their “extinction”
trials (cloth no longer present
in well) were really reversal trials since
S was reinforced for responding “out.”
Thus, in reality they were dealing with
the effects of overtraining on reversal
learning rather than on extinction, and
it is only inferentially that their results
can be claimed as support for the hypothesis
of nonmonotonicity between
amount of training and resistance to
extinction.

2 The research was conducted by F.
Sperber and S. Gillman.

It seems apparent that if the overlearning
effect is to be explained in terms
of an underlying relationship between 
amount of acquisition training and 
resistance to extinction, the variables 
influencing the latter relationship will 
have to be well specified before it can 
serve as a useful explanatory mechanism. 
Most probably these variables can best 
be isolated and studied within the
context of a simple instrumental response.


SUMMARY 


Four experiments were conducted in- 
volving extensive overtraining of a position 
discrimination habit in rats. In Exp. I, II,
and III, reversal learning of the position
response was consistently and markedly retarded
in those Ss that had a moderate proportion
of their postcriterion trials forced
to the incorrect side. In all three experiments,
however, Ss that had the same proportion
of postcriterion trials forced to the correct
side did not show the overlearning effect, i.e.,
they did not reverse faster than control Ss that
received no overlearning experience. In
Exp. IV, run with all free trials, the overlearning
effect again failed to appear, although
the number of postcriterion trials was
increased to a maximum of 800.


REFERENCES 


BIRCH, D., ISON, J. R., & SPERLING, S. E.
Reversal learning under single stimulus
presentation. J. exp. Psychol., 1960, 60,
36-40.

BROOKSHIRE, K. H., WARREN, J. M., & BALL,
G. G. Reversal and transfer learning following
overtraining in rat and chicken.
J. comp. physiol. Psychol., 1961, 54, 98-102.

BRUNER, J. S., MANDLER, J. M., O'DOWD,
D., & WALLACH, M. A. The role of overlearning
and drive level in reversal learning.
J. comp. physiol. Psychol., 1958, 51, 607-613.

BUNCH, M. E. Transfer of training in the
mastery of an antagonistic habit after
varying intervals of time. J. comp.
Psychol., 1939, 28, 189-200.

D'AMATO, M. R., & JAGODA, H. Effects of
extinction trials on discrimination reversal.
J. exp. Psychol., 1960, 59, 254-260.

D'AMATO, M. R., & JAGODA, H. Analysis
of the role of overlearning in discrimination
reversal. J. exp. Psychol., 1961, 61, 45-50.

HARRIS, P., & NYGAARD, J. E. Resistance
to extinction and number of reinforcements.
Psychol. Rep., 1961, 8, 233-234.

JACKSON, T. A. General factors in transfer
of training in the white rat. Genet. psychol.
Monogr., 1932, 11, 1-59.

MARGULIES, S. Response duration in operant
level, regular reinforcement, and extinction.
J. exp. Anal. Behav., 1961, 4, 317-322.

MURILLO, N. R., & CAPALDI, E. J. The role
of overlearning trials in determining resistance
to extinction. J. exp. Psychol.,
1961, 61, 345-349.

MUROFUSHI, K. The effect of cue on reversal
learning after varying periods of rest.
Psychologia, 1957, 1, 37-46.

NORTH, A. J., & STIMMEL, D. T. Extinction
of an instrumental response following a
large number of reinforcements. Psychol.
Rep., 1960, 6, 227-234.

PUBOLS, B. H., JR. The facilitation of visual
and spatial discrimination reversal by
overlearning. J. comp. physiol. Psychol.,
1956, 49, 243-248.

REID, L. S. The development of noncontinuity
behavior through continuity learning.
J. exp. Psychol., 1953, 46, 107-112.

SENKO, M. G., CHAMP, R. A., & CAPALDI,
E. J. Supplementary report: Resistance to
extinction of a verbal response as a function
of the number of acquisition trials. J. exp.
Psychol., 1961, 61, 350.

STEVENSON, H. W., & WEIR, M. W. Response
shift as a function of overtraining
and delay. J. comp. physiol. Psychol.,
1959, 52, 327-329.


(Received June 26, 1961) 




Journal of Experimental Psychology
1962, Vol. 64, No. 2, 123-125 


THE VON RESTORFF ISOLATION EFFECT WITH
MINIMAL RESPONSE LEARNING ¹


ARTHUR R. JENSEN 
University of California 


H. von Restorff (1933) found that 
when an “isolated” or perceptually 
emphasized item was included in a 
list of relatively homogeneous items, 
Ss would learn the isolated item 
quickly as compared with nonisolated 
items. The experimental literature 
and the numerous theoretical at- 
tempts to explain this phenomenon 
have been reviewed in a series of 
articles by Newman and Saltz (1958; 
Saltz & Newman, 1959, 1960). They 
concluded that the primary effect 
of isolation is to accelerate the learn- 
ing of the isolated item as a response 
(Saltz & Newman, 1959, p. 450). 
Their experiments showed that the 
isolated item was indeed learned more 
rapidly as a response than a non- 
isolated item in the same serial posi- 
tion; the isolated item was emitted 
more frequently and occurred more 
often as an intrusion. 

Learning a serial list of words or 
nonsense syllables, as in the Saltz 
and Newman studies, involves two 
phases: response learning and learn- 
ing the serial order of the items. 
Thus, we may ask whether or not 
the isolation effect is manifest in the 
serial learning phase as well as in the 
response learning phase. The facili- 
tation of response learning, empha- 
sized by Saltz and Newman, is 

possibly only one result of isolation,
and by itself may be inadequate to
explain the total phenomenon.


The present experiment examined 
the isolation effect under conditions
in which (a) all the items in the list
were already known to S so that all
he had to learn was their serial order,
and in which (b) S need not make a
different response in the isolated than
in the nonisolated condition, i.e., the
isolated and nonisolated lists were
identical in terms of the responses S
was required to make.

1 This research was aided by a grant from
the National Science Foundation to the
Center for Human Learning.


METHOD 


Subjects.—Twenty men and 20 women were
recruited from an introductory course in 
educational psychology. 

Procedure.—To eliminate or at least 
minimize response learning, the serial lists 
in the experimental and control conditions 
were composed of nine colored geometric 
forms: triangles (T), circles (C), and squares 
(S) colored red (R), yellow (Y), and blue (B). 
Each shape appeared once in each of the 
three colors; stimuli of the same shape or color 
were never adjacent to each other in the list. 
The nine-item series was always preceded by 
three small white dots which served as the 
signal for anticipating the first item. 

The stimuli were automatically projected 
onto a ground glass screen 2 ft. square. The 
figures were approximately 4 in. in size on the 
screen and the colors were vivid. The rate 
of presentation was 3 sec. per item, with a 
6-sec. intertrial interval. The S sat approxi- 
mately 10 ft. directly in front of the screen. 

The Ss were tested individually. The 
experimental and control groups were given 
identical instructions. The S was told he 
would have to learn to a criterion of one 
perfect trial the order in which nine stimuli 
repeatedly appeared on the screen. The 
stimuli were named for S, who was then 
asked to repeat the names, e.g., RED TRIANGLE,
etc. All Ss were easily able to give
all the necessary responses before beginning
the serial learning. The Ss learned by the
anticipation method, responding by saying
RED SQUARE, etc. They were urged to begin
guessing on the very first trial and to guess 
when in doubt on subsequent trials. 




TABLE 1
SUMMARY DATA FOR CONTROL AND EXPERIMENTAL GROUPS

         Trials for Mastery   Percentage Errors   Order of Learning   Percentage Intrusions
              of List           at Position 6        Position 6            of Item 6
Group     Mean      SD        Mean      SD         Mean      SD         Mean      SD
C        22.75     7.58      15.92     3.25        7.50     1.62       22.69     9.65
E        23.30     9.87      10.31     3.63        4.55     2.35       18.96     7.12
             t < 1             t = 5.02*            t = 4.50*            t = 1.36

* P < .01.


Experimental conditions.—The order of the 
stimuli for Group C (Control) (N = 20) was:
RS, YC, BS, YT, RC, BT, YS, RT, BC.

Previous experiments have shown that in 
a nine-item list Position 6 is generally the 
most difficult to learn. Therefore in the 
present experiment the sixth item was 
“isolated” or emphasized in Group E (Ex- 
perimental) (N = 20). The rest of the list 
was the same as that learned by Group C. 
For Group E, instead of an actual blue tri- 
angle in Position 6, the words BLUE TRI- 
ANGLE appeared, printed in letters 2 in. high 
on the screen. Thus, Group E was required 
to learn the same responses as Group C; only 
the stimulus properties of the item in Position 
6 differed for the two groups. The names 
of the shapes and colors, which are high 
frequency words in the Thorndike-Lorge 
word count, are probably so high in terms 
of response availability that it seems safe 
to assume there would be no appreciable 
difference in the strength of the naming re- 
sponse to the actual blue triangle and to the 
words BLUE TRIANGLE, especially none that 
would be evident under the 3 sec. anticipation 
time allowed in the present experiment. 


RESULTS AND DISCUSSION


The results are summarized in Table 
1 and Fig. 1. The serial-position 
curves in Fig. 1 were obtained by 
determining each S's percentage of 
errors at each position and averaging 
these percentages for each group. 
Though response learning per se was 
practically eliminated by the method 
of the present experiment, the isola- 
tion effect was clearly manifested, 
the difference between Groups E and 
C in percentage of errors at Position 6
being significant (P < .001). The
large percentage of errors at Position
7, immediately following the isolated
item, contradicts the idea that isola-
tion has the effect of breaking the list
into two parts, each of which may be
learned as a single list. This finding
agrees with the conclusion of Newman 
and Saltz (1958) that the more rapid 
learning of the isolated item does not 
increase its effectiveness as a stimulus 
for eliciting the next item in the series. 
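
The way the serial-position curves were obtained can be sketched in a few lines. This is an illustration under stated assumptions, not the author's computation: "percentage of errors at each position" is read here as each S's errors at a position divided by that S's total errors, and the error counts, function names, and three-position lists are hypothetical.

```python
def percent_errors_by_position(errors):
    """Convert one subject's error counts per position into percentages
    of that subject's total errors (an assumed reading of the text)."""
    total = sum(errors)
    return [100.0 * e / total for e in errors]


def group_curve(subject_errors):
    """Average the per-subject percentage curves, position by position."""
    curves = [percent_errors_by_position(e) for e in subject_errors]
    n_subjects = len(curves)
    n_positions = len(curves[0])
    return [sum(c[p] for c in curves) / n_subjects for p in range(n_positions)]


# Hypothetical error counts for two subjects on a three-position list.
hypothetical = [[1, 1, 2], [2, 1, 1]]
print(group_curve(hypothetical))
```

Averaging percentages rather than raw counts gives every S equal weight in the group curve, whatever the number of trials that S needed to reach criterion.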

As can be seen in Table 1, the groups did not differ
significantly in the number of trials required to learn
the list, which is also what Newman and Saltz (1958)
found. Unlike the Newman and Saltz data, however, the
conditions of the present study produced no significant
differences between Groups E and C in the percentage of
intrusions of Item 6 as an error in other positions.
Saltz and Newman (1959) found that the isolated item was
more likely to be emitted on the second trial, i.e.,
after a single presentation of the list. In the present
study the total frequency with which BLUE TRIANGLE was
given as a correct response on Trial 2 in Groups C and E
was 2 each. The frequencies of BLUE TRIANGLE as an
incorrect response on Trial 2 in Groups C and E were 10
and 13, respectively. The difference is nonsignificant.

FIG. 1. Serial-position curves showing the isolation
effect at Position 6 for Group E. (Ordinate: mean per
cent errors; abscissa: serial position.)


The positions in the list were ranked for each S in the
order that S learned them. The rank of a position was
based on the number of the trial on which the last error
occurred for that position. As shown in Table 1, the
groups differed significantly in the mean rank order of
learning Position 6. It has been found that when the
items of a serial list are ranked in the order in which
they are learned, the increment in errors on each item
is a constant proportion of the total errors for all
items (Jensen, 1962). A corollary is that for a given S
the same number of trials (or reinforcements) is required
to learn each item, once the previous item in the order
of learning has attained the criterion of learning. In
other words, it appears that all the items in a serial
list are of equal difficulty as regards the learning of
their serial positions. Since all the items cannot be
learned in one trial (unless the whole list is within S's
immediate memory span) they are necessarily learned in a
particular order. The serial-position curve would result
from the high degree of unanimity among Ss in the order
of learning the items. Though isolation changes the order
of learning, so that the isolated item is learned sooner,
it does not seem to be any easier in relation to the
previously learned item (regardless of its position) than
is a nonisolated item. The differences between the
percentage of errors on Item 6 and the previously learned
item were 2.43 and 2.69 for Groups C and E, respectively.
The hypothesis that isolation of an item changes only its
order of being learned but not its difficulty is
consistent with the general finding that isolation does
not facilitate the learning of the list as a whole.
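
The ranking rule described above (positions ordered by the trial on which the last error occurred) can be sketched as follows. This is a hedged illustration: the text does not say how tied positions were handled, so the averaging of tied ranks is an assumption, and the function name and trial numbers are hypothetical.

```python
def learning_order_ranks(last_error_trial):
    """Rank positions by the trial of their last error (1 = learned first).

    last_error_trial[p] is the trial number of the final error at position p.
    Tied positions share the mean of the ranks they span (an assumption;
    the source does not state a tie rule).
    """
    order = sorted(range(len(last_error_trial)), key=lambda p: last_error_trial[p])
    ranks = [0.0] * len(last_error_trial)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of positions tied with position order[i].
        while (j + 1 < len(order)
               and last_error_trial[order[j + 1]] == last_error_trial[order[i]]):
            j += 1
        shared_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared_rank
        i = j + 1
    return ranks


# Hypothetical last-error trials for the nine positions of one subject.
hypothetical_trials = [3, 10, 12, 15, 18, 9, 20, 14, 6]
print(learning_order_ranks(hypothetical_trials))
```

The mean of such ranks for Position 6 over the Ss of a group corresponds to the "Order of Learning Position 6" entry in Table 1.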


SUMMARY 


The von Restorff isolation effect was 
examined under conditions which minimized 
the role of response learning. Forty Ss 
learned by the anticipation method the serial 
order of nine colored geometric forms, all of 
which Ss could readily recall before having 
to learn their serial order. All Ss learned the 
same responses; only the stimulus properties 
of the isolated item differed in the experi- 
mental condition. 

The isolation effect was clearly manifested,
showing fewer errors at the isolated position. 
The facilitation of response learning ap- 
parently is not the only effect of isolation 
and by itself cannot explain the total phe- 
nomenon. The number of intrusions of the 
isolated item did not differ significantly from 
that of the nonisolated item in the same posi- 
tion, nor did isolation facilitate learning the 
over-all list. It was suggested that when the 
effects of response learning per se are elimi- 
nated, isolation merely changes the order of 
learning the positions of the items in the 
serial list. 


REFERENCES 


JENSEN, A. R. An empirical theory of the
serial-position effect. J. Psychol., 1962,
53, 127-142.

NEWMAN, S. E., & SALTZ, E. Isolation effects:
Stimulus and response generalization as
explanatory concepts. J. exp. Psychol.,
1958, 55, 467-472.

SALTZ, E., & NEWMAN, S. E. The von
Restorff isolation effect: Test of the
intralist association assumption. J. exp.
Psychol., 1959, 58, 445-451.

SALTZ, E., & NEWMAN, S. E. Test of a
"common sense" theory of the von Restorff
effect. Amer. Psychologist, 1960, 15, 451.
(Abstract)

VON RESTORFF, H. Ueber die Wirkung von
Bereichsbildungen im Spurenfeld.
Psychol. Forsch., 1933, 18, 299-342.


(Received July 7, 1961) 




A COMPARISON OF REACTION TIME AND VERBAL REPORT
IN THE DETECTION OF MASKED STIMULI ¹

ELIZABETH FEHRER AND IRVING BIEDERMAN

Brooklyn College

¹ This research was supported by Grant G-6456 from the
National Science Foundation.

Fehrer and Raab (1962) found that reaction time (RT) to a
visual stimulus was not affected when the stimulus was
"masked" by subsequent stimulation of neighboring retinal
areas (metacontrast). It was pointed out, however, that
even when the test stimulus was phenomenally not present
it did affect the appearance of the masking stimuli,
since under these conditions the masks exhibited a
readily detectable phi movement. In other words, the test
stimulus, although masked as such, exerted both
phenomenal effects, since its presence produced a change
in the appearance of the masks, and behavioral effects,
since it elicited as fast an RT as when it was presented
alone.

The problem of the present experiments was to determine
whether RT to a test stimulus would remain unaffected
under masking conditions in which the presence of the
test stimulus could not be detected phenomenally. In the
Fehrer and Raab study, the stimulus display consisted of
three adjacent 2 × 2 in. light cells. The center square
provided the test flash; the two flanking squares, the
masking flashes. The luminance of all stimuli was
approximately 18 ft-L.

A suitable testing condition for the present experiments
required a test stimulus which was (a) sufficiently weak
so that its presence could not be detected when the
masking stimuli followed it, but also (b) sufficiently
intense so that RT to it alone was faster than RT to the
masking stimuli plus the stimulus-onset delay (Δt).
Unless this second provision obtains, it is not possible
to determine whether a reaction to the combined stimuli
has been initiated by the flashing of the test stimulus
or by the masks. For example, if RT to the test stimulus
when presented alone is 200 msec., and RT to the masks
alone is 150 msec., then, at Δt = 50 msec., it is not
possible to tell whether an RT of 200 msec. to the
combined stimuli has been set off by the test or by the
masking stimuli. At Δt = 75 msec., however, RTs averaging
200 msec. can be attributed to the test stimulus since
here, if the reactions were to the masks, they would
average 225 msec. (150 msec. plus 75 msec. for the Δt.)
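
The second requirement and the worked example above amount to a simple comparison, which can be sketched as follows. The function name `attributable_to_test` is hypothetical, and the rule is only a restatement of the authors' example, not a statistical test.

```python
def attributable_to_test(rt_test, rt_masks, delay):
    """Return True when RT to the test stimulus alone is faster than
    RT to the masks alone plus the stimulus-onset delay, i.e. when a
    reaction at the test-alone speed cannot have been set off by the masks.
    All arguments are in msec."""
    return rt_test < rt_masks + delay


# The two cases from the text: test alone 200 msec., masks alone 150 msec.
print(attributable_to_test(200, 150, 50))
print(attributable_to_test(200, 150, 75))
```

At a delay of 50 msec. the two candidate RTs coincide (200 vs. 150 + 50), so the source of the reaction is indeterminate; at 75 msec. a reaction to the masks would be expected at 225 msec., which separates the two cases.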

These two requirements could not be fulfilled when both
test and masking stimuli were brief light flashes. As the
luminance of the test flash was reduced, RT to it became
too long to meet the second requirement. In other words,
at Δt's at which phenomenal report failed, RT to the test
stimulus alone was longer than RT to the masks plus the
Δt.

A different condition, however, was found to be suitable.
When the test stimulus consisted of a brief extinction of
the otherwise constantly illuminated center square, then,
when the masks (the two flanking squares) were flashed
on, either simultaneously with the darkening of the
center or slightly later, the darkening of the center
square became impossible or very difficult to detect.
Moreover, RT to the extinction of the test square alone
was sufficiently rapid to distinguish it from RT to the
masks at certain Δt's. It was this condition, therefore,
that was explored in the present experiments.


MAIN EXPERIMENT


Apparatus.—[...] previous report (Fehrer & Raab, 1962),
[...] for the presentation of brief flashes of [...]
darkness at specified [...] adjacent 2 × 2 in. light
cells, each housing a cold cathode fluorescent lamp. The
lamps were of the mercury vapor type, and coated [...].
When flashed, [...] flashes of light in the two flanking
squares. The delays studied were 0, [...]

Each trial began with a 1-sec. warning tone which was
followed by a 2.9-, 3.2-, or 3.5-sec. silent foreperiod.
The three foreperiods were switch selected. RT was
measured from the onset of the extinction of the light
in the test [...]


TABLE 1
MEAN REACTION TIMES AND PERCENTAGES OF CORRECT VERBAL
REPORTS CLASSIFIED BY STIMULUS-ONSET DELAY (Δt)

                 Reaction Times            % Correct
Δt    S      Test     Masks    Comb.       Reports
 0    IB     170.0    159.4    162.9          63
      EF     182.3    156.7    155.9          53
10    IB     176.7    175.9    171.3ᵃ         55
      EF     183.0    167.8    166.7          56
20    IB     180.7    185.0    174.9ᵃᵇ        68
      EF     179.5    176.0    167.9          55
30    IB     177.4    194.3    176.8          68
      EF     175.5    185.1    170.3          67
50    IB     183.9    215.0    180.1ᵇ         83
      EF     182.7    207.2    179.8ᵇ         88
75    IB     173.3    234.8    172.3         100
      EF     178.4    229.0    175.0ᵇ         97

ᵃ Mean RT to combination significantly (P < .05) faster
than RT to test stimulus.
ᵇ Mean RT to combination significantly (P < .01) faster
than RT to masking stimuli.

The mean RTs to the test, masking, and combined stimuli
at each of the six Δt's are shown in Table 1. The SDs
(not shown) averaged 15.5, 10.4, and 11.7 msec.,
respectively. The percentages of correct verbal reports
appear in the last column of the table. Better than
chance (P < .05) detection is indicated by percentages of
60 or higher. (When N = 100, the standard error of 50%
is 5%.)
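
The standard error quoted in the parenthesis follows from the usual binomial formula, sqrt(p(1 - p)/N). A brief sketch is given below; the function name is hypothetical, and the use of 1.96 standard errors to arrive at a cutoff near 60% is an assumption about how the criterion was obtained, since the text gives only the result.

```python
import math


def se_of_percentage(p, n):
    """Standard error, in percentage points, of an observed proportion
    when the true proportion is p and there are n independent trials."""
    return 100.0 * math.sqrt(p * (1.0 - p) / n)


se = se_of_percentage(0.5, 100)   # chance level with N = 100 reports
cutoff = 50.0 + 1.96 * se         # assumed two-sided .05 criterion
print(round(se, 2), round(cutoff, 1))
```

Under this assumption the cutoff falls just below 60%, which agrees with the rounded criterion stated in the text.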

It is readily apparent that the accuracy of the verbal
reports improved with increase in Δt. This trend is,
therefore, different from that found when both test and
masking stimuli are equally luminous flashes of light
presented in an otherwise dark room. In the latter
situation, degree of masking (in this case, darkening) of
the test flash increases with Δt up to about 75 msec. The
present experiment has shown that maximum masking of a
brief period of darkness (as a result of the illumination
of adjacent retinal areas) occurs with simultaneous
presentation of test and masking stimuli. For 1 S, better
than chance detection of the pulse of darkness required a
Δt of 30 msec. The other S performed somewhat above
chance level at all Δt's other than 10 msec.




RT to the masks alone was faster than that to the test
stimulus alone. The mask values as presented in Table 1
include the delay time. Net values for the purpose of
this comparison, therefore, require subtraction of Δt.

At the Δt of zero, the mean RT to the combined stimuli
was virtually the same as RT to the masks and thus
considerably faster than RT to the test stimulus. This
indicates that the reaction to the combination was most
probably initiated at the time of the presentation of the
masks. The test stimulus had no clearly discernible
effect.

Subject EF's reaction to the combined stimuli at
Δt = 10 msec. also seems to have been initiated by the
masks. Subject IB, on the other hand, reacted as fast to
the test stimulus as he did to the masks plus the Δt. His
RT to the combination, faster than that to either
component, cannot be attributed to either stimulus alone.


At the Δt of 20 msec., both Ss reacted about as fast to
the test stimulus as they did to the masks plus the Δt.
RT to the combination, however, was reliably faster than
that to either component alone. The facilitation at this
Δt and that for Subject IB at Δt = 10 msec. is similar to
the facilitation in RT found at certain Δt's in the study
by Fehrer and Raab (1962). It is also similar to an
intersensory (light and sound) facilitation reported
recently by Hershenson (1962). The fact that this
facilitation is sufficiently great to be significant (see
Table 1) only at Δt's at which both stimuli yield about
the same RT is consistent with Hershenson's finding that
maximum facilitation occurs when sound and light stimuli
are separated by an interval equal to the difference in
their respective RTs. Smaller amounts of facilitation are
evident at all other delay intervals. In 11 out of the
total 12 rows, RT to the combination was the fastest of
the three.
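
The row count in the last sentence, and the net mask values obtained by subtracting the delay, can be reproduced from the Table 1 means. The sketch below is illustrative only: the numbers are transcribed from Table 1 (the combined RT for IB at 10 msec. is read as 171.3), and the names `ROWS`, `facilitation`, and `net_mask_rt` are hypothetical.

```python
# (delay in msec., subject, RT test, RT masks incl. delay, RT combination)
ROWS = [
    (0, "IB", 170.0, 159.4, 162.9),
    (0, "EF", 182.3, 156.7, 155.9),
    (10, "IB", 176.7, 175.9, 171.3),
    (10, "EF", 183.0, 167.8, 166.7),
    (20, "IB", 180.7, 185.0, 174.9),
    (20, "EF", 179.5, 176.0, 167.9),
    (30, "IB", 177.4, 194.3, 176.8),
    (30, "EF", 175.5, 185.1, 170.3),
    (50, "IB", 183.9, 215.0, 180.1),
    (50, "EF", 182.7, 207.2, 179.8),
    (75, "IB", 173.3, 234.8, 172.3),
    (75, "EF", 178.4, 229.0, 175.0),
]


def facilitation(test, masks, comb):
    """Msec. by which the combination beats the faster single stimulus
    (negative when the combination is not the fastest of the three)."""
    return min(test, masks) - comb


# Number of rows in which the combination yields the fastest mean RT.
rows_comb_fastest = sum(1 for _, _, t, m, c in ROWS if facilitation(t, m, c) > 0)

# Mask RT measured from mask onset: tabled value minus the delay.
net_mask_rt = {(d, s): round(m - d, 1) for d, s, _, m, _ in ROWS}

print(rows_comb_fastest)
print(net_mask_rt[(75, "IB")])
```

The one row in which the combination is not fastest is IB at the zero delay, where the masks alone gave the shortest mean RT.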


The magnitude of the facilitation at Δt = 20 msec. is
surprising in view of the fact that the test stimulus
darkening was phenomenally a most inconspicuous event and
required close attention for RT. RTs to the test
stimulus, on the other hand, were reasonably fast. Speed
of RT, rather than phenomenal appearance, may be the more
important variable in facilitation.

The data at Δt = 30 msec. are the most relevant for the
problem that initiated the present study, namely, the
comparison of RT and verbal report in detection of the
test stimulus. At this Δt, only 68% of the verbal reports
were correct. RT to the combination, however, was not
significantly different from RT to the test stimulus, but
very significantly faster than RT to the masks plus the
Δt. For each S, the distribution (not shown) of RTs to
the combination was virtually the same as that to the
test stimulus alone. There was no evidence of bimodality
as there should have been if about one-third of the RTs
to the combination had been set off by the masks. In
addition, the SD of RTs to the combination was smaller
than the SD for the test stimulus, 14.1 vs. 19.0 msec.
for 1 S and 8.9 vs. 17.3 msec. for the other. The data,
therefore, show that, even though careful observation
failed in many cases to detect the presence of the test
stimulus, RT apparently detected accurately at each
stimulus presentation.

The same conclusion can be drawn from the results at
Δt = 50 msec. It is apparent that RT to the combination
was consistently initiated by the test stimulus. Verbal
report was not entirely accurate.




SUPPLEMENTARY EXPERIMENT 


A brief experiment was run to 
determine whether the RT results of 
the main experiment might be due 
to the fact that both Ss knew of the 
presence of the test stimulus and, 
during all RT trials, were set to look 
for and react to the slight darkening. 
Since the sequence of stimuli was
random, S never knew when this,
a difficult event to detect, would 
occur. It seemed possible that naive 
Ss, knowing nothing of the test 
stimulus and therefore not being set 
for it, might not react more rapidly 
to the combination than to the masks 
alone. 

Four naive undergraduate Ss reacted to the combination
with a Δt of 30 msec. and to the masks alone. The two
stimuli were presented in random order with randomized
foreperiods. Six sets of 36 RTs each were run with each
S, two sets a day. Each set comprised 18 presentations of
each of the two stimuli. The longest and the shortest RT
of each 18 were discarded before the set means were
computed. Since these Ss had had no practice in RT and
exhibited very variable RTs on their first day, the first
two sets of data were discarded. The final means, each
based on 64 cases (four sets of 16 trials each), are
presented in Table 2.
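
The trimming rule described above (drop the longest and the shortest RT of each 18, then average the remaining 16) can be sketched as follows. The function name and the RT values are hypothetical; only the rule itself comes from the text.

```python
def trimmed_set_mean(rts):
    """Mean RT for one stimulus within one set, after discarding the
    single longest and single shortest value."""
    if len(rts) < 3:
        raise ValueError("need at least three RTs to trim both extremes")
    kept = sorted(rts)[1:-1]
    return sum(kept) / len(kept)


# Hypothetical set of 18 RTs (msec.) containing one slow outlier.
hypothetical_rts = list(range(181, 198)) + [420]
print(trimmed_set_mean(hypothetical_rts))
```

With 18 presentations per stimulus this leaves 16 cases per set, so four retained sets give the 64 cases on which each final mean is based.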

All Ss reacted reliably faster to the 
combination than to the masks, thus 
confirming the results of the main 
experiment. On the average, the 
naive Ss reacted 14.3 msec. faster 
to the combination than to the masks, 
a value very close to the mean differ- 
ence of 16.2 msec. for the authors. 

TABLE 2
MEAN REACTION TIMES AND PERCENTAGES OF CORRECT VERBAL
REPORTS OF NAIVE Ss

        Reaction Times                  % Correct
S     Masks    Combination     t        Reports
1     211.0      196.2        5.14         -
2     192.8      178.6        6.97         63
3     217.0      205.2        5.86         50
4     195.7      179.8        6.53         49

Note.—For combined stimuli, Δt was 30 msec.

An attempt was made with 3 of the naive Ss to determine
whether they could learn to distinguish between the masks
alone and the combination without informing them of the
actual stimulus conditions. They were shown the two
stimuli, labeled a and b, five times each in alternation,
and thereafter were asked to guess which condition was
presented. After each report, they were informed whether
they were correct. Four sets of 20 trials each were run,
two sets a day on each of 2 days. S 2 discriminated
somewhat better than chance (P = .05). The other 2 Ss
performed at chance levels. None reported any center
darkening. The only differences they suggested were
related to the duration of the masking stimuli.


Discussion 


The data of these two experiments 
confirm and add to the results of two 
previous studies which showed that RT 
is determined by very brief changes 
in energy, and that therefore stimulus 
events originating later, though they are 
important in determining phenomenal 
characteristics, do not (except for certain 
facilitation effects) affect RT to the 
stimulus delivered first. It has already 
been pointed out that Fehrer and Raab 
(1962) showed that RT to a flash of light 
is not increased by subsequent light 
stimulation even though this results in 
the phenomenal darkening of the first 
flash. In another study (Raab, Fehrer, 
& Hershenson, 1961) it was shown that 
RT is independent of flash duration over 
the range of 10 to 500 msec., even though 
phenomenally the longer lasting stimuli 
are far brighter than the brief ones and 
might, therefore, be expected to elicit 
a faster RT. The present main experi- 
ment showed that RT can be initiated 
and determined by an event which is so 
successfully masked that it is often not 
detected by careful phenomenal obser- 
vation. The supplementary experiment 
showed, further, that RT can be initiated
by an event whose presence is not even
suspected by the reacting S.

It should, perhaps, be added that the 
present study falls in the general research 
area that includes the many recent 
experiments on subception, discrimina- 
tion without awareness, etc. Eriksen’s 
(1960) recent excellent review of many 
of these studies implies that behavioral 
indices, such as GSR, have proved to be, 
if anything, less sensitive in discrimina- 
tion than well controlled phenomenal 
indices, such as forced choice. Our data, 
on the other hand, show that RT, a 
voluntary, objective response, can pro- 
vide a more sensitive measure than ver- 
bal report in stimulus detection under 
masking conditions. 


SUMMARY 


In the present study, we have compared 
the accuracy of two measures, reaction time 
and verbal report, in the detection of an event 
subjected to retroactive masking. A 5-msec. 
darkening of an otherwise steadily illuminated 
area was followed, after delays varying from 
0 to 75 msec., by a 100-msec. illumination of 
two adjoining areas. At certain critical delays 
at which verbal detection of the test stimulus 
was little above chance accuracy, RT to the 
darkening of the test stimulus was not 
affected by the delayed presentation of the 
masking lights. Compared with verbal 
report, therefore, RT provided a far more 
accurate measure of the presence of the 
masked stimulus event. 


REFERENCES 


ERIKSEN, C. W. Discrimination and learning
without awareness: A methodological survey
and evaluation. Psychol. Rev., 1960,
67, 279-300.

FEHRER, E., & RAAB, D. Reaction time to
stimuli masked by metacontrast. J. exp.
Psychol., 1962, 63, 143-147.

HERSHENSON, M. Reaction time as a measure
of intersensory facilitation. J. exp.
Psychol., 1962, 63, 289-293.

RAAB, D., FEHRER, E., & HERSHENSON, M.
Visual reaction time and the Broca-Sulzer
phenomenon. J. exp. Psychol., 1961, 61,
193-199.


(Received July 10, 1961) 


Journal of Experimental Psychology 
1962, Vol. 64, No. 2, 131-136


DIFFERENTIAL EYELID CONDITIONING AS A FUNCTION
OF THE CS-UCS INTERVAL ¹


THOMAS F. HARTMAN ² AND DAVID A. GRANT

University of Wisconsin


¹ This research was supported in part by the National
Science Foundation and in part by the Research Committee
of the Graduate School of the University of Wisconsin
with funds provided by the Wisconsin Alumni Research
Foundation.

² Now at Thomas J. Watson Research Center, IBM, Yorktown
Heights, New York.


The purpose of the investigation was to explore the
relationship between conditioned discrimination and the
CS-UCS interval. This relationship has not been
systematically investigated, although Hilgard, Campbell,
and Sears (1937) used intervals from 350 to 550 msec. and
reported that intervals of 550 msec. produced better
differential conditioning than did intervals of 400 msec.
or less. This observation evidently led Hilgard and his
co-workers to utilize 600- or 650-msec. CS-UCS intervals
in subsequent differential conditioning (e.g., Hilgard,
Campbell, & Sears, 1938; Hilgard, Jones, & Kaplan, 1951).
Most other differential conditioning studies using the
eyelid have involved a CS-UCS interval of approximately
500 msec., the interval optimal for simple eyelid
conditioning (e.g., Spence & Beecroft, 1954; Spence &
Farber, 1954).

A number of considerations might lead one to expect that
the optimal CS-UCS interval for differential eyelid
conditioning would be greater than the optimal CS-UCS
interval for simple eyelid conditioning. Differential
conditioning is more complicated than simple
conditioning, and Hartman, Grant, and Ross (1960) point
out that the limiting distribution for the latency of
responses in the eyelid conditioning situation apparently
is determined by some of the factors affecting reaction
time, so that longer CS-UCS intervals might be
appropriate in the differential conditioning situation.
The work of Wickens et al. (e.g., Wickens, 1959) also
implied that mediated responses to a stimulus complex may
provide conditioned stimuli which will extend the usual
optimal CS-UCS interval to a longer value. Certainly the
use of the interval that is optimal for simple
conditioning has led to relatively poor differential
conditioning in earlier studies, and considerations such
as those outlined above suggested that longer CS-UCS
intervals might produce better conditioned
discrimination.


METHOD


Apparatus.—Except for the presence of 
two milk-glass windows for the presentation 
of the CS, the apparatus was essentially 
the same as that described by Hartman and 
Grant (1960). The two windows consisted 
of 10-cm. circular milk-glass disks with their 
centers horizontally 15 cm. apart. Ambient 
illumination was approximately 1 mL., and 
the CS consisted of an 0.8-mL. increase in 
brightness of one of the milk-glass windows. 
The UCS, a corneal air puff, was of 50-msec. 
duration, and its intensity was regulated by a 
150-mm. column of mercury. The puff was 
sufficient to evoke a sharp reflex closure of the 
eyes of most of the Ss. 

Procedure.—Each S was given 44 rein- 
forced trials and 44 unreinforced trials. 
These were assigned randomly, balancing 
within blocks of 8 trials. For half of the Ss 
the right window was always reinforced, and 
for half the left window was always reinforced. 
The Ss were subdivided into four groups; one 
group receiving a CS-UCS interval of 400 
msec., the others 600 msec., 800 msec., and 
1000 msec. For all groups the CS duration
was 1100 msec. The intertrial intervals varied
between 20 and 40 sec. with a mean of 30 sec.
as programed by means of a Western Union
tape transmitter.
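
The trial schedule described in this paragraph can be sketched as a blockwise randomization. This is an assumption-laden illustration, not the tape program actually used: "balancing within blocks of 8 trials" is read here as four reinforced and four unreinforced trials per block, and the function name and seed are hypothetical.

```python
import random


def make_schedule(n_blocks=11, block_size=8, seed=None):
    """Return a list of 'reinforced' / 'unreinforced' labels in which each
    block of `block_size` trials holds equal numbers of the two trial types
    in random order (an assumed reading of the balancing rule)."""
    rng = random.Random(seed)
    schedule = []
    for _ in range(n_blocks):
        half = block_size // 2
        block = ["reinforced"] * half + ["unreinforced"] * half
        rng.shuffle(block)
        schedule.extend(block)
    return schedule


schedule = make_schedule(seed=1)
print(len(schedule), schedule.count("reinforced"))
```

Eleven blocks of eight trials give the 44 reinforced and 44 unreinforced trials reported for each S.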

Before each session Ss were given the 
“neutral” instructions used at the Wisconsin 
laboratory, and after 10 trials each S was 
interrupted and told that he should not aid 
or inhibit his natural eyelid responses. 

Subjects.—The Ss were 56 women and 24
men who volunteered from classes in ele- 
mentary psychology at the University of 
Wisconsin. Assignment of Ss to conditions 
was random within each replication of the 
eight experimental conditions until it became 
necessary to modify the assignment to equal- 
ize sexes in each group. Three Ss were dis- 
carded; 2 adapted to the UCS and 1 gave a 
record impossible to score because of a high 
random blink rate. 


RESULTS 


Owing presumably to the low inten- 
sity stimuli and short duration UCS 
used in the present experiment the 
random blink rate was low enough so 
that CS-UCS interval effects were 
apparent without a correction for the 
scoring interval. Therefore all eyelid 
closures with latencies from 210 msec. after
CS onset to 10 msec. after UCS onset were
scored as anticipatory eyelid responses, 
giving scoring intervals of 200, 400, 
600, and 800 msec., respectively, for 
the CS-UCS intervals of 400, 600, 
800, and 1000 msec. 
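
The scoring intervals listed above follow directly from the scoring window. A minimal sketch is given below, assuming that latencies are measured from CS onset and that the window runs from 210 msec. after CS onset to 10 msec. after UCS onset; the constant and function names are hypothetical, and treating the window bounds as inclusive is also an assumption.

```python
WINDOW_START_AFTER_CS = 210   # msec. after CS onset
WINDOW_END_AFTER_UCS = 10     # msec. after UCS onset


def scoring_interval(cs_ucs_interval):
    """Length in msec. of the window in which a closure counts as
    an anticipatory response."""
    return (cs_ucs_interval + WINDOW_END_AFTER_UCS) - WINDOW_START_AFTER_CS


def is_anticipatory(latency, cs_ucs_interval):
    """True if a closure with the given latency (msec. from CS onset)
    falls inside the scoring window (bounds assumed inclusive)."""
    return WINDOW_START_AFTER_CS <= latency <= cs_ucs_interval + WINDOW_END_AFTER_UCS


print([scoring_interval(i) for i in (400, 600, 800, 1000)])
```

Because the window lengthens with the CS-UCS interval, longer intervals give random blinks more opportunity to be scored, which is why the low blink rate matters for omitting a correction.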

Figure 1 shows the mean percentage of anticipatory
responses to the positive and negative stimuli plotted as
a function of successive blocks of trials. The first
block was four trials, subsequent blocks were eight
trials each. Because the right or left position of the
positive CS produced no difference in results, this
factor is ignored in this and all other figures and
computations. It is readily apparent from Fig. 1 that
conditioned discrimination increased as the CS-UCS
interval was extended from 400 through 600 to 800 msec.
There is some diminution in conditioned discrimination
with the 1000-msec. CS-UCS interval. The standard error
of the differences between pairs of positive and negative
points in Fig. 1 ranged between 2.00 and 3.50 percentage
units, so that for all groups there was statistically
significant discrimination between the positive and
negative stimuli in the last three blocks of trials. It
is of interest to note that with the short CS-UCS
intervals the responses to both the positive and negative
stimuli increased during the training session, whereas
with the longer CS-UCS intervals the responses to the
negative stimuli actually decreased during the course of
training. The greater conditioned discrimination obtained
with the longer CS-UCS intervals is thus due to a lowered
response to the negative stimulus.

FIG. 1. Percentage frequency of anticipatory responses to
the positive and negative stimuli during successive
blocks of acquisition trials for the four CS-UCS
intervals. (Panels: 400, 600, 800, and 1000 msec.;
ordinate: percent response; abscissa: blocks of trials.)

On the last 32 trials Ss within each group were separated
into two groups according to the criterion suggested by
Hartman and Ross (1961) for detecting Ss who give the
voluntary form response discovered by Spence and his
co-workers (Spence & Ross, 1959; Spence & Taylor, 1951).
All Ss with time derivatives (dx/dt) of anticipatory
responses to light greater than 35% of the time
derivative of the reflex to the UCS on more than half of
their responses were classified as voluntary responders
(Vs). Other Ss were called conditioners (Cs). The
derivative criterion classed 29 Ss as Vs; 7 in each of
the 400-, 600-, and 1000-msec. CS-UCS interval groups,
and 8 in the 800-msec. group. There were thus 12 or 13 Cs
in each group. The mean percentage of anticipatory
closures to the positive and negative stimuli over the
last 32 trials is plotted for both the Vs and Cs as a
function of CS-UCS interval in Fig. 2. The difference
between the positive and negative curves can be used as
an index of the degree of discrimination. Noting these
differences, it will be seen that degree of
discrimination increases progressively as CS-UCS interval
increases for the Vs. For the Cs, however, there appears
to be an optimum degree of differential conditioning at
the 800-msec. CS-UCS interval. Because the frequency of
responses to the negative stimulus in the Cs approaches
zero at the 1000-msec. interval and presumably would stay
low at higher intervals and because the responses to the
positive stimulus gradually decrease as the CS-UCS
interval is extended beyond 800 msec., the difference
between response rates to the positive and the negative
stimuli may be expected to decrease with longer CS-UCS
intervals.

FIG. 2. The percentage of anticipatory eyelid closures to
the positive and negative stimuli for voluntary
responders (V) and nonvoluntary responders (C) on the
last 32 acquisition trials as a function of the CS-UCS
interval. (Curves: V+, V-, C+, C-; ordinate: percent
response; abscissa: CS-UCS interval, 400 to 1000 msec.)
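
The derivative criterion used to separate Vs from Cs can be sketched as a simple classification rule. The sketch assumes one reflex derivative per S and arbitrary but consistent units; the function name and the example values are hypothetical, and the original scoring was done from polygraph records rather than by program.

```python
def classify_subject(response_derivatives, reflex_derivative, ratio=0.35):
    """Return 'V' (voluntary responder) if more than half of the S's
    anticipatory responses have a closure velocity (dx/dt) greater than
    `ratio` times the velocity of the reflex to the UCS; otherwise 'C'."""
    if not response_derivatives:
        return "C"  # no anticipatory responses to evaluate (assumption)
    sharp = sum(1 for d in response_derivatives if d > ratio * reflex_derivative)
    return "V" if sharp > len(response_derivatives) / 2 else "C"


# Hypothetical velocities; the reflex velocity is set to 100 units.
print(classify_subject([50, 60, 20, 45], reflex_derivative=100))
print(classify_subject([10, 20, 40, 15], reflex_derivative=100))
```

Applied to the Ss of this experiment, the criterion yielded 29 Vs; the counts given in the text (7, 7, 8, and 7 per group) leave 12 or 13 Cs in each group of 20.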

Even without a correction for the 
scoring interval the Cs show the same 
response function to the positive 
stimulus that is usually obtained in 
simple conditioning (e.g., Kimble,
1947). Responses to the negative 
stimulus are also clearly affected by 
the CS-UCS interval in spite of the 
fact that these stimuli are never rein- 
forced so that there is no real CS-UCS 
interval for the negative stimuli. 
This phenomenon involves some sort 
of transfer from the positive stimulus 
where CS-UCS interval has direct 
meaning. 

The differences between the CS- 
UCS interval functions for the Vs 
and the Cs indicate that in the differ- 
ential conditioning situation as in 
the simple conditioning situation 
(Gormezano & Moore, 1962; Hart- 
man & Grant, 1962; Spence & Ross, 
1959), the Vs follow different be- 
havioral laws in the eyelid condition- 
ing situation than do the Cs. 

In view of the differences in the response functions between the Vs and
Cs, one might legitimately ask why their data were combined in Fig. 1.
The answer is that if the curves of Fig. 1 were given separately for Vs
and Cs they would show essentially the same acquisition functions except
for higher response level in the Vs, especially to the positive stimulus
at the 1000-msec. CS-UCS interval, as shown in Fig. 2.

TABLE 1

ANALYSES OF VARIANCE OF DIFFERENCES IN FREQUENCY OF RESPONSES TO THE
POSITIVE AND NEGATIVE STIMULI ON LAST 32 ACQUISITION TRIALS

                               Voluntary Ss       Conditioned Ss
Source of Variation             df       F          df       F
Between CS-UCS intervals       (3)     2.12*       (3)     5.36**
  Linear                         1     8.30          1     9.80**
  Quadratic                      1     0.23          1     3.00
  Cubic                          1     1.05          1     3.27
Error (MS)                      25   (25.43)        47   (20.19)

Note.—The between CS-UCS sum of squares was computed from weighted means
and is unbiased. In the orthogonal decomposition of the trend unweighted
means were used to simplify the problem of unequal numbers of cases, but
the bias was negligible because the variation in the numbers of cases per
CS-UCS interval group was never greater than 1.

* P = .05.
** P = .01.

Table 1 summarizes the analysis 
of variance of the difference scores 
of responses to the positive and nega- 
tive stimuli on the last 32 trials as 
shown in Fig. 2. The significant 
linear components of the trends of the 
difference scores show that condi- 
tioned discrimination increases with 
increases in CS-UCS interval. Al- 
though there was no significant quad- 
ratic trend for the Cs, it seems evident 
that maximum discrimination occurs 
with CS-UCS intervals of 800 to 
1000 msec. 
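
The trend decomposition summarized in Table 1 can be illustrated with the
standard orthogonal polynomial coefficients for four equally spaced
levels (here the 400-, 600-, 800-, and 1000-msec. intervals). The sketch
below is not the authors' computation: it assumes equal numbers of cases
per group, and the function names and example numbers are hypothetical.

```python
# Orthogonal polynomial coefficients for four equally spaced levels.
LINEAR = (-3, -1, 1, 3)
QUADRATIC = (1, -1, -1, 1)
CUBIC = (-1, 3, -3, 1)


def contrast_value(means, coeffs):
    """Weighted sum of the group means for one trend component."""
    return sum(c * m for c, m in zip(coeffs, means))


def contrast_ss(means, coeffs, n_per_group):
    """Sum of squares (1 df) for a trend component, assuming equal n per group."""
    psi = contrast_value(means, coeffs)
    return n_per_group * psi ** 2 / sum(c * c for c in coeffs)


def trend_f(means, coeffs, n_per_group, ms_error):
    """F ratio for a 1-df trend component against the error mean square."""
    return contrast_ss(means, coeffs, n_per_group) / ms_error
```

With group means that rise in a straight line, only the linear component
is nonzero; a peak at an intermediate interval would show up in the
quadratic component instead.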

Examination of the response la- 
tencies for high and low discriminators 
among both the Vs and the Cs at each 
CS-UCS interval showed no consistent 
latency differences between the high 
and low discriminators. The Vs, 
however, had consistently shorter 


THOMAS F. HARTMAN AND DAVID A. GRANT 


response latencies than the Cs in each 
group. The latency differences be- 
tween these two types of Ss increased 
with increases in the CS-UCS interval. 
When responses were made to the 
negative CS, these responses did not 
differ in form or latency from re- 
sponses made to the positive CS. 


DISCUSSION 
Differential conditioning or conditioned discrimination was found to
increase with increasing CS-UCS intervals. This was due largely to the
greater
reduction in responses to the negative 
stimulus with the longer CS-UCS inter- 
vals. Although there was no statis- 
tically significant evidence for an opti- 
mum degree of conditioned discrimina- 
tion within the CS-UCS range studied, 
there is some indication that there is such 
an optimum for Ss who do not show the 
voluntary form of eyelid response. There 
may be a corresponding optimum CS- 
UCS interval for conditioned discrimina- 
tion among Ss showing the voluntary 
form response, but it is probably at a 
longer CS-UCS interval than those 
explored. 

The experiment provides some indica- 
tion as to why better discrimination 
occurs at the higher CS-UCS intervals. 
This is particularly true if attention were 
to be concentrated on Ss who do not 
show the voluntary form of eyelid re- 
sponse, that is, the Cs. For them the 
curve relating the amount of conditioning 
to the positive stimulus as a function of 
CS-UCS interval is conventional with 
an optimum conditioning at a CS-UCS 
interval of about 600 msec. or possibly 
less. If the measure of conditioned 
discrimination is to be taken as the 
difference between percent response to 
the positive and to the negative stimuli, 
attention should be directed to the func- 
tion relating responses to the negative 
stimuli to the CS-UCS interval, for the 
greater discrimination at longer CS-UCS
intervals is due to the rapid drop in this 
function. Actually it is not immediately 
obvious why there should be a functional 


DIFFERENTIAL EYELID CONDITIONING 135 


relationship of this form for responses 
to the negative stimulus. There is no 
true CS-UCS interval for the negative 
stimulus as the UCS never follows the 
negative CS. Therefore the interval is 
defined by events on trials where the 
positive CS is given. To get a function 
to the negative stimulus requires that 
events associated with the positive 
stimulus affect responses to the negative 
function, but it requires some manipula- 
tion of this concept to say the least. 
Alternatively, some additional mecha- 
nism that cannot act at the shorter 
CS-UCS intervals may be involved in the 
longer CS-UCS intervals. This conjec- 
ture is made attractive by the fact that 
the acquisition curves to the negative 
stimulus in Fig. 1 progress downward at 
the 800- and 1000-msec. intervals and 
upward at the 400- and 600-msec. 
intervals. 

The difference might depend upon 
perceptual or upon reaction systems or
both. On the one hand, longer CS-UCS 
intervals might permit a more complete 
perceptual response to the positiveness 
or negativeness of the CS, and this per- 
ceptual response might provide the 
effective CS conditioning along the lines 
proposed by Wickens et al. (Wickens, 
1959). On the other hand, inhibition of a 
response may simply require more time 
and be favored by longer CS-UCS inter- 
vals. Of course, a combination of both 
principles may be involved. Certainly 
longer CS-UCS intervals produce longer 
latency responses (Boneau, 1958). Also 
the Vs show generally shorter latency re- 
sponses than do the Cs who inhibit more 
effectively. But there are no differences 
in latencies between the good and poor 
discriminators among either the Vs or 
the Cs. Actually the experiment pro- 
vides little evidence for details of the 
mechanism of the better discrimination 
at the longer CS-UCS intervals, but the 
fact that the Cs give the conventional 
maximum for positive conditioning at 
about 600 msec. CS-UCS interval for 
responses to the positive stimulus may 
favor an interpretation in terms of
inhibition requiring longer time rather
than an interpretation based on a mediating
perceptual response.

It should be noted that the Vs gen- 
erally give more responses and are poorer 
at inhibition than are the Cs. In this 
respect their performance is like that 
reported by Hartman and Grant (1962) 
and also by Gormezano and Moore (1962), 
who note that the Vs show poorer extinc- 
tion generally. As was pointed out by 
Hartman and Grant, the Vs are by no 
means uniformly appropriate in their 
responses to the stimuli. In some re- 
spects their performance is reminiscent 
of Pavlov’s excitable type (Pavlov, 
1927, pp. 285-300; 1928, pp. 360-390). 

The results of the present experiment 
may provide a possible clue to the dis- 
crepancy between the results of Hilgard, 
Jones, and Kaplan (1951) who found less 
conditioned discrimination in high anx- 
ious Ss and the results of Spence and 
Farber (1954) and Spence and Beecroft 
(1954) who found greater discrimination 
in high anxious Ss. Hilgard, Jones, and 
Kaplan used a CS-UCS interval of 650 
msec., whereas Spence and Farber and
Spence and Beecroft used 500 and 490 
msec., respectively. The latter may have 
dealt with simpler basic response prin- 
ciples, whereas the former may have had 
additional complicating inhibitory prin- 
ciples operating to reverse the anxiety 
discrimination relationship. Whether 
the difference in CS-UCS interval will 
turn out to be the basis of the anxiety 
discrepancy or not, CS-UCS interval 
certainly seems to account for the hither- 
to puzzling discrepancy in the published 
acquisition curves of responses to the 
negative stimulus. Hilgard, Jones, and 
Kaplan found responses to the negative 
stimulus to decrease with successive 
acquisition trials as did our Ss at CS-UCS 
intervals greater than 600 msec. Spence 
and Beecroft found increasing respon- 
siveness to the negative stimulus as 
acquisition progressed. In this respect 
their findings were more like our 400- 
or 600-msec. CS-UCS interval groups. 
The responses to the negative stimuli 
in the Stanford and Iowa experiments
were thus appropriate to the CS-UCS
intervals utilized. 




SUMMARY 


Differential eyelid conditioning was stud- 
ied at four CS-UCS intervals (400, 600, 800, 
and 1000 msec.) with 20 Ss in each interval. 
All Ss received 88 training trials, 44 rein- 
forced trials with the positive CS, and 44 
unreinforced trials with the negative CS. 
The CS was a light and the UCS was a corneal 
air puff; the positive and negative CS ap- 
peared in two glass windows. The principal 
findings were as follows: 

1. Conditioned discrimination increased 
as the CS-UCS interval was increased. For 
Ss who never or rarely showed the voluntary 
form of eyelid response (Cs) the amount of 
conditioning to the positive stimulus showed 
the conventional optimum at about the 600- 
msec. CS-UCS interval or less. The increased
discrimination was caused by rapid decrease 
in the percentage of responses to the negative 
stimulus as the CS-UCS interval was extended. 

2. There were indications of optimum 
discrimination in the Cs at about the 800- 
msec. CS-UCS interval, but the difference 
between responsiveness to the positive and 
negative stimulus increased progressively 
over the CS-UCS range studied for Ss who 
showed the voluntary response form (Vs). 

3. Although it was conjectured that the 
superior differential conditioning at the 
longer CS-UCS intervals might have been 
due to a more complete mediating perceptual 
response, an interpretation in terms of longer 
time intervals required for inhibition also 
seemed plausible. 


REFERENCES 


BONEAU, C. A. The interstimulus interval and the latency of conditioned
eyelid responses. J. exp. Psychol., 1958, 56.

GORMEZANO, I., & MOORE, J. W. Effects of instructional set and UCS
intensity on the latency, percentage, and form of the eyelid response.
J. exp. Psychol., 1962, 63, 487-494.

HARTMAN, T. F., & GRANT, D. A. Effect of intermittent reinforcement on
acquisition, extinction, and spontaneous recovery of the conditioned
eyelid response. J. exp. Psychol., 1960, 60, 89-96.

HARTMAN, T. F., & GRANT, D. A. Effects of pattern of reinforcement and
verbal information on acquisition, extinction, and spontaneous recovery
of the eyelid CR. J. exp. Psychol., 1962, 63, 217-226.

HARTMAN, T. F., GRANT, D. A., & ROSS, L. E. An investigation of the
latency of "instructed voluntary" eyelid responses. Psychol. Rep., 1960,
7, 305-311.

HARTMAN, T. F., & ROSS, L. E. An alternative criterion for the
elimination of "voluntary" responses in eyelid conditioning. J. exp.
Psychol., 1961, 61, 334-338.

HILGARD, E. R., CAMPBELL, A. A., & SEARS, W. N. Conditioned
discrimination: The development of discrimination with and without
verbal report. Amer. J. Psychol., 1937, 49, 564-580.

HILGARD, E. R., CAMPBELL, R. K., & SEARS, W. N. Conditioned
discrimination: The effect of knowledge of stimulus-relationships.
Amer. J. Psychol., 1938, 51, 498-506.

HILGARD, E. R., JONES, L. V., & KAPLAN, S. J. Conditioned discrimination
as related to anxiety. J. exp. Psychol., 1951, 42, 94-99.

KIMBLE, G. A. Conditioning as a function of the time between conditioned
and unconditioned stimuli. J. exp. Psychol., 1947, 37, 1-15.

PAVLOV, I. P. Conditioned reflexes. (Trans. by G. V. Anrep) London:
Oxford Univer. Press, 1927.

PAVLOV, I. P. Lectures on conditioned reflexes. (Trans. by W. H. Gantt)
New York: International, 1928.

SPENCE, K. W., & BEECROFT, R. S. Differential conditioning and level of
anxiety. J. exp. Psychol., 1954, 48, 399-403.

SPENCE, K. W., & FARBER, I. E. The relation of anxiety to differential
eyelid conditioning. J. exp. Psychol., 1954, 47, 127-134.

SPENCE, K. W., & ROSS, L. E. A methodological study of the form and
latency of eyelid responses in conditioning. J. exp. Psychol., 1959, 58,
376-385.

SPENCE, K. W., & TAYLOR, J. Anxiety and the strength of the UCS as
determiners of the amount of eyelid conditioning. J. exp. Psychol., 1951,
42, 183-188.

WICKENS, D. D. Conditioning to complex stimuli. Amer. Psychologist, 1959,
14, 180-188.


(Received July 13, 1961) 


Journal of Experimental Psychology 
1962, Vol 64, No. 2, 137-141 


MIXING OF TWO TYPES OF S-R ASSOCIATIONS IN A CHOICE
REACTION TIME TASK 1


ROBERT E. MORIN
University of Arizona

AND

BERT FORRIN
University of Washington


1 This study was supported by National Science Foundation Grant G-14292
and by a grant from the University Research Institute of the University
of Texas. The research was conducted at the University of Texas. The
authors are indebted to Ted Langford for assistance in data collection
and analysis.


Recent studies by Brainard, Irby, Fitts, and Alluisi (1962) and Mowbray
(1960) have shown that the time required to name a numeral is
independent of the number of numerals in the stimulus set. Cast in the
context of information theory, these results broaden the spectrum of
tasks for which independence of transmitted information and reaction
time (RT) has been found. Other investigators, all of whom used key
pressing responses, have obtained similar results with extended practice
(Mowbray & Rhoades, 1959), high stimulus-response compatibility
(Leonard, 1959), and stimulus uncertainties of greater than 3 bits
(Seibel, 1959).

The above findings are of interest, first, because they contradict what
has been regarded as a well-established empirical generalization. Since
the work of Merkel (1885), it has been rather generally accepted that
disjunctive RT increases as the number of stimulus alternatives is
increased. In the past decade, with the application of information
measures to RT tasks, the generalization has been refined to state that
choice RT is a positive linear function of transmitted information.

The new results are of further interest because they imply a mode of
information processing different from that suggested by earlier studies.
If choice RT is a positive linear function of transmitted information
expressed in bits, it is easy to conceptualize the process of
transmitting information as a series of binary decisions, each decision
taking constant time. On the other hand, if there is independence of
choice RT and transmitted information, the picture is one of a parallel
processing mechanism with each S-R pair having its own "private line."
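
The contrast between the two pictures can be made concrete with a small
sketch. It assumes equiprobable alternatives and errorless performance,
so that transmitted information equals log2 of the number of
alternatives; the function names and the parameter values are
hypothetical and are not estimates from any of the studies cited.

```python
import math


def serial_binary_rt(n_alternatives, base_time, time_per_bit):
    """RT under a series of binary decisions, each taking constant time.

    With equiprobable alternatives and no errors, transmitted information
    is log2(n_alternatives) bits, so RT is linear in bits.
    """
    return base_time + time_per_bit * math.log2(n_alternatives)


def parallel_rt(n_alternatives, base_time):
    """RT under a 'private line' for each S-R pair: independent of set size."""
    return base_time
```

Under the serial account, going from two to four alternatives adds one
bit and therefore one decision time; under the parallel account the
predicted RT does not change.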

In view of the contrasting results 
of very recent and earlier studies, 
two types of S-R associations may be 
distinguished in terms of behavioral 
criteria. One type, hereafter called 
Type N (N for independence), in- 
cludes associations which normally 
produce independence of choice RT 
and transmitted information (e.g. 
naming numerals). The second type, 
hereafter called Type D (D for de- 
pendence), includes associations for 
which RT has been found to be pro- 
portional to transmitted information 
(e.g., numeral responses to geometric 
symbols). All of the studies upon 
which these preliminary distinctions 
are based have used homogeneous 
sets of associations, either all Type N 
or all Type D pairs. The present 
investigation is concerned with the 
effects of mixing the two types of pairs 
in a single task. Each type of pair 
will serve as a context for the other. 
The fundamental question is whether 
or not the two types of pairs interact. 
Are RTs for Type N pairs affected by 
the inclusion of Type D pairs, and 
what effect, if any, does the mixing 
have on RTs for the latter pairs? 




The answers to these questions should 
help to define the generality of recent 
findings showing independence of 
transmitted information and RT. 
Furthermore, they should extend the 
basis for theorizing concerning the 
translation mechanisms involved in 
information processing. 


METHOD 


Subjects.—The Ss were 50 male students recruited from classes in
elementary psychology at the University of Texas. Participation in the
experiment partially fulfilled a course requirement. Ten Ss were
assigned at random to each of five conditions. The Ss were tested
individually.

Apparatus.—The S was seated approximately 2 ft. in front of a milk glass
screen behind which was located an automatic slide projector. The
dimensions of the illuminated field were 4.0 × 5.5 in., and the height
of the projected images, which varied in shape, ranged from 2.0 to
2.5 in.

The RTs were determined to the nearest .01 sec. by a Hewlett-Packard
electronic counter, and were printed on paper tape by a Hewlett-Packard
digital recorder. A verbal response by S activated a voice key which
stopped the counter, terminated the projected image, and initiated an
automatic slide changing operation. The interval between a response and
the appearance of the subsequent slide was 2.1 sec. Errors were recorded
by E.

Procedure.—After a brief period in which Ss learned the S-R associations
from an instruction card, 12 blocks of 32 trials were given. There was a
4-min. rest after Block 6 and shorter rests of approximately 30 sec.
while E changed slide trays after other blocks. Stimulus sequences were
constructed to equate both frequencies of occurrence of all stimuli
within a condition and first-order transitional probabilities. The Ss
were instructed to react as rapidly as possible and to keep errors at a
minimum.

Experimental conditions.—The five experimental conditions are described
in Fig. 1. For purposes of exposition, the conditions are divided into
two groups, Exp. A (Cond. I, III, and IV) and Exp. B (Cond. II, III,
and V).

Fig. 1. Schematic diagram of the experimental conditions. (Panels:
Experiment A and Experiment B; columns: Cond., Critical Pairs, Context.)

In Exp. A, RTs to the critical Type N pairs (pairs common to all
conditions of the experiment) were studied as a function of both the
presence and type of associations used as context. In all conditions of
this experiment, the critical stimuli were 2's and 8's; the correct
response was to name the numeral displayed. In Cond. I only the critical
pairs were employed; they were without context. In Cond. III two Type D
pairs provided the context for the critical Type N pairs. The stimulus
set was increased by the inclusion of crosses and squares to which the
correct responses were 4 and 7, respectively. In Cond. IV two Type N
pairs (4-4, 7-7) were used as context for the critical pairs.
In Exp. B, RTs to the critical Type D 
pairs (cross-4, square-7) were investigated 
under conditions of no context (Cond. II), 
a context of Type N pairs (Cond. III), and a 
context of Type D pairs (Cond. V). It is to 
be noted that data of Cond. III are included 
in both experiments. 

Response measures.—Four response measures were computed for each S:
(a) mean RT to the critical stimuli; (b) information transmitted by the
total set of stimulus elements (T_t); (c) information transmitted by the
critical stimuli (T_c); and (d) rate of information transmission for the
critical stimuli (T_c/RT).

In Cond. III, IV, and V all response measures but the second were based
on the 192 trials on which critical stimuli (2's and 8's in Exp. A,
crosses and squares in Exp. B) were presented. In Cond. I and II, since
critical stimuli were presented on all trials, two alternative
procedures were possible: (a) to determine response measures from the
data of all 384 trials, and (b) to use only the initial 192 trials. The
first alternative serves to equate total amount of practice on the task
for all conditions; the second provides for comparable amounts of
practice in responding to the critical stimuli. In the case of the three
response measures in question, both procedures were followed with nearly
identical results. Decisions concerning differences among experimental
conditions were not affected by the number of trials upon which response
measures for Cond. I and II were based. For this reason all statistical
analyses reported involve the comparison of conditions with total
practice on the task equated.

As noted above, two measures of transmitted information were calculated
for the experimental conditions. The first, T_t, was based on responses
to all stimuli (384 trials) and is the measure commonly employed in
studies relating choice RT to transmitted information. The second, T_c,
was based solely on responses to critical stimuli. The latter measure
was required because T_t does not discriminate between errors to
critical and to contextual stimuli. T_t and T_c were identical in Cond. I
and II. The maximum values for T_t, attainable with errorless
performance, were 1 bit in Cond. I and II and 2 bits in Cond. III, IV,
and V. The maximum value for T_c was 1 bit for all conditions.
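
The maxima quoted above follow from the usual definition of transmitted
information, T = H(S) + H(R) - H(S, R), computed from a
stimulus-response confusion table. The sketch below is an illustration
under that assumption, not the authors' scoring procedure; the function
names, the dictionary layout, and the counts are hypothetical.

```python
import math


def entropy(counts):
    """Shannon entropy in bits of a list of nonnegative counts."""
    total = sum(counts)
    return -sum((c / total) * math.log2(c / total) for c in counts if c > 0)


def transmitted_information(confusion):
    """T = H(S) + H(R) - H(S, R) for a dict {(stimulus, response): count}."""
    stim_totals, resp_totals = {}, {}
    for (stim, resp), count in confusion.items():
        stim_totals[stim] = stim_totals.get(stim, 0) + count
        resp_totals[resp] = resp_totals.get(resp, 0) + count
    return (
        entropy(list(stim_totals.values()))
        + entropy(list(resp_totals.values()))
        - entropy(list(confusion.values()))
    )


def transmission_rate(t_bits, mean_rt_sec):
    """Rate of information transmission in bits per second."""
    return t_bits / mean_rt_sec


# Hypothetical errorless tables: two and four equiprobable stimuli.
two_pairs = {("2", "2"): 192, ("8", "8"): 192}
four_pairs = {(s, s): 96 for s in ("2", "8", "4", "7")}
# Hypothetical table with a few naming errors on one stimulus.
with_errors = {("2", "2"): 186, ("2", "8"): 6, ("8", "8"): 192}
```

Restricting the table to rows for the critical stimuli gives a measure
in the spirit of the one based solely on critical stimuli; errors lower
the value below the errorless maximum.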


RESULTS 


Means for each of the response measures are given in Table 1. For three
measures (RT, T_c, and transmission rate) overall comparisons of the
means for the three conditions within each experiment were made. When an
analysis yielded a statistically significant F, Duncan's range test was
applied to make more analytical comparisons. Levels of significance
cited below are for Duncan's test.

In Fig. 2 mean RTs for the conditions of both experiments are plotted as
a function of T_t. As is apparent from the figure, the number of trials
on which response measures for Cond. I and II are based does not
materially alter the nature of the RT-T_t functions.

TABLE 1

MEAN RESPONSE MEASURES FOR THE CONDITIONS OF EXP. A AND B

Exp.               Cond.    RT       T_t      T_c      T_c/RT
                            (sec.)   (bits)   (bits)   (bits/sec.)
A (Type N pairs)   I        .52       .93      .93     1.81
                   III      .60      1.92     1.00     1.68
                   IV       .49      1.97      .98     2.04
B (Type D pairs)   II       .59       .86      .86     1.48
                   III      .71      1.92      .86     1.22
                   V        .72      1.84      .93     1.82

Fig. 2. Reaction time in seconds plotted as a function of total
transmitted information in bits. (Legend: circles, Experiment A;
triangles, Experiment B; solid line, total practice equated; dashed
line, practice on critical stimuli equated.)

Initially the results may be examined to determine whether the
associations used met the behavioral criteria for Type D and Type N
pairs. The significant slope of the line defined by the means for
Cond. II and V (P < .001) satisfies the requirement for the
identification of symbol-numeral associations as Type D pairs. For the
numeral-numeral associations (Cond. I and IV) RT was independent of mean
T_t. In fact, the observed slope, though nonsignificant (P > .05), was
slightly negative. However, fewer errors to the critical stimuli in
Cond. IV than in Cond. I resulted in a significantly higher T_c
(P < .01) and a significantly faster transmission rate (P < .05) in the
former condition. Though these findings indicate that the two conditions
were not behaviorally equivalent in all respects, it is nevertheless
clear that performance in Cond. IV was not inferior to that in Cond. I.

The effects of mixing the two types 
of pairs can be determined by com- 
paring performance in Cond. III with 
that in other conditions of each experi- 
ment. In Exp. A the mean RT for 
Cond. III was significantly greater 
than the means for Cond. I (P < .01) 
and IV (P <.001). Reactions to 
numeral stimuli (2's and 8's) were 
slowed by the addition of geometric 
symbols to the stimulus set. The 
increase in RT in Cond. III was 
accompanied, however, by a somewhat
compensating decrease in error
rate to the critical stimuli such that 
mean transmission rate, though lower 
than that for both other conditions, 
differed significantly only from Cond. 
IV (P < .001). 

In Exp. B reactions to the critical 
stimuli (crosses and squares) were 
similarly affected whether numerals 
or geometric symbols were used as 
contextual stimuli. Conditions III 
and V did not differ significantly on 
any response measure. As compared 
to Cond. II, however, performance in 
Cond. III was characterized by a sig- 
nificantly higher mean RT (P <.001) 
and a significantly lower mean trans- 
mission rate (P < .001). 


DISCUSSION 


The results lend support to the gen- 
eralization that RT is independent of 
mean transmitted information in a 
numeral-naming task. The response to a 
particular numeral is not degraded by the 
fact that other numerals might have 
occurred. Thus, numeral-numeral asso- 
ciations appear to be functionally isolated 
from one another. 

A test of the generality of this con- 
clusion is provided by the mixing of 
Type N and Type D pairs. If Type N 
associations are in functional isolation 
not only from one another but from 
Type D associations as well, RTs for
Type N pairs should be unaffected by
the inclusion of Type D pairs in the S-R 
set. Complete functional isolation of 
Type N pairs would further imply that 
the presence of such pairs as context 
should exert no influence on reactions 
to Type D pairs. If the translation 
process involved in the naming of a 
numeral is an encapsulated and auto- 
matic event, Ss should, so to speak, be 
able to let the numeral-numeral associa- 
tions take care of themselves; reactions 
to crosses and squares in Cond. III 
should be much as they were in Cond. II. 

Neither of the outcomes consistent 
with the extension of the concept of 
functional isolation to the case of mixed 
pairs was obtained. Response latencies 
for Type N pairs were lengthened by the 
addition of Type D pairs. Though in- 
creased RTs were partially compensated 
for by a decrease in errors it is apparent 
that numeral naming was sensitive to 
contextual associations. In addition, 
RTs for Type D pairs were clearly 
related to the bivariate probability 
distribution of all stimulus-response 
events. A context of Type N pairs 
(Cond. III) produced increases in RT
of the same magnitude as a context of 
Type D pairs (Cond. V). The Ss in 
Cond. III did not react as though crosses 
and squares were the only stimuli which 
might occur. 

The present results indicate that RTs
for numeral-numeral pairs can be affected 
by the context in which they are im- 
bedded. The disruption of functional 
isolation for Type N pairs suggests that 
an elementary parallel processing mode 
is too simple for characterizing transla- 
tion mechanisms, even for highly over- 
learned associations. 


SUMMARY 


Two types of S-R associations were distinguished in terms of behavioral
criteria. Type N associations (e.g., numeral-numeral pairs) normally
produce independence of choice RT and transmitted information. For
Type D associations (e.g., symbol-numeral pairs) choice RT is a positive
linear function of transmitted information. The present study
investigated the effects of mixing Type N and Type D associations upon
the RT-transmitted information function.

Ten male Ss were randomly assigned to each of five experimental
conditions defined by the character of the S-R set: (I) two Type N
pairs, (II) two Type D pairs, (III) two Type N and two Type D pairs,
(IV) four Type N pairs, and (V) four Type D pairs. Analyses examined the
reactions to the two critical Type N pairs common to Cond. I, III, and
IV (Exp. A) and to the two critical Type D pairs common to Cond. II,
III, and V (Exp. B). In Exp. A, mean RT to the critical Type N pairs was
not degraded by the expansion of the S-R set from two to four Type N
pairs (Cond. I vs. Cond. IV); in contrast, mean RT was increased
significantly by the addition of two Type D pairs (Cond. I vs.
Cond. III). In Exp. B, response latencies to the critical Type D pairs
were shortest in Cond. II. The addition of two Type N pairs to the S-R
set (Cond. III) lengthened mean RT to the critical Type D pairs by an
amount comparable to that obtained by the addition of two Type D pairs
(Cond. V).

The results of this study were considered to limit the generality of the
proposition that RT for numeral-numeral associations is independent of
the informational properties of the task.


REFERENCES 


BRAINARD, R. W., IRBY, T. S., FITTS, P. M., & ALLUISI, E. A. Some
variables influencing the rate of gain of information. J. exp. Psychol.,
1962, 63, 105-110.

LEONARD, J. A. Tactual choice reactions: I. Quart. J. exp. Psychol.,
1959, 11, 76-83.

MERKEL, J. Die zeitlichen Verhältnisse der Willensthätigkeit. Phil.
Stud., 1885, 2, 73-127.

MOWBRAY, G. H. Choice reaction times for skilled responses. Quart. J.
exp. Psychol., 1960, 12, 193-202.

MOWBRAY, G. H., & RHOADES, M. V. On the reduction of choice reaction
times with practice. Quart. J. exp. Psychol., 1959, 11, 16-23.

SEIBEL, R. Discrimination reaction time as a function of the number of
alternatives and of the particular stimulus-response patterns. Amer.
Psychologist, 1959, 14, 396. (Abstract)


(Received July 15, 1961) 


Journal of Experimental Psychology
1962, Vol. 64, No. 2, 142-150


HUE GENERALIZATION AND HUE DISCRIMINABILITY
IN MACACA MULATTA 1


LEO GANZ 2
University of Chicago 


The principle that when an organ- 
ism has difficulty in discriminating 
two stimuli, he will be more likely to 
generalize a response from the one, 
a training stimulus (CS), to the other, 
a generalization stimulus (GS), is 
widely adopted by behavior theorists 
(Brown, Bilodeau, & Baron, 1951; 
Gewirtz, Jones, & Waerneryd, 1956; 
Hull, 1943; Lashley & Wade, 1946; 
Pavlov, 1927). 

In the context of such unanimity 
of opinion, the paucity of corrobora- 
tive material is surprising. The ex- 
perimental analysis of this question 
thus far has concentrated on general- 
ization gradients to hue in different 
spectral regions where it is known 
that sensitivity varies. In the pigeon 
these gradients have failed to reflect 
what we know of that S’s differential 
hue threshold function (Guttman & 
Kalish, 1956). A similar study in 
which intra-S gradient comparisons 
were possible did obtain consistent 
differences in generalization related 
to wave length, but not the ex- 
pected relationship to the jnd function 
(Blough, 1961). In the human,
generalization gradients obtained from
voluntary responses (method of single
stimulus) have parallelled the jnd
function (Kalish, 1958). The question
remains, clearly, very much open.

1 This investigation was supported by Research Grants B-771 and B-1590
from the National Institute of Neurological Diseases and Blindness of
the National Institutes of Health, Public Health Service to Austin H.
Riesen. A portion of this paper was submitted by the author as part of a
doctoral dissertation, the University of Chicago, 1959. The author
wishes to thank Austin H. Riesen for his counsel throughout the study.

2 Now at Brown University.

The evidence that is available to 
us suggests that the color vision of 
the rhesus macaque (Grether, 1939) 
bears a strong resemblance to human 
trichromacy. This makes it feasible 
to compare, as is done in the present 
study, the generalization gradients 
of individual rhesus macaques in a 
succession of different spectral re- 
gions with the corresponding human 
differential hue threshold function. 
Our working hypothesis is that in 
spectral regions where the jnd, in 
millimicrons of wave length, is small 
(good discriminability) the generali- 
zation gradient will be steep in slope 
for CS and GS contained in that re- 
gion; where the jnd is large (poor 
discriminability) the gradient will be 
shallow. 

METHOD 


Subjects.—The Ss were 6 pre-adolescent Macaca mulatta. Two Ss,
Ka and Bu, had been used in previous studies involving a
black-white discrimination for solid food reinforcement and a
conditioned galvanic skin reflex with electric shock as UCS.
The remaining 4 Ss were experimentally naive.

Apparatus.—Discrimination training and generalization testing
were administered with S seated in a primate chair enclosed
within a light-tight cubicle (described elsewhere in greater
detail, Ganz & Riesen, 1962). Essentially, the chair departs
from current models in permitting control of head position by
the use of side-pieces above the neckboard. The neckboard is
sufficiently wide to prevent the monkey from reaching his face.
The S wears light-diffusing contact lenses. Stimulus input is
thus rendered largely independent of receptor-orienting
behavior. There is a key underneath the neckboard. When
reinforcement is available, a key press will activate a solenoid
valve for 1 sec. This will, in turn, release 3-4 drops of 5%
sucrose solution, directed to S’s mouth via tube; a 60-cycle
buzzer connected in parallel with the solenoid provides
secondary reinforcement.

The stimuli were produced by passing the 
beam of a 300-w. projection lamp through 
Bausch and Lomb second-order interference 
filters, a Wratten No. 8 filter, and plastic 
polarizing materials. The interference filters, 
Series 33-78, were used to obtain monochromatic
pass-bands covering 449 through 631 mμ in
nominal 10-mμ steps (range of step size is
4-18 mμ; half band width averages 8 mμ;
peak transmission averages 35%).
The yellow gelatin absorbed third-order inter- 
ference peaks in the short wave lengths. The 
polarizing materials were used to approximate 
an equal luminosity spectrum. The energy 
transmission was estimated with a Weston 
photoelectric cell No. 8PVIAAF. By correct- 
ing for both the spectral sensitivity of the 
cell and of the human eye (photopic lumi- 
nosity function) an estimate of the luminosity 
was obtained. Each interference filter was 
then coupled with a pair of suitably rotated 
polarizers to give an equal luminosity spec- 
trum for the human O. The evidence sug- 
gests this is justified for the macaque, except 
possibly for the deep red region where lumi- 
nosity could be lower for that S (Grether, 
1939). The beam, after entering the cubicle, 
was finally focused by a lens system as a 7-in. 
disc on S’s left eye; the right eye carried a 
black opaque contact lens. With the 564-mμ
peak filter in place, illumination was estimated 
at 12.3 ft-c, using a Macbeth illuminometer 
in a heterochromatic match. The Tenite 
white contact lens diffused nonselectively in 
the visible region, with approximately 10% 
transmission. 

The stimulus presentations were cycled 
automatically. White noise was delivered 
by a speaker within the cubicle to mask relay 
cues. Periodic checks revealed complete 
generalization across a variety of changes in 
auditory cues. 

Training sequence.—During initial training, S^D was continuously
present for the 30-min. session. The response, a key press, was
developed by a method of successive approximation with continuous
reinforcement. Once criterion was attained (a minimum of 50
responses over 2 consecutive days), stimulus cycling was
introduced with 15 sec. S^D alternating with 5 sec. blackout, and
reinforcement was given on a 7.5-sec. variable interval schedule
(VI), to a similar criterion level. Next, a 7.5-sec. delay of
positive reinforcement (Ferster, 1958) was introduced to key
responses emitted during the blackout period, and again training
was carried to the same criterion level. Next, the delay was
increased to 15 sec. and carried to a criterion of a maximum of
20% responses during blackout over 2 consecutive days. Lastly,
VI 15 sec. was introduced and training continued to a similar
criterion (over 4 consecutive days). In the next sessions,
generalization tests were administered. Following the tests, Ss
were introduced to discrimination training to wave length. The
S^D now alternated in irregular fashion with S^Δ. During S^D,
VI 7.5 sec. was reinstituted. In the presence of the blackout,
or of S^Δ, negative reinforcement was delivered by delaying
positive reinforcement 7.5 sec. following each response
(criterion: 80% responses emitted during S^D and 100 responses
per session, minimum over 2 consecutive days). Finally, the
reinforcement and delay schedules were increased to 15 sec.
Training was carried to a similar criterion, but over 4 days.

Generalization testing entailed the presentation of a series of
seven hues, from S^D through S^Δ. The SGs were cycled in the same
manner as the S^D and S^Δ, but without either type of
reinforcement. There were 7 testing days and each followed the
sequence: 10 training trials, four SG presentations, 13 training
trials, three SG presentations, 30 training trials. For reasons
which will be given in the discussion, it seemed advisable to
keep the number of consecutive generalization stimuli as small
as possible. The order of presentation of the seven SGs was
randomized according to a 7 × 7 Latin square (Days × Order).
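The Days × Order counterbalancing can be sketched in Python as follows. This is only an illustration under stated assumptions: the paper does not say which 7 × 7 square was used, so a simple cyclic construction is assumed, and the name `cyclic_latin_square` is hypothetical. The hue labels are the seven generalization stimuli listed for the 509-449 region in Table 1.

```python
def cyclic_latin_square(items):
    """Return an n x n Latin square built by cyclic rotation (an assumed construction).

    Row d gives the presentation order on testing day d; each item appears
    exactly once per day and exactly once in each ordinal position.
    """
    n = len(items)
    return [[items[(day + pos) % n] for pos in range(n)] for day in range(n)]


# Seven generalization hues (in millimicrons) for the 509-449 region of Table 1.
hues = [509, 503, 489, 483, 469, 457, 449]
square = cyclic_latin_square(hues)

for day, order in enumerate(square, start=1):
    print(f"Day {day}: {order}")
```

Each row is one testing day and each column one ordinal position, so order effects are balanced across the 7 days.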

Experimental design.—The spectrum from 449 to 631 mμ was divided
into three regions: 449-509 mμ, 509-567 mμ, and 567-631 mμ. The
ends of each of these regions comprised the S^D and S^Δ. This
division achieves a rough symmetry, with a jnd maximum midway
between 449 and 631 mμ (535 mμ), a minimum 45 mμ from the ends
(approximately 500 and 587 mμ), and maxima at the ends of this
spectrum (449 and 631 mμ).

First, one of the hues 449, 509, 567, or 631 mμ was chosen as S^D
for the undiscriminated operant. Then, each S was trained to
discriminate pairs of hues in all three regions in succession,
with generalization testing following the attainment of each
discrimination. The 6 Ss were assigned to two groups with
opposite S^D, S^Δ assignments. The successive discriminations did
not involve any reversals in absolute stimulus value. Within each
group, each S followed a different succession of discriminations
and generalization tests,


TABLE 1

TRAINING AND GENERALIZATION STIMULI

Response        | Training              | Generalization
Simple operant  | 631, 567, 509, or 449 | 631 | 604 | 567 | 540 | 509 | 483 | 449

                | Group I     | Group II    |
Discriminated   | S^D  | S^Δ  | S^D  | S^Δ  | Generalization
                | 631  | 567  | 567  | 631  | 625 | 610 | 604 | 588 | 580 | 567
                | 509  | 567  | 567  | 509  | 564 | 546 | 540 | 527 | 519 | 509
                | 509  | 449  | 449  | 509  | 509 | 503 | 489 | 483 | 469 | 457 | 449

Note.—Wave length in millimicrons.


as shown in Fig. 1. The details of this design are presented in
Table 1.

RESULTS

The experimental hypothesis postulates an inverse relationship
between the steepness of generalization and the magnitude of the
jnd for the specific SG. We predict, on the basis of the human
differential hue function (Wright, 1947, Fig. 95), relative
shallowness of gradient in the 630-, 530-, and 450-mμ regions
(where jnd’s are relatively large) and relative steepness in the
580- and 490-mμ regions (where jnd’s are small).


Fig. 1. Individual generalization gradients. (The dashed lines
designate the simple operant gradients, arrow at the S^D; the
solid lines designate the discriminated operant gradients. The
numbers identify the order of administration of the successive
discriminations and generalizations. Six Ss are depicted here.)
[Abscissa: wavelength in millimicrons.]


Gradients were obtained by simply 
summing responses obtained by each 
SG. Figure 1 depicts the generaliza- 
tion of both the undiscriminated and 
the three discriminated operants of 
the 6 Ss. Generalization of the un- 
discriminated operant was measured 
in only 4 Ss. It is apparent, first, 
that the undiscriminated operant gave 
an almost horizontal gradient. There 
is a mild downward tendency which 
does not appear related to jnd magni- 
tude in the manner predicted. Such 
near-horizontal gradients are of little 
value in testing the effect of differ- 
ences in jnd size. Any differences 
cancel out when cumulated over more 
than 50 mμ of wave length.

The three gradients following discrimination training are much
steeper. Their shapes are, in almost all cases, bell-shaped,
i.e., the response decrement first accelerates and then
decelerates. From a casual perusal, it appears Ss Ha, Ch, and Bu
show steep gradients (descending) in the 580-mμ region (jnd
small); Ss Ka, Fo, and Be show steepness in the 490-mμ region
(jnd small); Ha, Ch, and Bu show shallowness of gradient in the
450-mμ region (jnd large); Ka and Be show shallowness of gradient
in the 630-mμ region (jnd large). These gradient differences are
in the direction predicted by the experimental hypothesis.

If we take each gradient and express the number of responses
emitted during an SG presentation as a percentage of the maximum
emitted during any of the seven SGs, we can plot relative
gradients, which make slope comparisons easier. This is done for
all 6 Ss in Fig. 2. The following would appear consistent for all
3 Ss in Group I: the 509-449 mμ gradient is steeper than the
631-567 mμ gradient in its descent from S^D to S^Δ (in this case
the 631-mμ S^D has a larger jnd than the 509-mμ S^D). For Group
II the 3 Ss show shallowness in the slope of descent in the
449-509 mμ region (jnd at the 449-mμ S^D is relatively large) and
Ss Ha and Ch show delayed ascent in the 631-mμ region (jnd at the
631-mμ S^Δ is relatively large). Moreover, the relationships
between the three gradients in Group I are reversed in Group II.
This should follow if jnd’s are exercising any effect since the
S^D, S^Δ assignments are reversed from Group I to Group II. The
results suggest a generalization-discriminability relationship in
the direction predicted. We now turn to a statistical evaluation.

Fig. 2. Individual generalization gradients of 6 Ss in three
adjoining spectral regions. (Response totals are here expressed
as percentages of the maximum response total.) [Ordinate: percent
of maximum response total. Abscissa: wavelength deviations from
S^D in millimicrons.]
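The percentage-of-maximum normalization behind the relative gradients can be sketched in Python as follows. This is a minimal illustration: the function name `relative_gradient` and the response totals are hypothetical, and no data from the study are reproduced.

```python
def relative_gradient(response_totals):
    """Express each SG's response total as a percentage of the largest total."""
    peak = max(response_totals)
    if peak == 0:
        raise ValueError("no responses were emitted to any SG")
    return [100.0 * total / peak for total in response_totals]


# Hypothetical response totals for the seven SGs, ordered from S^D to S^Δ.
totals = [200, 190, 150, 90, 40, 10, 0]
print(relative_gradient(totals))
```

Because every gradient is rescaled to its own maximum, slopes can be compared across Ss with very different overall response rates.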


As is customarily the case with stimulus 
generalization gradients, response strength 
was highly correlated with its variability, 
thus precluding analysis of variance either on 
the raw data or on any customary transforma- 
tions. Our main interest centered on some 
simple index of gradient slope. The slope of 
a generalization curve can be viewed as a 
measure of the variability of the stimuli 
which can elicit the training response. This 
can be estimated by taking the cross product
of each S^D-SG distance in millimicrons and
the number of responses that SG elicited,
summing over the seven cross products of a
test day, and dividing by the total number
of responses elicited that day. We obtain an
average deviation statistic which is directly
proportional to slope shallowness. Analysis
of variance, mixed design, was performed on
this statistic. If changes in jnd size modify
gradient slope, this would be reflected here as
a significant Group × Region interaction,
which was in fact the case with a P < .02
(F = 7.45; df = 2/24). The design, it will
be recalled, placed one group’s predicted
shallowness of gradient against the other’s
predicted steepness, thereby achieving a null
combined effect for the two groups together.
Accordingly, the Regions SS was not above
chance expectation (F < 1.00). A significant
Day SS (F = 2.89; df = 6/8; P < .05)
reflects the effect of extinction, the gradients
becoming steeper on successive testing days.
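The average deviation statistic described above can be sketched in Python as follows. This is a minimal illustration: the function name `average_deviation` and the response counts are hypothetical, while the SG wave lengths are those listed for the 509-449 region in Table 1.

```python
def average_deviation(sd_wavelength, sg_wavelengths, responses):
    """Response-weighted mean S^D-SG distance (in millimicrons) for one test day.

    Larger values mean responding is spread farther from S^D,
    i.e., a shallower gradient.
    """
    total = sum(responses)
    if total == 0:
        raise ValueError("no responses were emitted on this test day")
    cross_products = sum(
        abs(wavelength - sd_wavelength) * count
        for wavelength, count in zip(sg_wavelengths, responses)
    )
    return cross_products / total


sgs = [509, 503, 489, 483, 469, 457, 449]   # SG peaks from Table 1
steep = average_deviation(509, sgs, [40, 30, 20, 10, 0, 0, 0])       # hypothetical counts
shallow = average_deviation(509, sgs, [20, 20, 20, 20, 20, 20, 20])  # hypothetical counts
print(steep, shallow)
```

A flat gradient gives the larger value, which matches the statement that the statistic is directly proportional to slope shallowness.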


Are the magnitudes of these gradient differences in proportion
to the magnitudes of the corresponding jnd differences? Since the
peak transmissions of the filters are not spaced along equal
steps, gradients across regions can be compared only if one
interpolates, linearly in this case, along 10-mμ steps. It is
assumed that for distances of 10 mμ, rhesus generalization
gradients do not depart seriously from linearity, which seems
not unreasonable at the present level of precision. The question
of interest is the departure of some specific gradient from an
average gradient, i.e., with a difference gradient. Therefore,
the three gradients of each S were combined to obtain his average
gradient, and then the individual gradients were expressed as
deviations from this average. These are shown in Fig. 3 for the
individual Ss. Values above zero reflect relatively shallow
gradients; negative values reflect relatively steep gradients.
Each of the six graphs represents a group’s performance in one
region. It can be seen, first, that a certain measure of inter-S
agreement is present in gradient deviations, for example, the
Group II curves in the 567-631 mμ region, and the Group I curves
for the same region. This again reflects the wave-length-tied
discriminability function. Systematic deviations for the middle
spectral region, 567-509 mμ, are weak if present at all.
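The interpolation to 10-mμ steps and the expression of each gradient as a deviation from the S's average gradient can be sketched in Python as follows. This is a minimal illustration: all function names are hypothetical, the percentage values are invented for the example, and combining the three gradients by a simple arithmetic mean is an assumption.

```python
def interpolate_at(x, xs, ys):
    """Linearly interpolate y at x from increasing xs and matching ys."""
    for i in range(len(xs) - 1):
        x0, x1 = xs[i], xs[i + 1]
        if x0 <= x <= x1:
            return ys[i] + (ys[i + 1] - ys[i]) * (x - x0) / (x1 - x0)
    raise ValueError("x lies outside the measured range")


def resample(xs, ys, grid=range(0, 61, 10)):
    """Read a gradient off at equal steps from S^D (0, 10, ..., 60 millimicrons)."""
    return [interpolate_at(x, xs, ys) for x in grid]


def deviation_gradients(gradients):
    """Subtract the point-by-point average gradient from each gradient."""
    n = len(gradients)
    average = [sum(column) / n for column in zip(*gradients)]
    return [[g - a for g, a in zip(gradient, average)] for gradient in gradients]


# Hypothetical relative gradients (percent of maximum) for one S in three regions,
# each given at unequal distances from S^D in millimicrons.
region_a = resample([0, 6, 20, 26, 40, 52, 60], [100, 95, 60, 45, 20, 8, 5])
region_b = resample([0, 6, 21, 27, 43, 51, 64], [100, 98, 85, 75, 40, 25, 5])
region_c = resample([0, 10, 20, 30, 40, 50, 60], [100, 90, 70, 50, 30, 15, 5])

deviations = deviation_gradients([region_a, region_b, region_c])
for row in deviations:
    print([round(value, 1) for value in row])
```

Positive entries mark points where a region's gradient lies above the S's average (relatively shallow), and negative entries mark points where it lies below (relatively steep).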

Figure 3 brings to light an interesting finding. There are never
deviations from average gradients at the S^D and S^Δ themselves.
The largest deviations appear between S^D and S^Δ. This is
interesting because at 631 mμ, e.g., one of the training stimuli,
there is a jnd decidedly larger than average, yet no
corresponding deviation in generalization; the same can be said
for 449 mμ. At other loci, e.g., 480 and 600 mμ, positioned
between S^D and S^Δ, there is a strong deviation from average
both in the magnitude of the jnd and in the generalization
gradient. This seems to imply that the generalization gradient is
responsive, in the present experiment, to the deviations in the
cumulated jnd, not to deviations in the jnd itself.

Fig. 3. Deviation gradients in three spectral regions. (Each
figure depicts one group’s performance in one region. The filled
data points record deviations of individual subjects from their
own average gradients. The dashed line depicts deviations from
the cumulated average jnd.) [Panels: Group I, Group II.
Ordinates: deviations from average % gradients; deviations from
average elapsed jnd’s. Abscissa: wavelength in millimicrons.]

A measure of deviations in cumulated jnd’s was derived. For the
ith spectral region (i = 449-509 mμ; 509-567 mμ; 567-631 mμ), an
average jnd was first computed. The average jnd for the region i
equals the difference between S^D_i and S^Δ_i in millimicrons
divided by the number of jnd’s included between S^D_i and S^Δ_i.
A specific generalization stimulus in the region i is then taken,
SG_ij (j = 0, 10 mμ, 20 mμ . . . etc. from S^D_i) and the
distance is counted off between the training stimulus S^D_i and
SG_ij in average jnd’s; also that same distance is counted off in
the jnd’s obtained from the human Δλ function. The difference
between the two counts gives the measure desired, depicted by the
dashed lines in Fig. 3. Negative values reflect an accumulation
of larger-than-average jnd’s (poor sensitivity); positive values
reflect an accumulation of smaller-than-average jnd’s (high
sensitivity). By definition, this measure is zero both at S^D and
S^Δ. Figure 3 shows the function to be, if anything, inversely
related to the gradient deviations. This is clearest in the
449-509 mμ region where Group II has, from 449 to 489 mμ,
accumulated larger-than-average jnd’s, thus giving a deviation of
−.11 at 479 mμ (poor sensitivity) and positive deviation in
generalization gradient at that point (shallow slope). For Group
I, as one progresses from the S^D at 509 mμ one cumulates
smaller-than-average jnd’s because we are in a relatively
sensitive spectral region, to a maximum of +.11 at 479 mμ. The
generalization gradient has an increasingly negative deviation
(slope is steep) that increases until 479 mμ and then returns to
zero when S^Δ is reached. In the 631-567 mμ region, Group I and
Group II show the reverse progression but the same
discriminability-generalization relationship. Now it is Group I
that cumulates larger-than-average jnd’s as one progresses from
the S^D at 631 mμ to a deviation of −.10 at 611 mμ (poor
sensitivity) and a positive deviation in generalization (shallow
slope of generalization). Group II accumulates
smaller-than-average jnd’s from the S^D at 567 mμ to 607 mμ (high
sensitivity) and a negative deviation in generalization (steep
slope of generalization). In the middle region, 567-509 mμ, where
the cumulated-jnd deviation function reverses in midstream, the
relationship is either absent or so attenuated as to be
indiscernible.
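The cumulated-jnd deviation measure can be sketched in Python as follows. This is an illustration under several assumptions: the jnd function `hypothetical_jnd` is invented (it is not Wright's data), the number of jnd's between two wave lengths is approximated by summing 1/jnd over 1-mμ steps, and the result is expressed in raw jnd counts, so the scaling of the values reported in the text is not reconstructed here.

```python
def jnd_count(start, stop, jnd_at):
    """Approximate the number of jnd's between two integer wave lengths.

    Sums 1/jnd over 1-millimicron steps, evaluating the jnd at each step's midpoint.
    """
    low, high = sorted((start, stop))
    return sum(1.0 / jnd_at(w + 0.5) for w in range(low, high))


def cumulated_jnd_deviation(s_d, s_delta, sg, jnd_at):
    """Distance S^D-SG in actual jnd's minus the same distance in average jnd's."""
    total_jnds = jnd_count(s_d, s_delta, jnd_at)
    average_jnd = abs(s_delta - s_d) / total_jnds        # millimicrons per jnd
    in_average_jnds = abs(sg - s_d) / average_jnd
    in_actual_jnds = jnd_count(s_d, sg, jnd_at)
    return in_actual_jnds - in_average_jnds


def hypothetical_jnd(wavelength):
    """Invented jnd size: falls linearly from 4 mμ at 449 to 1 mμ at 509."""
    return 4.0 - 3.0 * (wavelength - 449.0) / 60.0


group_ii = cumulated_jnd_deviation(449, 509, 479, hypothetical_jnd)  # S^D at 449
group_i = cumulated_jnd_deviation(509, 449, 479, hypothetical_jnd)   # S^D at 509
print(group_ii, group_i)
```

With this invented function the measure is negative when one starts from the insensitive end (S^D at 449) and positive, with equal magnitude, when one starts from the sensitive end (S^D at 509). The opposite-signed values reported at 479 mμ for Groups II and I show the same symmetry.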


TABLE 2

CORRELATIONS BETWEEN GENERALIZATION GRADIENT DEVIATIONS AND THE
CUMULATED jnd DEVIATION FUNCTION

Region       | Subjects
Group I:
631-567 mμ   |
509-567 mμ   |
509-449 mμ   |
Group II:
567-631 mμ   |
567-509 mμ   |
449-509 mμ   |

* P < .05.


The Spearman rank correlations for the 6 individual Ss and the
two groups are given in Table 2. The group correlations,
particularly in the 631-567 mμ and 509-449 mμ regions, are large
enough to support the hypothesis that deviations in
generalization gradient and cumulative deviations in jnd rate are
inversely related. The individual rank correlations also
generally support this position, with some individual exceptions
to the rule.
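A Spearman rank correlation of the kind reported in Table 2 can be sketched in Python as follows. This is a minimal illustration: the two series are hypothetical, tied values receive average ranks, and the coefficient is computed as the Pearson correlation of the ranks, which may differ from the hand-calculation formula used at the time.

```python
def average_ranks(values):
    """Rank values from 1 upward, giving tied values the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    start = 0
    while start < len(order):
        end = start
        while end + 1 < len(order) and values[order[end + 1]] == values[order[start]]:
            end += 1
        mean_rank = (start + end) / 2 + 1
        for k in range(start, end + 1):
            ranks[order[k]] = mean_rank
        start = end + 1
    return ranks


def spearman_rho(x, y):
    """Spearman correlation: Pearson correlation computed on the ranks."""
    rx, ry = average_ranks(x), average_ranks(y)
    n = len(rx)
    mean_x, mean_y = sum(rx) / n, sum(ry) / n
    covariance = sum((a - mean_x) * (b - mean_y) for a, b in zip(rx, ry))
    spread_x = sum((a - mean_x) ** 2 for a in rx) ** 0.5
    spread_y = sum((b - mean_y) ** 2 for b in ry) ** 0.5
    return covariance / (spread_x * spread_y)


# Hypothetical deviations at six points between S^D and S^Δ.
jnd_deviation = [0.0, -0.04, -0.09, -0.11, -0.06, -0.02]
gradient_deviation = [0.0, 3.0, 8.0, 10.0, 5.0, 1.0]
rho = spearman_rho(jnd_deviation, gradient_deviation)
print(round(rho, 3))
```

In this invented example the ordering of the two series is exactly reversed, so the coefficient sits at the negative extreme, the direction the hypothesis predicts.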


DISCUSSION


What makes the wave length dis- 
criminability-generalization relationship 
elusive, such that it appears in two stud- 
ies and is totally absent in two others? 
In their discussion, Guttman and Kalish 
(1956) concluded that the action of dif- 
ferential sensitivity on generalization 
was effective in a discriminative situation 
but not in the emission of a simple 
operant. We will now continue this 
discussion in the light of subsequent 
studies.

The first reason we propose why this is so is that without
discrimination training, generalization gradients include more
error variance. This in turn masks the rather subtle differential
effect of the jnd. This additional error variance arises from at
least two sources. One has to do with the inevitable
discrimination training which occurs during the development of a
simple operant. For example, this occurs at a moment when the
pigeon is looking at the side of the box and pecking the wall,
instead of the key. Such discriminations necessarily steepen
gradients. Moreover, this adventitious discrimination training
will necessarily vary from S to S and will set up inter-S
variations in gradients. If within-S gradient comparisons are
made, error variance arises from the course of this adventitious
discrimination training at the successive S^D. The E’s
introduction of discrimination training to a percentage criterion
level has the effect of making such training explicit and uniform
for the stimulus dimension tested. It anchors the slope of the
gradient at two points on the continuum, S^D and S^Δ. A second
source of variability arises from stimulus preferences. For
example, it is well established that the pigeon has a proclivity
for 580 mμ. When tested for generalization, it appears to
generalize strongly into 580 mμ. This is puzzling sometimes
because at 580 mμ the jnd is quite small (high sensitivity). It
appears likely that there is merely more pretraining response
strength at 580 mμ. In discrimination training such preferences
are weakened. They can be isolated from jnd effects by balancing
S^D, S^Δ assignments in two groups.

The second and more fundamental 
reason why the jnd effect on discrimina- 
tion is clear in a discrimination experi- 
ment and not in the generalization of a 
simple operant is that in a discrimination 
experiment we usually present only two 
stimuli at some arbitrary interval while 
in a generalization experiment we cus- 
tomarily present a repeated series at 
equal physical intervals. Repeated series 
of generalization trials were given in the 
Guttman and Kalish (1956) experiment 
(two series of 132 extinction trials) and 
in Blough’s (1961) study (three initial 
series of 66 trials). The generalization 
gradient is probably not invariant across 
changes in the distribution of the gen- 
eralization stimuli: the “frame of refer- 
ence” effect (Humphreys, 1939). When 
SGs are densely distributed, there will 
be less generalization. This follows in 
part from the fact that extinction also 
generalizes and summates. Suppose, as in previous studies, that
the SGs are in uniform steps of 10 mμ, but with jnd’s of varied
sizes. If a series of GSs are presented repetitively under
extinction, extinction will cumulate and generalize from one GS
to another. Where jnd’s are large, a shallow gradient of
generalization is expected. Thus more positive response strength
will generalize from S^D to GS. But more negative response
strength resulting from extinction will also generalize, from one
GS to all the others. If this is the case, extinction effects
generated by equally spaced GSs would act precisely in such a
manner as to cancel the positive differential effects of the jnd.
The final result is that S appears to be generalizing to physical
wave-length distances, rather than to the differential properties
of his receptor system. The result is dependent on E’s choice of
GS spaced at physically equal 10-mμ steps. In discrimination
training, since only two stimuli are presented during each
training sequence, the same cancellation effect should not appear
and the jnd is then seen to be effective.

The balancing of positive and negative 
response strength to cancel out the jnd 
effect was not manifested in the Kalish 
(1958) study because there is no extinc- 
tion, in the usual meaning of the term, 
during psychophysical measurements on 
human Ss, e.g., using the method of 
single stimuli. In the present study, 
this cancellation was not realized, first, 
because the stimuli were not evenly 
spaced, and second, because no series 
longer than four was presented. Thus, 
for different reasons, both studies yielded 
positive results. 


SUMMARY 


An experiment evaluated the widely held view that slope of
generalization gradient and resolution capacity are inversely
related. Four rhesus macaques were trained to emit a simple
operant to a monochromatic hue in the 450-630 mμ range.
Generalization was then measured under extinction across this
range. This gradient was, in all cases, almost horizontal.
Operants discriminated with respect to wave length (S^D and S^Δ
about 60 mμ apart) were then developed in these Ss and 2
additional Ss. Generalization was measured as before but in
nominal 10-mμ steps. Each S was trained to discriminate and
generalize three successive pairs of stimuli. A number of the
gradients revealed differences in slope which were in accordance
with the predicted inverse relationship. Analysis of the
deviations in gradient slope suggested these were related not to
deviations from the average jnd size but, rather, to deviations
from average cumulated jnd’s. A measure of deviations from
average cumulated jnd’s was derived and a high negative
correlation to the gradient deviations was shown to exist. It was
concluded that some infrahuman generalization gradients do
reflect the S^D-SG distance in cumulated jnd’s. Some factors that
may have masked this relationship previously were discussed.


REFERENCES 


BLOUGH, D. S. The shape of some wave-length generalization
gradients. J. exp. Anal. Behav., 1961, 4, 31-40.

BROWN, J. S., BILODEAU, E. A., & BARON, M. R. Bidirectional
gradients in the strength of a generalized voluntary response
to stimuli on a visual-spatial dimension. J. exp. Psychol.,
1951, 41, 52-61.

FERSTER, C. B. Control of behavior of chimpanzees and pigeons by
time out from positive reinforcement. Psychol. Monogr., 1958,
72(8, Whole No. 461).

GANZ, L., & RIESEN, A. H. Stimulus generalization to hue in the
dark-reared macaque. J. comp. physiol. Psychol., 1962, 55,
92-99.

GEWIRTZ, J. L., JONES, L. V., & WAERNERYD, K. Stimulus units and
range of experienced stimuli as determinants of
generalization-discrimination gradients. J. exp. Psychol.,
1956, 52, 51-57.

GRETHER, W. F. Color vision and color blindness in monkeys.
Comp. psychol. Monogr., 1939, 15(4, Whole No. 76).

GUTTMAN, N., & KALISH, H. I. Discriminability and stimulus
generalization. J. exp. Psychol., 1956, 51, 79-88.

HULL, C. L. Principles of behavior. New York: Appleton-Century,
1943.

HUMPHREYS, L. G. Generalization as a function of the method of
reinforcement. J. exp. Psychol., 1939, 25, 361-372.

KALISH, H. I. The relationship between discriminability and
generalization: A re-evaluation. J. exp. Psychol., 1958, 55,
637-644.

LASHLEY, K. S., & WADE, M. The Pavlovian theory of
generalization. Psychol. Rev., 1946, 53, 72-87.

MEDNICK, S. A., & FREEDMAN, J. L. Stimulus generalization.
Psychol. Bull., 1960, 57, 169-200.

PAVLOV, I. P. Conditioned reflexes. (Trans. by G. V. Anrep)
London: Oxford Univer. Press, 1927.

WRIGHT, W. D. Researches on normal and defective colour vision.
St. Louis: C. V. Mosby, 1947.


(Received July 18, 1961) 


Journal of Experimental Psychology 
1962, Vol. 64, No. 2, 151-157 


EFFECT OF PATTERN AND PLEONASM LOCATION IN 
SERIAL LISTS UPON ACQUISITION AND 
SERIAL POSITION ERRORS 1


RONALD L. ERNST, CHARLES P. THOMPSON, AND W. J. BROGDEN


University of Wisconsin 


In a series of studies on verbal maze 
learning (Ernst, Hoffeld, Seidenstein, 
& Brogden, 1960; Namikas, Thomp- 
son, & Brogden, 1960; Thompson, 
Voss, & Brogden, 1957), the form of 
the serial position error curve was 
found to be altered by the location 
of a pleonasm in the standard maze 
pattern. The pleonasms used were a 
doublet (two successive identical 
items), a split-doublet (two identical 
items with a different item between 
them), a triplet (three successive 
identical items), and a quadruplicate 
(four successive identical items). Al- 
though there is no evidence that any 
of the experimental patterns produce 
any difference in acquisition from 
that for the standard pattern, each 
produces a significant, characteristic 
difference in the form of the serial 
position error curve at the locus of 
the pleonasm. Locus of pleonasm is 
significant for certain experimental 
patterns and not for others. The 
pleonasm effect may vary as a func- 
tion of stage of practice and of the 
error measure, whether all errors per
trial or only the first response per 
trial (first errors) are considered. 

Verbal maze learning necessarily 
involves the correction procedure 
and a discovery phase that may be 
distinct from and a prerequisite to 
a later acquisition phase (Melton, 


1 This research was supported in part by 
grants from the National Science Foundation 
and the Research Committee of the Graduate 
School from funds provided by the Wisconsin 
Alumni Research Foundation. 

2 Now at Carnegie Institute of Technology.


1950). If the same list represented 
by a maze is learned by the serial 
anticipation method, the noncorrection
procedure is a necessary component
and the discovery phase is either absent
or of much lesser magnitude. These
differences between methods suggest that
the effects of patterns including pleonasms
will be different. Because the first error
measure of maze learning is comparable
to the error measure of serial antici- 
pation learning and because there is 
little or no discovery phase with serial 
anticipation learning, the effects of 
pleonasms for this procedure are 
likely to be represented by decreases 
in error relative to comparable posi- 
tions for the control list, and acquisi- 
tion of the experimental patterns 
should require fewer trials and show 
fewer total errors than the control 
list. The experiment to be reported 
was designed to test the above 
hypotheses. 
METHOD 


Design.—The experimental design consists 
of four patterns, each with a single pleonasm 
(doublet, split-doublet, triplet, or quadrupli- 
cate) at three locations (early, middle, or late 
in the list) and the control list composed of 
the numbers 10, 20, 30, and 40, each occurring 
four times to provide a length of 16. The 
control list is identical with the control maze 
pattern used in previous studies (Ernst et al., 
1960; Namikas et al., 1960; Thompson et al.,
1957). Each experimental pattern is a limited
modification of the control pattern that
provides for the location of the appropriate
pleonasm without changing the frequency of 
the four numbers, introducing other pleonasms 
elsewhere, or altering the length of the list. 
Each of the 12 experimental patterns was 




permuted to obtain the four sublists so that 
each of the four numbers (10, 20, 30, 40) 
occurs once in each of the 16 positions. 
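The permutation into four sublists can be sketched in Python as follows. This is an illustration under stated assumptions: the paper does not say how the permutation was carried out, so a cyclic relabeling of the four numbers is assumed, and the 16-item `pattern` is a hypothetical list rather than one of the experimental patterns.

```python
NUMBERS = [10, 20, 30, 40]


def sublists(pattern):
    """Relabel the numbers cyclically to produce four sublists of one pattern.

    Across the four sublists, every position holds each number exactly once,
    while the pattern of repetitions is unchanged.
    """
    result = []
    for shift in range(len(NUMBERS)):
        mapping = {
            number: NUMBERS[(index + shift) % len(NUMBERS)]
            for index, number in enumerate(NUMBERS)
        }
        result.append([mapping[item] for item in pattern])
    return result


# Hypothetical 16-item list in which each number occurs four times.
pattern = [10, 30, 20, 40, 30, 10, 40, 20, 20, 40, 10, 30, 40, 20, 30, 10]
four_sublists = sublists(pattern)

for sublist in four_sublists:
    print(sublist)
```

Relabeling keeps the location of any repeated items fixed, so only the particular numbers occupying each position differ among the four sublists.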

Subjects.—The Ss were 312 University of
Wisconsin students of elementary psychology.
There were 24 Ss in each of the 13 groups, and
6 Ss in each group learned one of the four 
subpatterns. Each of 2 Es tested one-half 
of the Ss for each subpattern of each group. 
Assignment of Ss to E and to experimental 
conditions was random within each replica- 
tion. 

Procedure.—The instructions of Thompson,
Voss, and Brogden (1957) were modified
as follows to make them appropriate to serial 
anticipation learning and were read by E 
to each S. 


In this experiment you will be required 
to learn a single list of numbers. These are 
the numbers used: 10, 20, 30, and 40. 
Each number will be used more than once 
in the list. The first trial is a study trial 
and will show you the series of numbers. 
On the next and all subsequent trials you 
will try to anticipate correctly each succes- 
sive number in sequence. For example, 
when the word START is presented on the 
screen, you will try to respond with the 
first number of the series. When the first 
number is on the screen, you will try to
respond with the next number, and so on
to the end of the trial. In other words, you
will try to keep one step ahead of the
projector making only one response per
exposure. Please respond as fast as you can.
When the first trial is completed, there will 
be a short rest before the second trial is 
started. We will continue with similar 
trials until you are able to make one 
repetition of the series without error. Are 
there any questions? 


After answering any questions, the appro- 
priate list was presented to S by means of a 
Dunning Animatic strip film projector. The 
duration of each stimulus item was 2 sec. 
and the items followed each other in sequence 
until the end of the list was reached. The 
intertrial interval was 30 sec. The projection
screen was a 5.5 × 7.5 in. rectangle of frosted
glass set in a 2 × 3 ft. piece of plywood painted
flat black. The screen and projector were 
placed on a table at the screen end of which S 
was seated with an approximate viewing 
distance of 30 in. The stimuli were ł in. high 
at the screen. The procedure described in 
the instructions was maintained until S 
reached the criterion of one errorless trial.
The E recorded the response for each position
of the list on each trial.


R. L. ERNST, C. P. THOMPSON, AND W. J. BROGDEN 


RESULTS


Evaluation of the effects of pattern 
and pleonasm location in the serial list 
upon speed of acquisition was made 
by separate analyses of variance each 
for trials and errors to the criterion 
for each pattern and its pleonasm 
location versus the control list, and 
between all patterns and pleonasm 
location excluding the data for the 
control list. Tests of homogeneity of 
variance show the data to be hetero- 
geneous. Because of this, statistical 
significance for these and all sub- 
sequent analyses was set at the 1% 
level in lieu of the 5% level. Of the 
separate analyses, the source of variation
represented by location of pleonasm
versus the control pattern shows
significant F ratios for the split-doublet
and triplet with the trial
measure and for the split-doublet, 
triplet, and quadruplicate with the 
error measure. In the analyses be- 
tween all patterns and pleonasm 
location with the control data ex- 
cluded, pattern, pleonasm locus, and 
the interaction of pattern and pleo- 
nasm locus are significant for both 
trials and errors. No other source of 
variation is significant in any of the 
above analyses except that for sub- 
pattern (permutation of numbers) in 
the case of the doublet with the trial 
measure, and this is without meaning 
in the experimental design. 

The means represented in the 
analyses are presented in Table 1. 
Range tests (Duncan, 1951) of the 
differences between means for sig- 
nificant sources of variation in the 
analyses of variance give almost 
identical results for the trial and 
error data. For the split-doublet, 
the mean for Locus 1 (early) is 
significantly smaller than the means 
for Locus 2 (middle), Locus 3 (late), 
and the control pattern, which do 
not differ significantly among them- 


A 


PATTERN AND PLEONASM LOCATION IN SERIAL LISTS 


153 


TABLE 1 


Grour MEAN TRIALS AND Errors TO ACQUISITION CRITERION 


Pleonasm Locus 


Overall Mean 


Pleonasm 1 (Early) 2 (Middle) | 3 (Late) 
Trials Errors | Trials | Errors Trials | Errors Trials | Errors 
~ > “| pepa = - 

Doublet 11.83 66.75 12.75 | 68.08 11.38 61.92 | 11.99 | 65.58 
Split-doublet 8.33 47.58 18.42 111.13 19.71 110.17 | 15.65 | 89.63 
Triplet 9.00 40.79 9.04 40.96 7.88 | 42.83 8.64 | 41.53 
Quadruplicate 10.17 44.88 9.67 44.42 11.75 | 59.79 | 10.53 | 49.69 
Overall mean | 9.96 | 50.00 | 1245 | 66.15 | 12.68 | 68.68 


Control group | Mean Trials = 14.88 


Mean Errors = 81.33 


selves. All means of the triplicate 
are significantly smaller than that 
for the control, but are not signifi- 
cantly different from each other. All 
means for the quadruplicate (error 
measure only) are significantly smaller 
than that for the control list, but do 
not differ among themselves. Of the 
overall means for locus regardless of 
pattern, that for Locus 1 is signifi- 
cantly smaller than those for Loci 2 
and 3. Of the overall means for 
pattern regardless of locus, the mean 
for the split-doublet is significantly 
larger than those for all other pat- 
terns, and the mean for the triplet is 
significantly smaller than that for the 
doublet. Of the means representing 
the interaction of pattern and locus 
of pleonasm, those for Loci 2 and 3
for the split-doublet are significantly 
larger than all other means. There 
are no other significant differences 
between the means. 
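The marginal means for the error measure can be recomputed from the cell means of Table 1, as in the sketch below. Only the error cells are used; the dictionary layout and function names are illustrative and are not part of the original analysis.

```python
# Cell means for errors to criterion, copied from Table 1 (Loci 1, 2, 3).
ERRORS = {
    "doublet": [66.75, 68.08, 61.92],
    "split-doublet": [47.58, 111.13, 110.17],
    "triplet": [40.79, 40.96, 42.83],
    "quadruplicate": [44.88, 44.42, 59.79],
}
CONTROL_ERRORS = 81.33


def pattern_means(cells):
    """Mean over loci for each pleonasm (the Overall Mean columns)."""
    return {name: sum(values) / len(values) for name, values in cells.items()}


def locus_means(cells):
    """Mean over pleonasms for each locus (the Overall mean row)."""
    return [sum(column) / len(column) for column in zip(*cells.values())]


by_pattern = pattern_means(ERRORS)
by_locus = locus_means(ERRORS)
slower_than_control = [p for p, m in by_pattern.items() if m > CONTROL_ERRORS]
```

With equal group sizes the unweighted mean of the cell means equals the mean over Ss, which is why the recomputed values agree with the printed margins to within rounding.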

Initial analysis of the form of the 
serial position error curves was ac- 
complished by a separate analysis of 
variance for each kind of pleonasm, 
involving the data for the three loci 
of pleonasm and the control list. In 
each of these four analyses, significant 
F ratios were obtained for serial posi- 
tion, and for the interaction of serial 


position and pleonasm locus. Since 
there are significant differences in 
total errors to the acquisition criterion 
for pleonasm location in the split- 
doublet pattern, and between the 
control and other patterns, the error 
data for each S at each serial position 
were converted to percentage of his 
total errors. Analyses of variance 
completed on the transformed data 
also show uniform results of significant 
F ratios for serial position and the 
interaction of serial position and pleo- 
nasm locus. The differences in the 
form of the serial position error curves 
are shown in Fig. 1 where each experi- 
mental curve is presented in com- 
parison with the curve for the control 
pattern. Although most of the
experimental curves show differences in
form from the control curve at the 
positions of the pleonasm, there also 
are differences between the experi- 
mental curves and the control curve 
at other serial positions. Since these 
latter differences do not appear to 
bear any relationship to any variables 
of pattern, all further analyses of the 
effect of pattern upon the form of the 
serial position error curve are re- 
stricted to those positions at which 
pleonasms are located. 
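The conversion of each S's errors to percentage of his total errors, described above, can be sketched in a few lines. The error counts below are hypothetical.

```python
def percent_of_total(errors_by_position):
    """Convert one S's error counts per serial position to percentages of his total."""
    total = sum(errors_by_position)
    if total == 0:
        raise ValueError("S made no errors; percentages are undefined")
    return [100.0 * e / total for e in errors_by_position]


# Hypothetical error counts for one S at the 16 serial positions.
example_errors = [2, 3, 4, 5, 6, 7, 8, 9, 9, 8, 7, 6, 5, 4, 3, 2]
example_pct = percent_of_total(example_errors)
```

The transformation removes differences in overall error level, so that two Ss with the same distribution of errors over positions receive the same curve regardless of how many errors each made.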

Fig. 1. Serial position error curves in terms of percentage of total errors as a function of pleonasm and pleonasm location. (Ordinate: group mean per cent error to criterial trial; abscissa: serial position.)

The difference in percentage of
total error for each position of the
pleonasm (plus the preceding and 
following position) from the mean 
percentage of total error for the con- 
trol group at the identical serial 
position was computed. Analyses of 
variance of the data for the three 
pleonasm locations for each of the 
four pleonasm types were completed. 
In each of these four analyses serial 
position and the interaction of serial 


position and pleonasm locus are 
significant sources of variation. The 
t test was used to establish fiducial 
limits for each of the four sets of data 
at the 1% level for the means over 
loci, thus providing for assessment of
the serial position effects shown in
the far right column of Fig. 2, and
for the means of the three loci, thus
providing for assessment of the interaction
of serial position and location
shown for each row in the three left
hand columns of Fig. 2. For the data 
over all loci, the effect for the doublet 
is a decrease in error at the second 
position; the split-doublet is an in- 
crease in error at the third position; 
the triplet is a decrease in error at 
the second and third positions; and 
the quadruplicate is a decrease in 
error at all four positions. Range 
tests (Duncan, 1951) show no signifi- 
cant differences in magnitude of 
decrease in error as a function of 
position for either the triplet or the 
quadruplicate. 
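The difference measure used in these analyses (percentage of total error at each pleonasm position, plus the preceding and following position, minus the control mean at the identical serial position) can be sketched as follows. The function names are illustrative, and the clipping at the ends of the list is an assumption.

```python
def pleonasm_window(first_pos, length, list_len=16):
    """1-based serial positions from one before the pleonasm to one after it."""
    start = max(1, first_pos - 1)
    stop = min(list_len, first_pos + length)
    return list(range(start, stop + 1))


def difference_scores(subject_pct, control_mean_pct, positions):
    """Experimental minus control percentage error at the given positions."""
    return {p: subject_pct[p - 1] - control_mean_pct[p - 1] for p in positions}


# Hypothetical data: a flat control curve and an S with fewer errors at Position 8.
control = [6.25] * 16
subject = [6.25] * 16
subject[7] = 2.25
window = pleonasm_window(first_pos=7, length=3)
scores = difference_scores(subject, control, window)
```

A negative score marks a position at which the experimental list produced relatively fewer errors than the control list, and a positive score marks relatively more.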

Since the detailed results of the 
analyses of the interaction of pleo- 
nasm and locus are complex, only the 
general results are noted. The maxi- 
mum effect for all pleonasms is at 
Locus 2. The minimum effect occurs 
at Locus 1 for the triplet and quad- 
ruplicate and at Locus 3 for the 
doublet and split-doublet. There is 
a unique form of the split-doublet at 
each locus, but the other pleonasms 
show consistency in form over loci. 

Fig. 2. Pleonasm effects as a function of locus within the series and position within the pleonasm. (The dotted horizontal lines represent the fiducial limits at the 1% level of confidence. Ordinate: difference in percentage error of experimental minus control; abscissa: serial position within pleonasm.)

The error data were tabulated for
each S for each half of trials to the criterion
for each of the positions of the
pleonasms plus the preceding and following
position, and converted to percentage
of total error. The difference
from the mean of similar measures for 
the control list was obtained and anal- 
yses of variance were conducted sepa- 
rately for each pleonasm. Stage of 
learning is a significant source of varia- 
tion for the triplet, for the interaction 
of stage of learning and pleonasm locus 
for the triplet and quadruplicate, and 
for the triple interaction of stage of 
learning, pleonasm locus, and serial 
position for the split-doublet, triplet, 
and quadruplicate. All of these ef- 
fects indicate fewer errors during 
the second half of trials than during 
the first half except that the triple 
interaction for the split-doublet indi- 
cates an increase in error for the 
second half of trials at the third 
position for Locus 2. The form of the 
curves for the first and second halves 
of trials is substantially the same in 
all cases except for the split-doublet 
at Locus 2. 


DISCUSSION


The hypotheses which the experiment 
was designed to test were confirmed in 
part. The introduction of pleonasms 
either increases speed of acquisition over 
that for the control list or has no effect. 
Increased speed of acquisition occurs 
for the triplet and quadruplicate and 
for one locus of the split-doublet. No 
difference in speed of acquisition occurs
for the doublet and for two loci of the 
split-doublet. Pleonasm locus has a 
differential effect upon the form of the 
serial position error curve in all cases. 
Maximum effect occurs for pleonasm 
locus in the middle of the list. The least 
effect occurs for the triplet and quadru- 
plicate with early locus and for the doub- 
let and split-doublet with late locus. The 
nature of the effect for a given pleonasm 
is similar at all loci, except for the split- 
doublet. The mean effect over all loci 
is a decrease in error at the second posi- 
tion for the doublet; an increase in error 
at the third position for the split-doublet; 
a decrease in error at Positions 2 and 3
for the triplet; an increase in error at
the —1 position and decreases in error at 
Positions 1, 2, 3, and 4 for the quadrupli- 
cate. Stage of learning is not a significant 
factor in the form of the pleonasm effects 
except in the case of the split-doublet. 

In contrasting maze and serial antici- 
pation learning, the control pattern or 
list is acquired with a mean time of 
772.9 sec. and a mean of 110.8 total first 
errors as a maze (Ernst et al., 1960) and 
with a mean time of 476.2 sec. and a 
mean of 81.3 total errors as a serial list. 
Both differences in favor of serial learning 
are statistically significant. These re- 
sults are comparable to those obtained 
by Thompson and Brogden (1958) in 
comparing the correction procedure with 
a modified correction procedure similar 
to the procedure for serial anticipation 
learning. Serial position error curves 
for the control list in percentage of total 
error are presented in Fig. 3 for maze 
and serial learning. The form of these 
curves appears to be substantially the 
same for the two methods of acquisition 
and because of this no statistical analysis 
of difference in form was made. 

Fig. 3. Serial position error curves for the control list acquired by maze and serial anticipation learning. (The all-error and first-error curves for maze learning are from data of Ernst et al., 1960. Ordinate: percentage of total error; abscissa: serial position.)

Fig. 4. Pleonasm effect as a function of serial anticipation learning and of the all-error and first-error measures of maze learning. (The curves of maze learning for the doublet and split-doublet pleonasms are from data of Ernst et al., 1960, and those for the triplet and quadruplicate are from data of Namikas et al., 1960. Ordinate: difference in percentage error of experimental minus control; abscissa: pleonasm and serial position within pleonasm.)

The effect of locus of pleonasm for
maze and serial learning cannot be compared
precisely because different locations
were used in the two sets of experiments.
Comparison of pleonasm effect,
regardless of locus, is possible but only
in terms of the first error measure of
maze learning and the error measure of
serial learning (Thompson & Brogden,
1958). Figure 4 presents curves of percentage
of total error for the appropriate
pattern positions. Curves for the all-error
measure of maze learning are also
included to provide complete presentation
of the pleonasm effects. The form
of the first error curves for maze learning 
and the curves for serial learning are 
strikingly similar. The relative magni- 
tude of the effect is greater for serial 
learning than for maze learning for the 
doublet, triplet, and quadruplicate. This 
difference in magnitude is probably due 
to the discovery phase inherent in the 
maze method which results in an initial 
inhibitory effect of the pleonasm, and 
differences in form as a function of stage 
of practice. 

The split-doublet effect is similar for 
serial learning and both the first-error 
and all-error measures of maze learning. 
It is noteworthy in comparing the split- 
doublet effect to those for other pleo- 
nasms that it consistently occurs as an 
increment in error at the third position. 
The other pleonasms uniformly produce 
decrements in error except for the all- 
error measure for maze learning in the 

case of the doublet and triplet, and the
position preceding the first position of 
the quadruplicate by all procedures. 

In considering the learning of lists 
of items comparable to digits by the 
maze and serial anticipation methods, 
a more rapid acquisition in terms of both 
time and errors should always occur for 
the latter method. The effects of pure 
pleonasms such as doublets, triplets, 
and quadruplicates should be reductions 
in error for learning by noncorrection 
and for first-error measures of learning 
by the correction method. The reduc- 
tions in error should be greater for the 
noncorrection than for the correction 
method and of sufficient magnitude to 
produce more efficient acquisition of the 
series containing a pleonasm than a
random series of the same items. The
discovery phase inherent in correction 
learning is responsible for the difference 
in magnitude just noted and also for 
the error increments produced by pleo- 
nasms when the all-error measure is used. 
Pleonasms represented by the split- 
doublet should have a consistent effect 
of error increment regardless of correction
or noncorrection learning, stage of
practice, or kind of error measure.


SUMMARY 


An experiment tested the effect upon 
acquisition and the form of the serial position 
error curve of type of pleonasm and pleonasm 
locus. The Ss learned by serial anticipation 
a series of 16 numbers, composed of 10, 20, 30, 
and 40 each occurring four times. The control 
list was a random sequence without pleonasms. 
The experimental lists included one pleonasm 
(doublet, split-doublet, triplet, or quad- 
ruplicate) at one of three loci (early, middle, 
or late in the list). Acquisition was consist- 
ently faster for lists with a triplet or quadru- 
plicate than for the control list. The form 
of the serial position error curve was altered 
by each pleonasm and there was significant 
interaction in form of the curve and locus 
of pleonasm. The doublet effect is a decrease 
of error at the second position, the split- 
doublet effect is an increase of error at the 
third position, the triplet effect is a decrease 
of error at the last two positions, and the
quadruplicate effect is an increase of error at the
position just preceding and a decrease of error
at all four positions of the pleonasm. Comparison
of the effects of these pleonasms obtained
in earlier experiments on maze learning
was made with the results of the present 
experiment. 


REFERENCES 


DUNCAN, D. B. A significance test for differences
between ranked treatments in an
analysis of variance. Va. J. Sci., 1951, 2,
171-189.

ERNST, R. L., HOFFELD, D. R., SEIDENSTEIN,
S., & BROGDEN, W. J. Relation of serial
position errors to doublet and split-doublet
location in verbal maze pattern. J. exp.
Psychol., 1960, 59, 94-103.

MELTON, A. W. Learning. In W. S. Monroe
(Ed.), Encyclopedia of educational research.
(Rev. ed.) New York: Macmillan, 1950.
Pp. 668-690.

NAMIKAS, G., THOMPSON, C. P., & BROGDEN,
W. J. Effect of triplet and quadruplicate
location in verbal maze patterns upon
serial position errors. J. exp. Psychol.,
1960, 59, 383-390.

THOMPSON, R. F., & BROGDEN, W. J. Acquisition
of a verbal maze as a function of
method of correction and number of alternate
choices per unit. J. exp. Psychol.,
1958, 56, 501-506.

THOMPSON, R. F., VOSS, J. F., & BROGDEN,
W. J. Effect of pattern variation upon
verbal maze learning. J. exp. Psychol.,
1957, 54, 253-258.


(Received July 19, 1961) 


Journal of Experimental Psychology 
1962, Vol. 64, No. 2, 158-165 


A TEST OF THE ALL-OR-NONE HYPOTHESIS FOR 
VERBAL LEARNING¹


JOANNA P. WILLIAMS²
Yale University 


The view that associations develop 
gradually with repeated pairings of 
the stimulus and response has been 
a basic tenet of most theories of learn- 
ing (Hull, 1943; Spence, 1956; Thorn- 
dike, 1932). Recently the validity of 
this assumption has been questioned 
(Estes, 1960; Rock, 1957); it has been 
suggested instead that in learning 
situations involving simple stimuli and 
responses, acquisition occurs on an 
all-or-none basis. Repetition facili- 
tates learning in such cases simply 
by providing a greater number of 
opportunities for an association to be 
formed. A core assumption of this 
position is that there are no “strength- 
ening” effects of trials previous to 
the one on which the association is 
formed; therefore a response will 
have the same probability of being 
learned on each successive trial. 
From his examination of data from 
the first two trials of a paired-asso- 
ciate experiment, analyzed in such a 
way as to remove artifacts arising 
from averaging, Estes (1960; Estes, 
Hopkins, & Crothers, 1960) concluded 
that the probability of recalling an 
item on a given trial for the first time 
was in fact constant over trials. 

It should be noted that this finding 
is not inconsistent with a strength 
theory, such as that of Hull (1943) or 
Spence (1956). According to such 


¹ This report is based on a dissertation submitted
in partial fulfillment of the requirements
for the PhD degree at Yale University.
The author is indebted to N. E. Miller, chairman,
and E. A. Fleishman and A. R. Wagner,
members of the dissertation committee, for
their generous assistance.

² Now at the University of Pennsylvania.


a theory, associative strength must 
exceed the threshold of recall before 
a correct response is made. The 
easier the item, the fewer the repeti- 
tions required to bring it to threshold. 
Thus, on the first trials of a paired- 
associate list, the easier items will be 
recalled; on later trials, the more 
difficult items, having had the benefit 
of several repetitions, can reach 
threshold. In such a manner, accord- 
ing to the strength position, either 
an increase, a decrease, or no change 
in the mean probability of initial 
recall may be produced over trials 
by the proper selection of items of 
various degrees of difficulty. For an
all-or-none approach, however, item 
heterogeneity could explain a decrease 
in probability over trials (easy items 
are quickly learned and are removed 
from the set of unlearned items so that 
more difficult items are selected as 
trials proceed), but it could not account
for an increase.
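The argument about item heterogeneity can be made concrete with a small calculation. The sketch assumes a pure all-or-none process in which an item with learning probability c, once learned, stays learned; the values of c and the equal weights are hypothetical and are not taken from the experiment.

```python
def initial_recall_probability(cs, weights, n_trials):
    """Mean probability that a not-yet-learned item is learned on each trial.

    cs      -- per-class probability of learning on any one trial (assumed constant)
    weights -- proportion of items in each class
    """
    probs = []
    for n in range(1, n_trials + 1):
        # Proportion of each class still unlearned at the start of trial n.
        unlearned = [w * (1 - c) ** (n - 1) for c, w in zip(cs, weights)]
        # Proportion of each class learned for the first time on trial n.
        newly = [u * c for u, c in zip(unlearned, cs)]
        probs.append(sum(newly) / sum(unlearned))
    return probs


homogeneous = initial_recall_probability([0.3], [1.0], 6)
mixed = initial_recall_probability([0.5, 0.1], [0.5, 0.5], 6)
```

With a single class the probability stays at c on every trial. With a mixture of easy and difficult items it starts at the average of the two values and then falls, because the unlearned set is increasingly made up of difficult items; under these assumptions it never rises.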

Estes has limited his model to the 
one response measure of recall. However,
an all-or-none position might
be expected to have implications for 
other commonly used response meas- 
ures as well. For example, it is likely 
that response latency would be sensi- 
tive to possible changes in response 
strength, especially in later stages of 
learning, when recall is nearly 100%. 
Latency has been used in previous 
studies (e.g., Brown & Huda, 1961; 
Simley, 1933) but has not been
analyzed in a way relevant to the
all-or-none hypothesis.

The present experiment was designed,
using simple paired-associate
learning materials, (a) to test one of
the fundamental assumptions of the 
all-or-none model—that the mean 
probability of initial recall is constant 
—over an extended number of trials, 
and (b) to examine the all-or-none 
position in the light of a continuous 
measure of learning, response latency. 

In addition, the degree to which 
items in such a simple paired-asso- 
ciate situation can be analyzed as 
independent units was investigated. 
All-or-none data might be produced 
if S paid attention to and rehearsed 
only a few items on each presentation 
of the list; this kind of behavior, of 
course, would make a “trial” nonequivalent
for the various items. For
example, once an item is learned, the 
one which precedes it in the list might 
have a higher probability of being 
learned, because the learned item 
would provide a “blank” period in 
which additional rehearsal of the 
previous items could take place. To 
test the possibility that the learning 
of a particular item does influence 
learning of others near it, opportunity 
for rehearsal was varied. Easily 
learned items were introduced which, 
once learned, could serve as “blank” 
periods during which extra rehearsal 
could take place. 

In addition, rate of presentation 
was varied. The relatively long ex- 
posure time used by Rock (1957) 
and Estes (1960) allowed much op- 
portunity for rehearsal of each item; 
a faster rate (more similar to that used 
in most paired-associate experiments), 
by cutting down on the time allowed 
on each item, would be expected to 
change the opportunity for rehearsal. 
Actually, performance under a fast 
presentation rate is more relevant 
to the issue of all-or-none learning, 
since with faster rates there is greater 
control of interitem repetition and 
rehearsal. 


ALL-OR-NONE HYPOTHESIS FOR VERBAL LEARNING 




METHOD 


Subjects.—The Ss were 24 Yale undergraduates,
who participated in the experiment
to fulfill a course requirement. Seventy
similar pretest Ss were used to standardize 
the materials. 

Apparatus.—The material to be learned 
was typed in capital letters on white adding 
machine tape, and was presented to S through 
a 1 × 3 in. aperture in a 12 × 36 in. screen
at rates predetermined by E. The S spoke 
into a small microphone, which he held in his 
hand during the learning trials. A system of 
relays, motors, and timers permitted the 
required variations in presentation time and 
also the recording of response latencies in units 
of .01 sec. The E sat behind the screen, 
hidden from S, recorded S's response on each 
item, and recorded latencies from two 
Standard Electric timers. 

Design.—Each S learned a list of 25 4- 
letter word pairs. All words in the list were 
taken from the Thorndike-Lorge lists of 
the 1,000 most frequently occurring words 
(Thorndike & Lorge, 1944). Choice of the 
items was made on the basis of pretesting: 
lists were learned by a total of 70 pretest Ss, 
and approximately 50% of the pairs, those 
most easily learned, were eliminated. The 
22 remaining pairs (e.g., WALL-CORN, PAST- 
FISH) presumably were more homogeneous 
than the original sample. In addition, three 
pairs (e.g., FAST-SLOW) were also designed to
be easily learned by all Ss. 

The order of the pairs in the list was 
changed from trial to trial as is customary in 
paired-associate learning. However, to facili- 
tate analysis, the easy items always appeared 
in Positions 6, 13, and 20, although any par- 
ticular easy item appeared in different posi- 
tions on different trials. Each of the other 
items remained in the same position relative 
to that of an easy item on all trials. That is, 
the same three items always appeared just 
before an easy item (i.e., at Positions 5, 12, 
or 19), three other items always appeared 
two items in advance (i.e., at Positions 4, 11, 
or 18), etc.; again, any one item appeared at 
all three equally-distant positions (e.g., 5, 12, 
and 19) on different trials. Three different 
serial orders of the pairs were possible within 
these specifications. These orders were 
alternated, so that on consecutive trials, no 
pair was (a) in the same position, or (b) 
adjacent to the same pair as it had been on 
the previous trial. In addition, there were 
four other word pairs in the list, two at the 
beginning, and two at the end. The position 
of these four pairs varied within these four 


160 


end positions, but they never appeared at any 
other point in the list. 

Within the main portion of the list, the 
items were arranged in four different se- 
quences for different Ss. The position in 
which each item appeared was chosen ran- 
domly in two of these four sequences. The 
others were simply the first two in reverse 
order, i.e., the pair which had directly pre- 
ceded an easy pair now followed an easy pair 
and was three items removed from it (thus 
appearing at Positions 9, 16, or 23), etc. This 
was done in order to balance the pairs ap- 
pearing at the various positions in the list 
with respect to item difficulty. The four end 
items were completely different for the four 
sequences, and were chosen randomly from 
the data gathered on the pretest Ss. 

The anticipation method was used, and 
material was presented at two rates. For one 
group, both members of the word pair were 
exposed for 1 sec. (fast rate of presentation);
for the other group, each pair was exposed 
for 4 sec. (slow rate). It was felt that these 
values would ensure the best possible manipu- 
lation of rehearsal time on the items. All 
other values were held constant. The first
word of the pair was presented alone to both
groups for 3 sec., and there was a 20-sec.
intertrial interval.

Twenty-four Ss were assigned randomly 
to one of the two conditions. Three Ss in each 
condition learned each of the four item sequences
described above.

Procedure.—The task was described, and
Ss were told that both their responses and 
the latencies of their responses would be 
recorded. A ready signal was given 2 sec. 
before the start of each of the 24 learning 
trials. After the experiment, Ss were asked 
to indicate the items which they had learned 
through simple memorization, and those for 
which they had utilized a mnemonic aid. 


RESULTS


The 4-sec. group (slow rate) required
significantly fewer trials (M = 8.8,
SD = 2.3) to reach a criterion of at
least 20 out of 25 pairs correct (t = 3.93,
df = 22, P < .01) than did the 1-sec.
group (M = 14.3, SD = 4.3). There
were also reliable differences in the
mean number of items correct during
learning: 4-sec. rate: M = 463.2,
SD = 33.8; 1-sec. rate: M = 383.2,
SD = 74.8 (t = 3.34, df = 22, P < .01).
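The two t ratios can be approximated from the reported means and SDs. The sketch assumes 12 Ss per rate group (24 Ss assigned to two conditions), SDs computed with N - 1, and a pooled-variance t test; none of these computational details is stated in the report.

```python
import math


def t_from_summary(m1, sd1, n1, m2, sd2, n2):
    """Pooled-variance t and df for two independent groups, from summary statistics."""
    df = n1 + n2 - 2
    pooled_var = ((n1 - 1) * sd1 ** 2 + (n2 - 1) * sd2 ** 2) / df
    se = math.sqrt(pooled_var * (1 / n1 + 1 / n2))
    return (m1 - m2) / se, df


# Trials to criterion: 1-sec. group versus 4-sec. group.
t_trials, df_trials = t_from_summary(14.3, 4.3, 12, 8.8, 2.3, 12)
# Items correct during learning: 4-sec. group versus 1-sec. group.
t_items, df_items = t_from_summary(463.2, 33.8, 12, 383.2, 74.8, 12)
```

Under these assumptions both values fall close to the reported 3.93 and 3.34; small discrepancies are to be expected because the means and SDs are rounded.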




While the items selected as “easy” 
were anticipated considerably more 
often (M = 266.0, SD = 8.9) than 
the other items (M =194.6, SD = 33.3) 
for both the 1-sec. and the 4-sec. 
groups (F = 42.37, MS = 619.5, 
df = 1/38, P < .001), there were no 
reliable differences in the learning 
of the other items that could be 
attributed to proximity to (i.e., either 
before or after) the easy items or to 
an interaction of proximity and pres- 
entation rate, either in terms of the 
mean number of errors before the 
learning criterion (20/25) was reached, 
or in terms of the mean number of 
correct anticipations during the first 
six trials. Thus no evidence was 
obtained that would seem to preclude 
treating each item as an independent 
unit for analysis. 

The four additional items placed at 
the ends of the list were not significantly
different in learning rate
(M = 16.2, SD = 3.9) from the regular
items (M = 14.9, SD = 4.1),
F = 1.55, MS = 20.3, df = 1/44. Further
analyses were done (a) including
these four items and (b) excluding 
them. Since the results were identical 
in every case, only those analyses in 
which the end items were included 
are presented. 

The probability of correctly antic- 
ipating a response for the first time 
as a function of trials is shown in Fig. 
1 for the first 10 trials for both rate 
groups. Contrary to one of the major 
assumptions of an all-or-none position, 
these curves show a sizeable increase 
over trials. The obtained linear chi 
squares (Cochran, 1954) were 21.80 
and 41.08 for the 1- and 4-sec. rates, 
respectively (df = 1, P < .001 for
each group). Only those trials on
which the probability is based on more
than 100 observations are included
in the analysis.

At the slow rate of presentation,
about half of the regular items showed
no “breaks,” that is, once correct, 
they were always correct on subse- 
quent trials. However, at the fast 
rate, there were significantly fewer 
(94 out of 260) such break-free items 
(χ² = 17.50, df = 1, P < .001). At
both rates there was a greater number
of break-free items (1-sec. rate: 26
out of 36; χ² = 7.12, df = 1, P < .01;
4-sec. rate: 28 out of 36; χ² = 11.11,
df = 1, P < .01). There were no differences
in the number of mnemonics
reported by the two groups (t = .25,
df = 22), but in both groups items
learned with the aid of a mnemonic 
showed a significantly greater num- 
ber correct over the learning trials 
(M = 17.2, SD = 2.7) than did the 
items memorized (M = 15.1, SD = 4.1),
F = 6.33, MS = S74, df = 1/44, 
P < .025. In addition, there were 
fewer memorized items ($3 out of 94) 
among the break-free items (χ² = 12.7,
df = 1, P < .001), whereas among 
the break items, about half were 
memorized and half learned with the 
aid of a mnemonic. 
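The chi squares reported for 26 and 28 break-free items out of 36 are reproduced by a test against an even split between break-free items and items with breaks, as the sketch below shows. That this was the test used is an inference from the arithmetic. The total of 36 equals 12 Ss times 3 easy pairs, which suggests, although the text does not say so, that these counts refer to the easy items.

```python
def chi_square_even_split(count, n):
    """Chi square (df = 1) for `count` of `n` items against an expected 50-50 split."""
    expected = n / 2
    observed = (count, n - count)
    return sum((o - expected) ** 2 / expected for o in observed)


chi_fast = chi_square_even_split(26, 36)   # 1-sec. rate
chi_slow = chi_square_even_split(28, 36)   # 4-sec. rate
```

Both values exceed 6.63, the tabled chi square for df = 1 at the .01 level.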


Fig. 1. The probability of correctly anticipating an item for the first time, as a function of trials and presentation rate. (The statistical analysis was done on the portion of the curve that is based on more than 100 observations, as indicated. Ordinate: probability of initial recall; abscissa: trials; curves for the 4-sec. and 1-sec. rates, with and without easy items.)

Fig. 2. Mean latency of items on their first correct anticipation, and on successive correct anticipations. (Since there were no differences in latency between the two presentation rates, the data for both groups have been combined. Ordinate: mean latency in sec.; abscissa: successive correct responses.)


There is a similar ordering of items 
in terms of latency. The easy items 
exhibit the lowest initial latencies 
(M = 1.71 sec, SD = .29); and for 
those items which, once correct, are 
correct on all subsequent trials, the 
initial latencies are lower (M = 1.87, 
SD = .20) than for pairs in which 
there are breaks (M = 2.02, SD = .28).³
These differences are significant at 
the .05 level (F = 7.85, MS = 707.54, 
df = 2/66). Correlations between the 
presence of breaks and initial latency 
were calculated: the point biserial r’s 
were .60 and .58 for the 1-sec. and 
4-sec. rates, respectively (df = 10, 
P < .05 for both cases). Moreover, 
there was a tendency, although it 
did not reach conventional levels of 
significance, for items which were 
learned with the aid of a mnemonic 
to show lower latencies (M = 1.58, 
SD = .17) than did those items which 


3 Most of the latency scores fell between 
1 and 2 sec. Because of this, and because of 
the small departure from normality in the 
distribution of scores, transformation of the 
data into reciprocals did not seem warranted.




JOANNA P. WILLIAMS 


TABLE 1 


LATENCY (IN SEC.) OVER BLOCKS OF TRIALS FOR ITEM TYPES AND 
RATE OF PRESENTATION 


Items        Rate      Trials 1-3      Trials 4-6      Trials 7-9
                       Mean    SD      Mean    SD      Mean    SD

Easy         1 sec.    1.57            1.39    .07     1.26    .01
             4 sec.    1.65    .17     1.43    .06     1.27    .01
Break-free   1 sec.    1.73    .12     1.46    .08     1.32    .02
             4 sec.    1.75    .12     1.49    .02     1.42    .03
Break        1 sec.    1.91    .15     1.63    .04     1.52    .06
             4 sec.    1.89    .43     1.71    .06     1.57    .02


were memorized (M = 1.69, SD = .17),
F = stol, df = 1/32, P < .10.

In order to assess changes in the 
strength of association after the point 
of initial recall, latencies of individual 
pairs were examined. Figure 2 pre- 
sents the mean latency of each item 
on the trial on which it was first 
correct, and on each successive correct 
trial. The individual item latencies 
decrease over trials, and the ordering 
of the items as it appeared in the 
initial latencies remains over the 
course of successive correct trials. 
The mean latencies for blocks of 
trials are shown in Table 1, and the 
analysis of variance is presented in 
Table 2, indicating that the main 
effects of blocks of trials and type of 


TABLE 2
ANALYSIS OF VARIANCE OF RESPONSE
LATENCIES

Source                       df       MS         F

Rate of presentation (A)      1      268.00     3.66
Item type (B)                 2    3,551.83    48.44*
Blocks of trials (C)          2    5,907.24    80.57*
A × B                         2        1.62     <1
A × C                         2        8.82     <1
B × C                         4       13.37     <1
A × B × C                     4       33.83     <1
Error                        36       73.32

*P < .001


item (easy, break-free, break) are 
significant, that rate of presentation 
has no significant effect, and that 
there are no significant interactions. 
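
As a quick arithmetic check on Table 2, each F ratio should equal the effect mean square divided by the error mean square (73.32 on 36 df). The sketch below recomputes the ratios from the tabled mean squares; the dictionary layout and function name are illustrative choices, not part of the original analysis.

```python
MS_ERROR = 73.32  # error mean square from Table 2 (36 df)

# Effect mean squares copied from Table 2.
MEAN_SQUARES = {
    "Rate of presentation (A)": 268.00,
    "Item type (B)": 3551.83,
    "Blocks of trials (C)": 5907.24,
    "A x B": 1.62,
    "A x C": 8.82,
    "B x C": 13.37,
    "A x B x C": 33.83,
}


def f_ratio(ms_effect, ms_error=MS_ERROR):
    """F ratio for a fixed-effects source: effect MS over error MS."""
    return ms_effect / ms_error


if __name__ == "__main__":
    for source, ms in MEAN_SQUARES.items():
        print(f"{source:26s} F = {f_ratio(ms):6.2f}")
```

The three main-effect ratios reproduce the tabled values of 3.66, 48.44, and 80.57, and every interaction ratio falls below 1.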

According to Estes (1960), “to 
determine whether the behavioral 
change associated with a decrease 
in latency is learned on an all-or-none 
basis, we would need a similar analysis
[i.e., his probability analysis on
individual items] with some criterion
of change in latency as the dependent
variable” (p. 221). Following this
suggestion, two latency scores, 1.75 
sec. and 1.30 sec., were chosen as 
learning criteria. Figure 3 presents 


FIG. 3. The probability of achieving two
different criterion latencies (1.75 sec. or less,
and 1.30 sec. or less) as a function of trials.
(All points shown are based on more than
100 observations.)
[Figure not reproduced. Ordinate: probability;
abscissa: trials. One curve is shown for each
criterion latency, with easy items included.]


ALL-OR-NONE HYPOTHESIS FOR VERBAL LEARNING 


the probability over trials of achieving 
these latencies for the first time on 
each item. The data from Ss run 
under both rates have been combined, 
since there were no differences in 
latency as a function of rate. With 
these criteria, too, there was a signifi- 
cant increase over trials: the linear 
chi squares were 60.31, df = 1, P < .001,
and 66.15, df = 1, P < .001, for the 
1.75-sec. and the 1.30-sec. criteria, 
respectively. 


DISCUSSION 


The finding that repetition of un- 
learned items does in fact increase the 
probability that they will be “learned”
on a subsequent trial is in direct contradiction
to the all-or-none assumption
that these probabilities are nonincreasing.
It is consistent with a strength position, 
such as that of Hull (1943), indicating 
that some subthreshold learning is taking 
place on items before the point of initial 
recall. The greatest increase occurred 
beyond the first two trials, which were 
the ones presented by Estes. In fact, 
in Estes’ (1960) experiment, the prob- 
ability of initial recall went from .40 on 
the first trial to .46 on the second; perhaps
this trend would have been reliable
if the analysis had included further
trials.⁴

When the curves for the obviously 
heterogeneous items (e.g., easy items
included) are compared with those for 
the more homogeneous items, it can be 
seen that the effect of heterogeneity is 
to counteract the generally rising trend 
and to produce an actual decline between 
Trials 1 and 2. These empirical results 
support the expectation that the effect 
of heterogeneity of item difficulty should 
be to obscure the rise in performance with 


4 This analysis was also done on previously 
published (Williams, 1961) data from 60 Ss 
who learned 12-item lists of letter-number 
pairs by alternate training and testing trials, 
with 5 sec. exposure per item—conditions 
similar to those of Rock (1957). These Ss 
also showed a significant (P < .01) increase 
over trials in the probability of initial recall. 




practice, and thus yield an artifact in 
the direction of results presented to 
support the all-or-none hypothesis. The 
greatest effect of heterogeneity in the 
present experiment is seen in the early 
trials, and it is possible that the two 
opposing factors, heterogeneity of item 
difficulty and repetition, were also oper- 
ating in the Estes situation. Indeed, it 
is difficult to conceive of any verbal 
materials which would be completely 
homogeneous, especially taking into con- 
sideration idiosyncratic sources of diffi- 
culty. Even Rock's (1957) experiment, 
which utilized simple letters and numbers 
as learning material, showed a selection 
bias due to differential ease of learning 
certain of the items (Williams, 1961). 

In this experiment it was impossible 
to identify and exclude items initially 
guessed correctly instead of recalled. 
To the extent that S is able to remember 
which response items he has already 
used correctly and which he has not, 
he will be able to increase the probability 
of a correct guess as the number of un- 
used response terms decreases on later 
trials. This factor would account for 
some increase in the probability of initial 
recall over trials. However, with the 
relatively long list used in the present 
experiment, the maximum possible in- 
crease in guessing efficiency over the 
first few trials is not very large. The 
change in the probability from 25 un- 
learned items (1/25) to 10 unlearned 
items (1/10) is only .06. The observed 
increase in probability during the same 
trials, however, was .22 for the 1-sec. 
and .37 for the 4-sec. rate. Moreover, 
these figures are taken from an examina- 
tion of the break-free items only, which 
eliminates most of the items on which 
the first recall was due to guessing. 
Furthermore, the actual benefit from 
guessing was probably considerably less 
than the allowance made above. The 
Ss made guesses on only .21 and .18
(in the 1-sec. and 4-sec. groups, respec- 
tively) of the failures to respond cor- 
rectly; the remaining failures were omis- 
sions. Also, it is not likely that Ss were 
able to remember all the response terms 




in the list on every trial. For these 
reasons, it is felt that the effects of in- 
creased probability of guessing correctly 
cannot be used to explain the large in- 
crease obtained. 
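
The guessing allowance discussed above can be worked through directly. The sketch assumes, as the text does for its upper bound, that S remembers every unused response term and guesses among them with equal probability; the function name is illustrative.

```python
def chance_of_correct_guess(unused_terms):
    """Probability of guessing correctly when choosing uniformly among
    the response terms not yet used correctly (upper-bound assumption)."""
    return 1 / unused_terms


# Maximum gain from guessing as the pool shrinks from 25 to 10 terms.
guessing_gain = chance_of_correct_guess(10) - chance_of_correct_guess(25)

# Observed increases in probability of initial recall over the same trials.
OBSERVED_GAIN = {"1-sec. rate": 0.22, "4-sec. rate": 0.37}

if __name__ == "__main__":
    print(f"maximum gain from guessing: {guessing_gain:.2f}")
    for rate, gain in OBSERVED_GAIN.items():
        print(f"{rate}: observed {gain:.2f}, unexplained {gain - guessing_gain:.2f}")
```

Even under this generous assumption, guessing accounts for only .06 of the observed increases of .22 and .37.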

As training proceeds, the number of 
as-yet-unlearned items decreases, and it 
might be argued that it is this change 
in the effective length of the list during 
learning which accounts for the rise in 
the probability of correct recall over 
trials. Further analysis of data from a 
previously published experiment (Wil- 
liams, 1961), however, suggests that this 
is not the case. For 10 Ss in that experi- 
ment, the items responded to correctly in 
a 12-item list were eliminated on each trial
and new ones were substituted for them, 
so that on every trial S was presented 
with 12 “unlearned” items. In this 
manner the effective list length was held 
constant. These Ss showed an increase 
in the probability of initial recall over 
trials (P < .001) just as did the Ss run 
in the present experiment. 

The finding that latency decreases as a 
function of practice suggests that after 
the point of initial recall, and after there 
is no further change with respect to 
the recall criterion, a latency measure 
will reflect a still increasing strength of 
association. This of course is not di- 
rectly relevant to the Estes model, which 
is limited to recall, but it would bear 
on the formulation of a more compre- 
hensive theory of all-or-none learning. 

According to the all-or-none position, 
all items which are learned are learned 
to full strength on one trial: “breaks” 
after initial recall are attributable to 
forgetting of a learned item or to the fact 
that the initial recall constituted a 
“guess” rather than a learned response. 
The present data indicate that the items 
which show breaks during learning tend 
to be the ones which have relatively high 
latencies on the trials on which they were 
answered correctly. This suggests that 
all items which are learned to a criterion 
of simple recall are not necessarily equal 
in associative strength—as an all-or- 
none position implies—either in terms of 
subsequent recall of the items, or in 
terms of latency scores.


Recall was greatly influenced by rate 
of presentation. However, the slower 
rate, which produced an increase in the 
probability of recall, did not result in 
decreased latency. Perhaps a change 
in rate has a specific effect on latency 
which counteracts the effects to be ex- 
pected by strength. For example, S 
may develop a set or rhythm based on 
the rate. 

Several operations for influencing
“strength,” namely number of trials, rate of
presentation, and type of item (easy,
break-free, break), were included in this
experiment, and there is also evidence
that there is some influence of the availa- 
bility of a mnemonic aid on strength of 
association. The utility of an interven- 
ing variable such as strength of associa- 
tion depends on demonstrating a rela- 
tionship between more than one manipu- 
lation and more than one measure 
(Miller, 1959). With the one exception 
noted above, excellent agreement was 
found among the several manipulations 
and measures in the present study, sug- 
gesting that response probability and 
response latency are indeed both meas- 
ures of the same intervening variable, 
strength. 


SUMMARY 


Subjects learned a paired-associate list by 
the anticipation method at one of two rates 
of presentation: each pair was exposed for 
either 1 sec. or for 4 sec., and the anticipation 
interval (S term alone) was held constant at 
3 sec. The list was composed of 25 simple 
word pairs. The order in which the pairs 
were presented varied from trial to trial, 
but the list was arranged so that each item, 
though at different positions on consecutive 
trials, always remained the same distance 
before (or after) one of three items designed 
to be easily learned. Recall and latency of 
response were measured. 

1. There were no differences in learning 
rate among the items that could be attributed 
to proximity to the easy items, thus indicating 
that items could be treated as independent 
units for analysis. 

2. The probability of responding correctly 
to an item for the first time increased as a 
function of trials. This was true for both a 


simple recall criterion and for a criterion in 
terms of latency. 




3. Evidence for a selection artifact was 
demonstrated, in that heterogeneity of dif- 
ficulty among the items tended to obscure 
the rise in probability over trials. That is, the 
easy items were learned on the early trials, 
and thus the probabilities on those trials were 
much greater than when only the more ho- 
mogeneous items were included in the analysis. 

4. The latency of individual pairs de- 
creased as a function of successive correct 
responses. 

5. Latency was a function of the type of 
item: the items chosen to be easily learned 
exhibited the lowest latencies; and items 
which, once correct, were correct on all sub- 
sequent trials showed lower latencies than did 
items which contained breaks, i.e., errors 
after the first correct response. 

The results were interpreted as supporting 
a strength theory. 


REFERENCES 


BROWN, J., & HUDA, M. Response latencies
produced by massed and spaced learning
of a paired-associates list. J. exp. Psychol.,
1961, 61, 360-364.

COCHRAN, W. G. Some methods for strengthening
the common χ² tests. Biometrics,
1954, 10, 417-451.

ESTES, W. K. Learning theory and the new
“mental chemistry.” Psychol. Rev., 1960,
67, 207-233.

ESTES, W. K., HOPKINS, B. L., & CROTHERS,
E. J. All-or-none and conservation effects
in the learning and retention of paired
associates. J. exp. Psychol., 1960, 60,
329-339.

HULL, C. L. Principles of behavior. New
York: Appleton-Century, 1943.

MILLER, N. E. Liberalization of basic S-R
concepts: Extensions to conflict behavior,
motivation, and social learning. In S.
Koch (Ed.), Psychology: A study of a science.
Vol. 2. New York: McGraw-Hill, 1959.
Pp. 196-292.

ROCK, I. The role of repetition in associative
learning. Amer. J. Psychol., 1957, 70,
186-193.

SIMLEY, O. A. The relation of subliminal to
supraliminal learning. Arch. Psychol.,
N. Y., 1933, 22, No. 146.

SPENCE, K. W. Behavior theory and conditioning.
New Haven: Yale Univer. Press,
1956.

THORNDIKE, E. L. The fundamentals of
learning. New York: Teachers College,
Columbia University, 1932.

THORNDIKE, E. L., & LORGE, I. The teacher's
word book of 30,000 words. New York:
Teachers College, Columbia University,
1944.

WILLIAMS, J. P. Supplementary report: A
selection artifact in Rock’s study of the
role of repetition. J. exp. Psychol., 1961,
62, 627-628.


(Received July 19, 1961) 


Journal of Experimental Psychology
1962, Vol. 64, No. 2, 166-171


INFLUENCE OF A SMALL NUMBER OF PARTIAL REIN- 
FORCEMENT TRAINING TRIALS ON RESISTANCE 
TO EXTINCTION ¹


E. J. CAPALDI AND DICK HART


University of Texas 


The present experiment, undertaken
within the context provided by
the Hull-Sheffield hypothesis, was concerned
with the effect of a small number
of partial reinforcement training
trials on resistance to extinction as a 
function of pattern of reinforcement. 
According to the generalization decrement
hypothesis (Sheffield, 1949),
extinction necessarily involves the 
introduction of stimuli different from 
those conditioned to the instrumental 
response in acquisition. The hy- 
pothesis also holds that resistance to 
extinction should decrease to the 
extent that the response is inde- 
pendent of the control of such newly 
introduced stimuli. 

The application of the general 
hypothesis to partial reinforcement
involves the assumption that rein- 
forcement or nonreinforcement on a 
particular trial gives rise to distinctive 
stimuli which become part of the 
total stimulus complex on the subsequent
trial. When stimuli characteristic
of nonreinforcement constitute
a portion of the stimulus complex 
on a particular trial and reinforcement 
occurs, S learns to perform the instru- 
mental response in the presence of 
cues characteristic of extinction. Con- 
sistently reinforced Ss are denied the 
opportunity for such conditioning. 
Evidently, then, at the start of ex- 
tinction there is less change in the 
conditioned stimulus pattern for the 


1 This study was supported in part by a 
summer research grant to the senior author 
from the Research Institute, the University 
of Texas. 


partial Ss; accordingly, the consistent 
Ss should extinguish more rapidly. 

The generalization decrement hy- 
pothesis suggests that resistance to 
extinction depends upon pattern of 
reinforcement; our attention is di- 
rected to transitions from nonrein- 
forced to reinforced trials (N-R tran- 
sitions). However, early pattern 
learning experiments appeared not 
to support the Hull-Sheffield view. 
For example, when a moderate number
of training trials were employed,
single alternation of reinforcement 
(SA), which involves the maximum 
number of N-R transitions for a given 
number of training trials, and random 
training (R), which necessarily in- 
volves a smaller number of N-R 
transitions, yielded about the same 
degree of resistance to extinction 
(Capaldi, 1958). Ostensibly even
more damaging to the hypothesis, 
when considerable numbers of train- 
ing trials were employed SA training 
was actually followed by lesser re- 
sistance than R training (Capaldi, 
1958; Tyler, Wortz, & Bitterman, 
1953). It remains to be determined 
whether or not the Hull-Sheffield 
hypothesis is adequate to deal with 
patterning effects when only a small 
number of training trials are employed. 
Accordingly, the following predic- 
tions from the hypothesis were tested 
in the present study: (a) when only a
small number of N-R transitions are 
employed, the typical partial rein- 
forcement effect of increased resist- 
ance to extinction will not occur, 
because the cues characteristic of 




nonreinforcement are but weakly conditioned
to the instrumental response;
(b) when two patterns of partial rein- 
forcement involve the same small 
number of trials and the same number 
of reinforcements and nonreinforce- 
ments, i.e., identical marginal prob- 
abilities, the pattern involving the 
greater number of N-R transitions 
will be followed by the greater 
resistance to extinction. 


EXPERIMENT I 
Method 


Subjects.—The Ss, approximately 90 days 
old at the start of the experiment, were 18 
male and 21 female experimentally naive 
albino rats from the colony maintained by 
the Department of Psychology, the Univer- 
sity of Texas. The males and females were 
distributed equally throughout three groups 
of 13 Ss each by means of a random procedure. 

Apparatus.—A straight-alley runway with
an overall inside length of 74 ft. and width of 
4 in., enclosed by 8-in. high sides, served as 
the apparatus. A depression plate at the 
start end and a photo electric cell at the goal 
end of the alley served to start and stop, 
respectively, an electric timer measuring in 
.01 sec. The distance between the tip of the
depression plate and the light beam was 6 ft.
4 in. A sliding inset with two identical-appearing
compartments large enough for
reward containers, which could be manipu- 
lated appropriately on reward and nonreward 
trials, was situated to the side and at the end 
of the goal portion of the alley. A guillotine 
door 1 ft. from the end of the alley could be 
lowered so as to confine S to the goal box. 
The entire confinement area was covered with 
}-in., hinged hardware cloth. The apparatus 
was constructed of wood and painted gray 
throughout. 

Procedure—On the initial day Ss were 
individually housed and deprived of food for 
23 hr. On all succeeding days of the experiment
Ss were fed for 1 hr. in the home cage.
On Days 2 through 7 Ss were handled in 
groups of 6. On Day 2 food was available 
during the 1-hr. handling period. On each 
successive day feeding time outside the home 
cage was reduced by 15 min. in order to
gradually adjust Ss to the 23-hr. deprivation 
schedule. Immediately following the han- 
dling period Ss were given the 1-hr. daily ration 
in the home cage. On Days 6 and 7 Ss were 
allowed to explore the runway in groups of 2 
each for 30 min., no food being available. 


The Ss were fed for 1 hr. in the home cage 
immediately following exploratory training. 
Since, in the experimental phase proper, 
approximately 15 min. were required to 
administer the daily trials to each S, it can be
seen that by now each S was being fed at 
approximately the time its daily trials were 
due to terminate. 

Beginning on Day 8 nine acquisition trials 
per day for 3 days were given. Group C 
(consistent) was given food reward in the 
form of a wet mash following each run.
Group SA was rewarded on Trials 1, 3, 5, 7,
and 9 of each day. Group R was rewarded 
on Trials 1, 2, 5, 6, and 7 on Day 1 and on 
Trials 1, 2, 3, 6, and 7 on Day 2. The pattern 
on Day 3 was the same as that given on Day 1. 
Thus, Groups SA and R received equal num- 
bers of reinforcements (R) and nonreinforce- 
ments (N). However, while Group SA re- 
ceived four N-R transitions per day, Group R 
received only one per day. Goal-box confine- 
ment was 15 sec. on both N and R trials. 
The intertrial interval was 15 sec. 
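
The daily schedules described above can be tallied with a short sketch. It takes the SA schedule to be reward on every odd-numbered trial of the nine (single alternation beginning with reward) and uses the Group R schedules as listed; the function and variable names are illustrative.

```python
def pattern(rewarded_trials, n_trials=9):
    """Return a list of 'R' (reinforced) / 'N' (nonreinforced), one per trial."""
    return ["R" if t in rewarded_trials else "N" for t in range(1, n_trials + 1)]


def count_nr_transitions(seq):
    """Count nonreinforced trials immediately followed by a reinforced trial."""
    return sum(1 for prev, cur in zip(seq, seq[1:]) if prev == "N" and cur == "R")


SA = pattern({1, 3, 5, 7, 9})        # single alternation, every day
R_DAY1 = pattern({1, 2, 5, 6, 7})    # Group R, Days 1 and 3
R_DAY2 = pattern({1, 2, 3, 6, 7})    # Group R, Day 2

if __name__ == "__main__":
    for name, seq in (("SA", SA), ("R, Day 1", R_DAY1), ("R, Day 2", R_DAY2)):
        print(name, "".join(seq),
              "R =", seq.count("R"), "N =", seq.count("N"),
              "N-R transitions =", count_nr_transitions(seq))
```

Each schedule contains five reinforcements and four nonreinforcements, while the number of N-R transitions is four for SA and one for each Group R day.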

Two days (Days 11 and 12) of extinction 
training were given, 10 trials per day. On the 
initial day of extinction a single reinforced 
trial preceded the 10 extinction trials. The 
length of confinement in the goal box and the 
intertrial interval remained at 15 sec. If S 
did not enter the goal box within 75 sec., it 
was removed from the alley for the 15-sec. 
intertrial interval. Two such consecutive 
failures to respond resulted in discontinuance 
of work with a particular S. An arbitrary 
time of 75 sec. was assigned for the remainder 
of the trials. Each S was fed for 1 hr. in the
home cage immediately following the termi- 
nation of its daily trials. 
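
The scoring rule for extinction trials can be sketched as follows. This is one reading of the procedure, under stated assumptions: a failure is recorded as `None` or as any time of 75 sec. or more, the ceiling is assigned to every trial remaining in the block after two consecutive failures, and the function name and argument layout are hypothetical.

```python
CEILING_SEC = 75.0  # maximum time allowed to enter the goal box


def score_extinction(raw_times, n_trials, ceiling=CEILING_SEC):
    """Apply the 75-sec. ceiling and the two-consecutive-failures rule.

    raw_times: running times in sec. for the trials actually run;
               None marks a trial on which S did not enter the goal box.
    n_trials:  number of trials scheduled in the block being scored.
    """
    scored = []
    consecutive_failures = 0
    for t in raw_times[:n_trials]:
        failed = t is None or t >= ceiling
        scored.append(ceiling if failed else float(t))
        consecutive_failures = consecutive_failures + 1 if failed else 0
        if consecutive_failures == 2:
            # Work with this S is discontinued; remaining trials get the ceiling.
            scored.extend([ceiling] * (n_trials - len(scored)))
            break
    return scored


if __name__ == "__main__":
    print(score_extinction([10, None, 80, 5, 5], n_trials=5))
    print(score_extinction([10, None, 20, None, 30], n_trials=5))
```

Because the ceiling truncates slow runs, groups that stop responding early contribute many identical 75-sec. scores.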


Results 


Acquisition.—A repeated measures
analysis based on the daily median 
for each S (data not shown) indicated 
that the groups did not differ reliably 
(F = 1.39, df = 2/36) in acquisition.
The interaction between groups and 
trials was not significant (F < 1) nor 
did performance over trials reach a 
conventional level of significance 
(F = 3.00, df = 2/27, .05 < P < .10).

A clear understanding of the data 
presented in Table 1 requires con- 
sideration of the following points. 
Consider the SA pattern on nine 
daily trials (R, N, etc.) employed in 
the present experiment. Trials 2, 4, 6, 




and 8 are termed trials following 
reinforcement (TFR) and Trials 3, 5, 
7, and 9, trials following nonrein- 
forcement (TFN). Comparison of 
the performance of Groups SA and C 
employing only Trials 2, 4, 6, and 8 
is said to involve comparable trials 
following reinforcement (CFR). A 
similar comparison employing Trials 
3, 5, 7, and 9 is said to involve com- 
parable trials following nonreinforce- 
ment (CFN). Of course, the expres- 
sion CFN is literally incorrect when 
applied in connection with the C 
pattern; its general meaning is that 
the comparison between the partial 
groups and Group C involves the 
same ordinal trials in the sequence 
with the occurrence of nonreinforce- 
ment in the partial pattern deter- 
mining which trials are to be employed 
for analysis. It should be indicated 
that in computing TFN for Group R 
it was deemed advisable to omit those 
trials which followed a second non- 
reinforced trial, including only those 
which followed a single nonreinforce- 
ment in the sequence. The tendency 
was for Ss to run more slowly following
two consecutive nonreinforcements.
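
The trial classification described above can be expressed as a small sketch. It assumes that a trial after two consecutive nonreinforcements is dropped from TFN altogether, which is how the omission rule for Group R reads; the function name and the string encoding of the patterns are illustrative.

```python
def classify_trials(seq):
    """Split trial numbers into TFR and TFN for a daily pattern.

    seq: string of 'R'/'N', one character per trial (trial 1 first).
    A trial following two consecutive nonreinforcements is omitted,
    so TFN contains only trials following a single nonreinforcement.
    """
    tfr, tfn = [], []
    for i in range(1, len(seq)):
        trial = i + 1
        if seq[i - 1] == "R":
            tfr.append(trial)
        elif i >= 2 and seq[i - 2] == "N":
            continue  # follows a second nonreinforced trial; omitted
        else:
            tfn.append(trial)
    return tfr, tfn


if __name__ == "__main__":
    print("SA      ", classify_trials("RNRNRNRNR"))
    print("R, Day 1", classify_trials("RRNNRRRNN"))
    print("R, Day 2", classify_trials("RRRNNRRNN"))
```

The same trial numbers are then used to pick the comparable trials (CFR and CFN) for Group C.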

Table 1 presents performance on 
CFR and CFN for Groups C and SA
and for Groups C and R. The entries
for CFR and CFN for Group C, which
appear in the table immediately above 
those for Group SA, were determined 
employing Group SA as the basis of 
comparison. Similarly, the CFR and 
CFN entries for Group C which em- 
ployed Group R as the basis of com- 
parison appear immediately above 
those for Group R. It will be noted 
that for Group C daily differences be- 
tween CFR and CFN are small re- 
gardless of whether Group SA or 
Group R was employed as the basis for 
computing these means. In marked 
contrast it can be seen that Groups 
SA and R ran more rapidly on CFR 




TABLE 1

MEANS OF LOG RUNNING TIMES ON 3 DAYS
OF ACQUISITION FOR GROUPS C AND
SA, USING CFR AND CFN, AND
FOR GROUPS C AND R, USING
CFR AND CFN

                    CFR                        CFN
Group     Day 1   Day 2   Day 3      Day 1   Day 2   Day 3

C         1.12     .98     .82       1.08     .94     .82
SA         .83     .64     .56       1.06     .83     .73
C         1.08     .95     .83       1.02     .94     .76
R          .90     .83     .64       1.05    1.22    1.20


than on CFN. Differences between 
CFR and CFN for Group SA were 
significant on each of the 3 days of 
acquisition (Day 1, F = 24.33, P < .01;
Day 2, F = 7.32, P < .05; Day 3,
F = 5.88, P < .05; df = 1/12 in each
case). Group R also ran reliably
faster on CFR as opposed to CFN
on each of the 3 days (Day 1, F = 8.27,
P < .05; Day 2, F = 15.11, P < .01;
Day 3, F = 20.28, P < .01; df = 1/12
in each case).

While Group SA ran more rapidly 
than Group C on CFN and CFR on 
each of the 3 days of acquisition, only 
the differences on CFR reached significance
(Day 1, F = 8.25, P < .01;
Day 2, F = 7.91, P < .01; Day 3,
F = 10.17, P < .01; df = 1/24 in
each case); the differences on CFN 
yielded F <1 in each case. These 
results demonstrate that the faster 
running of partially reinforced as 
opposed to consistently reinforced 
groups is not exclusively a late trial 
phenomenon (e.g., Goodrich, 1959). 
Such faster running appears to occur 
quite early in training; the practice 
of pooling TFR and TFN has ap- 
parently obscured it. 

Extinction.—As Fig. 1 indicates,
the greatest degree of resistance was 
shown by Group SA, the least by 




Group C. A repeated measures analy- 
sis over the initial 10 trials, employing 
the log times on each trial, indicated 
that the groups differed significantly 
(F = 18.06, df = 2/36, P < .01). A 
significant extinction effect occurred 
over trials (F = 30.34, df = 9/324, 
P < .01); however, the interaction 
between Group and Trials was sig- 
nificant at only slightly beyond the 
10% level (F = 1.57, df = 18/324).
A factor operating to reduce the sig- 
nificance level of the interaction was 
the arbitrary ceiling time of 75 sec. 
Many Ss in Group C and quite a few 
in Group R failed to respond within
75 sec., while only 1 S in Group SA
required the full time. Seven Group
C Ss and 3 Group R Ss met the cri- 
terion of extinction on the initial day. 
Differences between the individual 
means were tested using Duncan’s 
multiple range test (Edwards, 1960). 
The test indicated that all the groups 
differed from each other at beyond 
the .01 level. 

A similar repeated measures analy- 
sis over Trials 11-20 of extinction 
indicated that the Group (F = 13.52, 
df = 2/36, P < .01) and Trials
(F = 60.65, df = 9/324, P < .01) differences
were significant as was the
interaction (F = 3.66, df = 18/324, 
P < .01). The significant interaction 
term is not too meaningful over this 


FIG. 1. Mean log running time for each
group on each of the 20 trials of extinction
in Exp. I.
[Figure not reproduced. Abscissa: trials.]


block of trials since it merely serves 
to indicate that the SA group was 
approaching the 75-sec. limit, something
Groups C and R had largely
achieved earlier. Almost all Ss had 
reached the criterion of extinction at 
the end of the second day. Duncan's 
test indicated that Groups SA and C
differed beyond the .01 level while 
the C vs. R and the SA vs. R com- 
parisons were significant beyond the 
.05 level. 


EXPERIMENT II 


The results of the initial experiment 
are consistent with deductions from 
the aftereffects hypothesis. However, 
as an examination of the predictions 
from the hypothesis which were considered
earlier will make clear, the
aftereffects view implies that follow- 
ing SA training the usual partial 
reinforcement effect can be obtained 
while at the same time Group R and 
Group C are about equally resistant 
to extinction. It was this spectrum 
of results that Exp. I was designed 
to obtain. Apparently a wrong 
“guess” was entertained and slightly 
too many N-R transitions were given 
in the case of Group R, thus allowing 
this group to show the usual partial 
reinforcement effect. In order to 
obtain the desired results, the number
of training trials was reduced in Exp. 
II from 27 to 18. The experimental 
procedure was the same as that em- 
ployed in Exp. I except for the 
changes noted below. 


Method 


Subjects.—The 39 naive albino rats were
110 days old at the start of the experiment. 
The 15 males and 24 females were distributed 
equally throughout the three groups by means 
of a random procedure. 

Procedure.—Since only 2 days of acquisition
training were to be given, instead of 3,
1 additional day on the feeding schedule was
employed. The patterns of reinforcement 
employed on Days 1 and 2 of acquisition in 
the preceding experiment were employed




here on those days. On the day following the 
final acquisition trial, all Ss were given a 
single reinforcement followed by 15 extinction 
trials, with no further training being given. 


Results 


Acquisition.—The acquisition results
were highly similar to those
reported in the initial experiment. 
Accordingly, these data are not pre- 
sented here and the results of a par- 
ticular statistical analysis will be 
mentioned only if it possesses special 
relevance or if the significance level 
deviates from that reported earlier.

The repeated measures analysis 
based on all trials yielded an F of 
3.03 for Groups (.05 < P < .10, 
df = 2/36), of 1.20 for Trials (not 
significant for 1/36 df), and of less 
than 1.00 for the interaction between 
Group and Trials. Unlike the results 
of Exp. I, differences between Groups 
SA and C on CFN on Day 2 were 
significant (F = 4.95, df = 1/24, 
P < .05), Group SA running more 
rapidly. From the two analyses 
reported, it can be seen that the dif- 
ferences between Groups SA and C 
were greater in the present experi- 
ment than in the initial one. Differ- 
ences between Group C and Group R 
were of about the same order as 
previously reported.

Extinction.—As Fig. 2 shows, Groups
R and C did not differ appreciably in 
extinction while Group SA appears 
to show the usual partial reinforce- 
ment effect. The log times on each 
of the 15 trials were summed and an 
analysis was performed which yielded 
an F of 9.68, which for 2/36 df is 
significant beyond the .01 level. 
Duncan’s test indicated that in order 
for Groups R and C to differ at the .05 
level the shortest significant range 
required was a value of 2.743. The 
obtained value of 1.180 fell far short 
of significance. Duncan’s test further 
indicated that differences between 


FIG. 2. Mean log running time for each
group on each of the 15 trials of extinction
in Exp. II.
[Figure not reproduced. Abscissa: trials.]


Group SA and Groups R and C were 
significant well beyond the .01 level. 


Discussion 


The major finding of the present
experiments seems to be that when 
relatively few training trials are em- 
ployed SA patterns result in greater 
resistance to extinction than R patterns. 
This result can be obtained when the R 
group is more resistant and when it is 
about equally resistant to extinction as 
compared with Group C. It should be 
noted that current evidence indicates 
that the tendency of SA reinforcement 
to result in equal or greater resistance 
than R patterns occurs prior to the 
appearance of pattern running, i.e., relatively
rapid running on reinforced trials,
relatively slow running on nonreinforced 
ones (Capaldi, 1958). The lesser resistance
following extensive SA as compared
to R training may, therefore, be
related either to learning or to over- 
learning (in the sense of manifesting 
appropriate pattern running) the SA 
pattern of reinforcement (Murillo & 
Capaldi, 1961). 

An SA reinforcement pattern repre- 
sents the extreme case of N-R transi- 
tions. In this sense the present results 
are of the same genre as those reported 
by Grosslight, Hall, and Murnin (1953). 
In that experiment human Ss given N-R 


RESISTANCE TO EXTINCTION 171 


patterns were found to be more resistant 
than those given R-N patterns who were, 
in turn, more resistant than those given 
R-R patterns. These results are similar 
to those of Exp. I of this report. The 
finding reported in Exp. I, that follow-
ing a small number of trials C and R
patterns result in about the same degree 
of resistance, confirms the results of an 
earlier investigation by Amsel (1958) who 
also employed small numbers of trials in 
connection with C and R patterns. 

The present results can be understood 
in terms of the Hull-Sheffield hypothesis. 
As previously indicated, variations in- 
volving SA and R patterns in connection 
with moderate to extensive numbers 
of training trials appear to fail to support 
the generalization decrement view. 
As is usual in cases of this kind, movement
in one of two general directions is per- 
missible. On the one hand, the hypothe- 
sis in question may be abandoned on 
the ground that attempts to modify it 
even if successful may not prove to be 
especially profitable. On the other hand, 
attempts to modify the hypothesis by 
considering alternatives which involve 
logical extensions of it may ultimately 
prove to be a fruitful course of action. 
It is our impression that the latter alter- 
native is deserving of serious considera- 
tion primarily because pattern learning 
experiments involving transfer so clearly 
indicate that the reinforcement outcome 
of the previous trial results in a modifica- 
tion of the stimulus complex on the 
subsequent trial (Bloom & Capaldi, 
1961; Capaldi & Senko, 1962). Accord- 
ingly, it seems reasonable to assume in 
view of the predictive success of the 
hypothesis in connection with small 
numbers of trials that the current in- 
adequacies of the Hull-Sheffield view 
invite nothing so much as more diligent 
theoretical analysis and further experi- 
mental activity. 


SUMMARY 


Two experiments were performed in which 
rats were trained to traverse a straight alley 
under either continuous, irregular, or single 
alternation of reward. In the initial experi- 
ment, which employed 27 training trials, the 


continuous group was found to be least 
resistant to extinction. The single alternation 
group was found to be more resistant than 
the irregular one. Previous experiments have 
shown that following a moderate number of 
training trials the alternation and irregular 
groups are about equally resistant to extinc-
tion, while following considerable training
the alternation group is less resistant than
the irregular one. In the second experiment,
extinction training was given following only
18 training trials. While the irregular and
continuous groups failed to differ in extinction,
the single alternation group showed the
typical partial reinforcement effect. The
findings were discussed in connection with
the Hull-Sheffield aftereffects hypothesis.


REFERENCES 


AMSEL, A. The role of frustrative nonreward
in noncontinuous reward situations. Psy-
chol. Bull., 1958, 55, 102-119.

BLOOM, J., & CAPALDI, E. J. The behavior
of rats in relation to complex patterns of
partial reinforcement. J. comp. physiol.
Psychol., 1961, 54, 261-265.

CAPALDI, E. J. The effect of different amounts
of training on the resistance to extinction
of different patterns of partially reinforced
responses. J. comp. physiol. Psychol., 1958,
51, 367-371.

CAPALDI, E. J., & SENKO, M. G. Acquisition
and transfer in partial reinforcement. J.
exp. Psychol., 1962, 63, 155-159.

EDWARDS, A. L. Experimental design in
psychological research. New
York: Rinehart, 1960.

GOODRICH, K. P. Performance in different
segments of an instrumental response chain
as a function of reinforcement schedule.
J. exp. Psychol., 1959, 57, 57-64.

GROSSLIGHT, J. W., HALL, J. F., & MURNIN,
J. Patterning effect in partial reinforce-
ment. J. exp. Psychol., 1953, 46, 103-106.

MURILLO, N. R., & CAPALDI, E. J. The role
of overlearning trials in determining re-
sistance to extinction. J. exp. Psychol.,
1961, 61, 345-349.

SHEFFIELD, V. F. Extinction as a function
of partial reinforcement and distribution
of practice. J. exp. Psychol., 1949, 39,
511-526.

TYLER, D. W., WORTZ, E. C., & BITTERMAN,
M. E. The effect of random and alternating
partial reinforcement on resistance to ex-
tinction in the rat. Amer. J. Psychol., 1953,
66, 57-65.


(Received July 20, 1961) 


Journal of Experimental Psychology 
1962, Vol. 64, No. 2, 172-176 


CONTIGUOUS CONDITIONING¹


W. J. BROGDEN 


University of Wisconsin 


The status of reinforcement as a 
necessary condition for learning has 
been the focus of much of the con- 
troversy between proponents of dif- 
ferent theories of learning (Kimble, 
1961). It is difficult to take the 
opposing view when the proponent of
reinforcement theory falls back on 
secondary reinforcement. This latter 
phenomenon is of course an example 
of acquisition, of the very thing for 
the occurrence of which reinforcement 
is proposed as an essential condition. 
Nevertheless, it has been argued that 
the evidence contrary to reinforce- 
ment as a necessary condition pro- 
vided by experiments of latent learn- 
ing (Tolman & Honzik, 1930) and 
sensory preconditioning (Brogden,
1939) is not valid because of the 
possible involvement of primary or 
secondary reinforcement factors in 
these phenomena. Contiguous con- 
ditioning appears to be completely 
independent of variables or conditions 
currently related either to primary or 
secondary reinforcement. In the ex-
periment to be reported, contiguous
conditioning is represented by cage-
turning responses of cats in the rotator
to a tone CS. This CR is dependent
upon a prior conditioning procedure 
during which each occurrence of the 
cage-turning response resulted in the 
sounding of the tone. Initial tests 
of the tone prior to conditioning elic- 
ited no cage-turning responses. Thus 
contiguous conditioning appears to 


¹ This research was supported in part by
grants from the National Science Foundation 
and the Research Committee of the Graduate 
School from funds provided by the Wisconsin 
Alumni Research Foundation. 


be dependent solely upon the tem- 
poral contiguity of stimulus and 
response. 

PROCEDURE 


The design of the study to be reported 
includes one experimental treatment and one 
control treatment, with the possibility of a 
second control treatment. It is based upon 
the positive results of five preliminary studies 
of contiguous conditioning in which experi- 
mental conditions were varied unsystemat- 
ically and in which there were no control 
procedures.²

The 23 Ss were kittens, approximately 60 
days of age at the start of the experiment. 
They were obtained as six litters, with each 
litter split between Group E (experimental) 
and Group C (control) but with random 
assignment of individual Ss within litters to 
the two conditions so that N for Group E was 
11 and for Group C was 12. 

The rotator (Brogden & Culler, 1936) 
provided cage-turning as the response-to-be- 
conditioned. The CS was a 1000-cycle pure 
tone whose intensity was 60 db. above .0002 
dyne/cm?. The duration of the tone was 4 
sec. for all conditions and the duration of each 
test period was 10 min.
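
As a worked check on the CS intensity, the stated reference level and the 60-db figure together fix the sound pressure of the tone. The sketch below assumes the usual pressure-ratio definition of the decibel (20 times the common logarithm of the pressure ratio), which the text does not spell out; the function name is illustrative only.

```python
def pressure_from_db(level_db, reference_pressure):
    """Sound pressure for a level given in db re `reference_pressure`.

    Assumes the pressure-ratio definition: level = 20 * log10(p / p_ref).
    """
    return reference_pressure * 10 ** (level_db / 20)

# Reference of .0002 dyne/cm^2 and a 60-db tone, as stated in the text.
tone_pressure = pressure_from_db(60, 0.0002)
print(f"{tone_pressure:.3f} dyne/cm^2")
```

Under that assumption the tone is a thousandfold pressure ratio above the reference, or about 0.2 dyne/cm².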

All Ss were given five procedures in suc- 
cession: (a) Two test periods of adaptation 
to the rotator. (b) One test period during 
which two trials of the tone were given when 
S was quiet to test neutrality of the tone in 
evoking cage-turning responses; none of the 
Ss made responses to these test trials. (c)
Successive test periods until a final test period 
was reached that provided a cumulative total 
of 30 or more cage-turning responses; Group 
E involved presentation of the tone CS with 
each occurrence of the cage-turning response 
and there was no presentation of the tone to 
Group C. (d) Twenty test trials for contig-
uous conditioning, each consisting of presen-
tation of the tone alone when S had been
quiet for 30 sec. or more. (e) Instrumental 
shock-avoidance training to the tone CS at 
the rate of 20 trials per test period until a 
criterion of 18 CRs was attained. The dura- 


² Richard F. Thompson collaborated with
the author in the preliminary studies.




tion of every response during the training and 
test procedures was recorded for each S to 
the nearest .1 sec. by an electronic timer 
operated by E. Groups E and C were treated 
identically except for the conditioning or 
training procedure, during which Group E 
received the tone CS upon making each of 
the 30 or more cage-turning responses whereas 
Group C received no tone. 


RESULTS 


The second control procedure would 
have involved a variation of the train- 
ing procedure to include 30 presenta- 
tions of the tone when S was quiet in 
addition to the occurrence of 30 or 
more cage-turning responses inde- 
pendent of the tone. It was decided 
in advance that this control would 
be completed only if the experimental 
Ss showed greater activity or respon- 
siveness than the control Ss, thus 
indicating a differential effect of the 
tone during the training procedure. 
Therefore initial analyses were made 
of the number of test periods of train- 
ing procedure including the criterial 
period, total responses, and frequency 
of responses per test period. The 
means are, respectively, 3.36 (σm = .54),
35.80 (σm = 1.71), and 15.90 (σm = 3.19)
for Group E and 4.67 (σm = 1.36),
37.82 (σm = 1.87), and 15.91 (σm =
3.31) for Group C. Since not one of
the t values for the three differences
exceeds 1, there is no evidence of sig- 
nificantly greater activity on the part 
of Group E. The second control pro- 
cedure, therefore, was not conducted. 

Comparison was also made of the 
duration of response during the train- 
ing procedure. The mean for Group E 
was 6.74 sec. (σm = .88 sec.) and for
Group C was 4.68 sec. (σm = .52 sec.).
The difference of 2.06 sec. in favor of
a greater duration of response by
Group E has a t value of 2.02 that is
significant at the 10% level but not 
at the 5% level. Because the dura- 
tion of response appears to increase 


as a function of trials, mean duration 
was computed for each successive 
block of 10 trials for the first 30 trials 
for each S. These means are 4.89, 
5.87, and 8.49 sec. for Group E and 
4.07, 4.61, and 4.97 for Group C. In 
separate analyses of variance of 
repeated measures, Trial Blocks has 
an F value significant at the 5% level 
for Group E but not for Group C. 
An overall analysis of variance shows 
Trial Blocks to be a significant source 
of variation, but Groups and the 
interaction of Groups and Trial Blocks 
are not significant. 
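
The t values reported for the training-phase measures can be approximated from the printed means and standard errors alone. The sketch below assumes an unpooled t ratio (the difference between means divided by the square root of the sum of the squared standard errors); the report does not state which formula was used, and the function and dictionary names are illustrative.

```python
import math

def t_from_summary(mean_e, se_e, mean_c, se_c):
    """t ratio for two independent means, given each mean's standard error."""
    return (mean_e - mean_c) / math.sqrt(se_e ** 2 + se_c ** 2)

# (Group E mean, Group E SE, Group C mean, Group C SE), copied from the text.
training_measures = {
    "test periods": (3.36, 0.54, 4.67, 1.36),
    "total responses": (35.80, 1.71, 37.82, 1.87),
    "responses per test period": (15.90, 3.19, 15.91, 3.31),
    "response duration (sec.)": (6.74, 0.88, 4.68, 0.52),
}

t_values = {name: t_from_summary(*vals) for name, vals in training_measures.items()}
for name, t in t_values.items():
    print(f"{name}: t = {t:.2f}")
```

Under this assumption the first three ratios all fall below 1 in absolute value, and the duration ratio comes to about 2.02, in line with the values reported above.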

Because of the progressive increase 
in duration of response over trials, 
and the possibly longer response 
duration of the experimental Ss during 
the training procedure, analyses of 
the responses during the test pro- 
cedure were made. These responses 
are the ones made in between the 20 
test trials of the tone CS. The mean 
frequency of response per test period 
is 14.99 for Group E and 14.94 for 
Group C. The difference is not sig- 
nificant nor are these means signifi- 
cantly different from the comparable 
means of both groups for the training 
procedure. The mean duration of 
response is 5.78 sec. for Group E and 
5.21 sec. for Group C. The difference 
is not significant nor are these means 
significantly different from those of 
either group during the training pro- 
cedure. There are no trend effects 
in response duration for either group 
during the test procedure. 

Of the two measures available to 
provide a test of contiguous condi- 
tioning, the shock-avoidance training 
procedure measures showed no signifi- 
cant difference between groups. The 
test trials of the tone CS, presented 
when S had been quiet for 30 sec. 
or more, do provide evidence of 
contiguous conditioning. These data 
are presented in Table 1. The dif- 




TABLE 1

FREQUENCY OF CR TO TEST TRIALS OF TONE CS

Number of            Number of Animals
Responses       Experimental        Control
    0                 2                10
    1                 3                 0
    2                 2                 0
    3                 0                 1
    4                 1                 1
    6                 1                 0
    9                 1                 0
   12                 1                 0
  Mean        3.46 (σm = 1.19)   0.58 (σm = 0.39)


ference in frequency of CR of 2.01 in 
favor of Group E has a t value of 2.13
and is significant at better than the 
5% level. Analysis of variance of 
these data shows a significant F 
(7.44) for Group E vs. Group C, but 
no significant F values for Litter or 
the interaction of Treatment and 
Litter. Mean duration of CR is 3.94 
sec. for Group E and is 7.24 sec. for 
Group C. The difference is not 
statistically significant. 
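
The group sizes, means, and standard errors in Table 1 can be recomputed from the frequency distribution itself. The following is a minimal sketch, assuming the standard error is the unbiased standard deviation divided by the square root of N; the dictionaries transcribe the table, and the function name is illustrative.

```python
import math

# Number of CRs -> number of animals, transcribed from Table 1.
experimental = {0: 2, 1: 3, 2: 2, 3: 0, 4: 1, 6: 1, 9: 1, 12: 1}
control = {0: 10, 1: 0, 2: 0, 3: 1, 4: 1, 6: 0, 9: 0, 12: 0}

def summarize(freq_table):
    """Return (N, mean, standard error of the mean) for a frequency table."""
    n = sum(freq_table.values())
    mean = sum(score * count for score, count in freq_table.items()) / n
    ss = sum(count * (score - mean) ** 2 for score, count in freq_table.items())
    se = math.sqrt(ss / (n - 1) / n)  # unbiased variance, then SE of the mean
    return n, mean, se

for label, table in (("Group E", experimental), ("Group C", control)):
    n, mean, se = summarize(table)
    print(f"{label}: N = {n}, mean = {mean:.2f}, SE = {se:.2f}")
```

The recomputed Ns are 11 and 12, and the means and standard errors agree with the tabled values to within about .01.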


Discussion 


The significantly greater frequency of 
CR to the test trials of the tone CS by 
the experimental Ss over the control Ss 
is evidence of contiguous conditioning. 
These results coupled with the evidence 
of similarity in frequency and duration 
of extraneous responses during the test- 
ing procedure, establish the validity of 
contiguous conditioning. It can be 
argued that the increase in duration of 
response is also evidence of contiguous 
conditioning. Such an argument re- 
quires the following assumptions: (a) 
that the cage-turning response elicited 
by unknown stimuli is of constant dura- 
tion over training trials as is the case for 
Group C; (b) that a cage-turning CR is 
formed during the course of contiguous 
conditioning training; (c) that the CR 
increases progressively in magnitude as 
training increases; and (d) that the mag- 


nitude of the cage-turning CR is added to 
the magnitude of the cage-turning re-
sponse elicited by unknown stimuli. This
hypothesis is consonant with the experi- 
mental evidence. Other hypotheses are 
not. Facilitating action of the tone to in-
crease the duration of the response progres-
sively is unlikely. Facilitation does not
occur as a trend over trials nor under the 
time relations of the present study. The 
tone occurs after the response has started, 
the delay being equal to the reaction 
time of E. There is little support for the 
hypothesis that the experimental Ss 
learned to “turn on the tone,” since 
there are no group differences during 
the training phase for total responses, 
number of test periods, or frequency of 
response per test period. 

The progressive increase in duration 
of the cage-turning response by the 
experimental Ss during the training 
phase may be evidence of contiguous 
conditioning in addition to that provided 
by the test phase. In any case, it does
not appear to interfere with interpreta- 
tion of the test phase data. Thus, the 
conclusion stands that contiguous condi- 
tioning is a function of the contiguity 
of stimulus (tone) and response (cage- 
turning) during the training procedure. 
It follows that contiguity is a sufficient 
condition of learning. Whether it is a 
necessary condition is another matter. 
In considering contiguity versus rein- 
forcement as necessary conditions of 
learning, it should be noted that any 
experimental test of reinforcement would 
appear to confound the reinforcement 
operation with contiguity of stimulus and 
response. Such confounding does not 
occur in the test of contiguity provided 
by the present experiment. 

In further consideration of contiguous 
conditioning, it should be noted that the 
CR it produces is weak relative to the 
CR of standard conditioning procedures 
involving reinforcement. An important 
factor may be the backward time rela- 
tions inherent in contiguous conditioning 
training. The CS cannot be presented 
until after the response has started. 
Even though there may be discovery of 
training conditions for contiguous condi- 




tioning that provide a greater strength 
of CR, it is unlikely that the CR will be 
anywhere near as strong and stable as 
the CR produced by conditioning pro- 
cedures in which contiguity and rein- 
forcement are confounded. If, however, 
efficiency is considered in terms of num- 
ber of training trials required to produce 
a CR, then the efficiency of contiguous 
conditioning appears to be high. Con- 
tiguous conditioning was obtained with 
as few as 10 training trials in the pre- 
liminary studies. Evidence that sensory 
preconditioning occurs with 1 or 2 trials 
of preconditioning and attains a maxi- 
mum with 4 trials (Hoffeld, Kendall, 
Thompson, & Brogden, 1960) suggests 
a similar relation between amount of 
training and magnitude of contiguous 
conditioning. When contiguity of stimu- 
lus and response are confounded with 
reinforcement operations in the more 
standard conditioning procedures, con- 
tiguity may produce learning in the 
early trials, with reinforcement func- 
tioning in later trials to fixate and 
strengthen the connection established 
by contiguity. 

Contiguous conditioning and sensory 
preconditioning are similar phenomena 
in that learning occurs apparently with- 
out reinforcement. In contiguous condi- 
tioning, the training procedure involves 
a response for which the stimulus is 
unknown and a stimulus for which the 
response is not known. In sensory pre-
conditioning, the training procedure in-
volves two stimuli to each of which the
response is unknown. Since we must
assume the antecedent occurrence of a
response following the presentation of a
stimulus, both the procedures for con-
tiguous conditioning and sensory pre-
conditioning have serious lacks in pre-
cision of experimental manipulation and
control. Speculation beyond an as-
sumed stimulus-response relationship es-
tablished during training and based upon
contiguity, that persists in the organism
to the testing procedure, appears fruitless
until there are experimental operations
to identify and manipulate the presently
unknown variables of the training pro-
cedures.


Perhaps we are asking the wrong ques- 
tions when we concern ourselves with 
what condition or conditions are neces- 
sary and sufficient for learning to occur. 
Another approach to our understanding 
of learning follows from the assumptions 
that learning occurs almost continuously 
in organisms of many species, and that 
in experiments on learning, Ss will always 
learn, but not necessarily in terms of 
the behavior for which the experiment 
was designed. Then the major problem 
of learning becomes the identification 
and attainment of experimental control 
of variables that produce different kinds 
of learned behavior and of variables that 
reduce the probability of learning other 
than that for which the experiment is 
designed. From this point of view 
contiguous conditioning may interfere 
seriously with investigation of the more 
stable varieties of learning. If contig- 
uous conditioning does occur with only 
a few trials in which a given stimulus 
and response are contiguous, the oppor- 
tunity for establishing a number of 
contiguous CRs will occur in virtually 
every experiment on learning. Much 
of the variance in learning experiments 
may be due to the occurrence of con- 
tiguous CRs. If this is so, then the 
primary contribution of contiguous con- 
ditioning to the general study of learning 
will come from discovery of conditions 
that reduce its occurrence, rather than 
in demonstrating that contiguity alone 
is a sufficient condition for learning. 


SUMMARY 


Contiguous conditioning is represented by 
cage-turning CRs of cats to a tone CS. This 
CR is dependent upon a prior conditioning 
procedure during which each occurrence of 
the cage-turning response results in the 
sounding of the tone. Initial tests of the tone 
prior to the conditioning training elicited no 
cage-turning responses. A control group, not 
given the tone CS, made the same number of 
responses in the rotator prior to the test for 
contiguous conditioning that the experimental 
group made during its training procedure. 
Tests with the tone CS presented when the 
S had been quiet for 30 sec. or more were 
given to all Ss. The frequency of CR of the 
experimental group was significantly greater 




than the frequency of response of the control 
group. 

The evidence of contiguous conditioning 
demonstrates that contiguity of stimulus 
and response is a sufficient condition for 
learning. The results are discussed relative 
to reinforcement as a necessary condition of 
learning, to the inherent confounding of any 
reinforcement operation with contiguity of 
stimulus and response, to sensory precondi- 
tioning, and to general theoretical considera- 
tions of learning. 


REFERENCES 


BROGDEN, W. J. Sensory pre-conditioning.
J. exp. Psychol., 1939, 25, 323-332.

BROGDEN, W. J., & CULLER, E. Device for
the motor conditioning of small animals.
Science, 1936, 83, 269-270.

HOFFELD, D. R., KENDALL, S. B., THOMPSON,
R. F., & BROGDEN, W. J. Effect of amount
of preconditioning training upon the mag-
nitude of sensory preconditioning. J. exp.
Psychol., 1960, 59, 198-204.

KIMBLE, G. A. Hilgard and Marquis’ condi-
tioning and learning. New York: Appleton-
Century-Crofts, 1961.

TOLMAN, E. C., & HONZIK, C. H. Introduc-
tion and removal of reward, and maze
performance in rats. U. Calif. Publ. Psy-
chol., 1930, 4, 257-275.


(Received July 21, 1961) 


Journal of Experimental Psychology 
1962, Vol. 64, No. 2, 177-183 


LEARNING OF SIMPLE STRUCTURES¹


GEORGE MANDLER AND PHILIP A. COWAN


University of Toronto 


In the exploration of variables that 
influence human learning, more atten- 
tion has recently been paid to the 
experimental correlates of such con- 
cepts as schemas, strategies, organiza- 
tions, and plans. However, little 
systematic work is available on the 
effect of such structural variables 
on relatively simple human learning. 
One possible approach was recently 
presented by DeSoto (1960) in a 
study ostensibly designed to explore 
social psychological variables. 

DeSoto showed that Ss learned a 
social structure, i.e., correctly identi- 
fied such relations as “x influences y,” 
in fewer trials when the relations 
followed the usually encountered 
properties (such as symmetry, transi- 
tivity, and completeness) of these 
relationships. If these “social rela- 
tions” are learned according to logical 
and quasilogical schemas, similar ef- 
fects should be demonstrable when 
the relations to be learned, and the 
members of the set among which 
the relations operate, are neutral, i.e., 
not influenced by prior social expec-
tations.

Given a set of three elements (A, 
B, and C) and an attribute R which 
may or may not be present with any 
pair of elements, there are 16 possible 
paradigms (or structures) ranging 
from R occurring with no pair to R 
occurring with all of the six possible 
ordered pairs. If Ss are given the 
task to learn or identify those pairs 

1 The research reported in this paper was 
supported in part by Grant APA 37 from 
the National Research Council, Canada. 
The authors would like to thank Cecille Gold 
and David S. Abbey for their advice and 
assistance. 


which do and do not have the R 
attribute, two variables might influ-
ence speed of acquisition: the number
of R (and non-R) attributes in a set 
of six pairs; and the logical and 
quasilogical relations within these sets. 
On the basis of number of attributes 
alone, sets with one R and five non-R 
attributes should be easier to dis- 
criminate than sets with two and four 
or three and three R and non-R 
attributes. However, comparisons of 
different sets with equal number of 
attributes should reveal the operation 
of structural factors. The structure 
of a set describes the relations among 
the three elements. These relations 
are defined by specifying which pairs 
of elements do or do not have the 
attribute, thus involving both order- 
ing and attribution. There are 16 
possible structures under these condi- 
tions; they are shown in Fig. 1. We 
have arbitrarily labeled the three 
elements A, B, and C, and indicated 
the presence of the R attribute by a 
directed arrow; the direction of the 
arrow indicates the ordering of the 
pair.² Non-R attributes are inferred
from the absence of such an arrow, 
and in written description by the 
absence of R. Thus for example, 
Structure 2 may be described as 
ARB, AC, BA, BC, CA, CB; Struc- 
ture 7 as ARB, BRA, BRC, AC, CA, 
CB. We would expect, for example, 


² In terms of graph theory (Harary &
Norman, 1953) this structure may be char- 
acterized as a directed graph of Type 2 with 
three points and with the two relations 
denying each other. A directed graph with 
three points has six possible lines, and in the 
case of the R and non-R relations a definition 
of these six lines defines the graph. 




Fig. 1. Diagrammatic presentation of the
16 structures. (Structures within groups have
the same number of attributes, i.e., checkmarks.
In the diagrams an arrow indicates the
presence of a checkmark for the two elements
ordered by the arrow.) Diagram layout:
Group I, Structure 1; Group II, Structure 2;
Group III, Structures 3, 4, 5, 6; Group IV,
Structures 7, 8, 9, 10; Group V, Structures
11, 12, 13, 14; Group VI, Structure 15;
Group VII, Structure 16.


that a structure where both the AB 
and BA pairs are associated with the 
attribute would be learned faster than 
one where such symmetry is not 
present. 
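
The count of 16 structures, and the way they fall into seven groups by number of R attributes, can be checked by brute-force enumeration. The sketch below assumes that two assignments of R to the six ordered pairs count as the same structure whenever one can be turned into the other by renaming the elements A, B, and C; the text implies this but does not state it, and the function and variable names are illustrative.

```python
from itertools import permutations, product

ELEMENTS = "ABC"
# The six ordered pairs; ("A", "B") stands for the pair AB.
ORDERED_PAIRS = [(x, y) for x in ELEMENTS for y in ELEMENTS if x != y]

def canonical_form(arcs):
    """Smallest relabeling of a set of R pairs under all renamings of A, B, C."""
    forms = []
    for perm in permutations(ELEMENTS):
        rename = dict(zip(ELEMENTS, perm))
        forms.append(tuple(sorted((rename[x], rename[y]) for x, y in arcs)))
    return min(forms)

# Try every way of giving or withholding R for each of the six pairs.
structures = set()
for flags in product([False, True], repeat=len(ORDERED_PAIRS)):
    arcs = [pair for pair, has_r in zip(ORDERED_PAIRS, flags) if has_r]
    structures.add(canonical_form(arcs))

# Tally distinct structures by the number of pairs carrying R.
by_size = {}
for s in structures:
    by_size[len(s)] = by_size.get(len(s), 0) + 1

print(len(structures))
print(sorted(by_size.items()))
```

The 64 raw assignments collapse to 16 structures, with 1, 1, 4, 4, 4, 1, and 1 structures at zero through six R attributes, matching Groups I through VII of Fig. 1.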


METHOD 


Task.—In the present study we investi- 
gated the speed with which Ss learn the pres- 
ence or absence of an attribute (R and non-R) 
in structures consisting of three elements. 
Three CVC syllables were used as the three 
elements and the presence or absence of a 
checkmark to indicate the R and non-R 
attribute. In a trial, Ss were presented with 
six cards. A particular card had on the face 
side one of the six pairs of syllables separated 
by a space, and on the reverse side either a 
checkmark (attribute) or not. The Ss' task
was to learn which cards had checkmarks 
and which did not. 

Each S was given a deck of 30 cards (rep- 
resenting five trials of six pairs on a particular 
structure) and three answer sheets with 30 
spaces on each numbered 1 to 30, 31 to 60, 
61 to 90, respectively, i.e., Trials 1 to 5, 6 to 
10, 11 to 15. The order of syllable pairs with-
in any one trial of 6 cards was randomized
with the restriction that in adjacent trials 
no cards with the same pairs would follow 
each other. The Ss were required to 
look at the face of each card, indicate on the 
answer sheet whether they thought that the 
reverse side had a checkmark on it by either 
placing a checkmark or a straight-line next 
to the appropriate number, then turn the 
card over, look at the reverse, and then 
proceed to the next card. They were paced 
at a speed of 7 sec. per card. 
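
The deck construction described above can be sketched as follows. The sketch reads the ordering restriction as meaning that the first card of a trial may not show the same pair as the last card of the preceding trial; that reading, the reshuffling strategy, the fixed seed, and the function name are all assumptions made for illustration.

```python
import random

def build_deck(pairs, n_trials=5, seed=0):
    """Shuffle the pairs within each trial; reshuffle a trial whenever its
    first card repeats the last card of the preceding trial."""
    rng = random.Random(seed)
    deck = []
    for _ in range(n_trials):
        trial = list(pairs)
        rng.shuffle(trial)
        while deck and trial[0] == deck[-1]:
            rng.shuffle(trial)
        deck.extend(trial)
    return deck

syllables = ["XUR", "ZIC", "GYQ"]  # first syllable set given in the text
pairs = [f"{a} {b}" for a in syllables for b in syllables if a != b]
deck = build_deck(pairs)
print(len(deck))
```

Each block of six cards contains every ordered pair once, so repeats can arise only at trial boundaries, which is where the reshuffle applies.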

Two sets of three CVC syllables were used 
with half of the Ss assigned to each set. The 
CVCs were low-association value syllables 
with the additional restriction that no letter 
of the alphabet was used more than once in a 
set of three syllables. Since no significant 
differences were associated with the two sets 
of syllables, no further reference will be made 
to this aspect of our design. The two sets 
of syllables were: XUR, ZIC, GYQ; and 
KEF, ZUV, QIJ. 

Subjects and instructions.—The Ss were
192 female students in the introductory 
general psychology course at the University 
of Toronto. Twelve Ss were used for each of 
the 16 structures shown in Fig. 1. The Ss 
were seated in a large auditorium and the 192 
packs and answer sheets were randomly 
distributed to Ss. 

The Ss were told what a CVC nonsense 
syllable is and were shown the two sets of three 
syllables. They were told that every card 
would have on its face two of the three syl- 
lables in various combinations. The opera- 
tive part of the instructions read: 


On the back of some cards there may be a
checkmark like this (demonstrated on 
blackboard); on the back of others there 
may not be a checkmark. Your pack may 
have a checkmark on every card, or it may 
have no checkmarks on any card, or your 
pack may contain some cards with check- 
marks on the back, and other cards without 
them. ... You are to find out, or learn, 
which pair or pairs of nonsense syllables 
have checkmarks and which pair or pairs 
of nonsense syllables do not have check- 
marks. ... The cards that have the same 
pair of syllables on the front will appear 
over again throughout the pack. ...
You are to learn which pairs of nonsense 
syllables are associated with a checkmark, 
and which pairs are not. 


The Ss were paced by E telling them to look 
at the front of a card, mark the answer, look 


at the back, look at the next card, and so 
forth. 




Analysis.—Sixteen different structures pre-
sented as six pairs made up from three non- 
sense syllables were learned by 12 Ss for each 
structure. The dependent variable was the
number of correct anticipations of the pres- 
ence or absence of checkmarks. All Ss were 
given 15 trials with the order of cards repeated 
after 5 and 10 trials (30 and 60 cards), respec- 
tively. In a subsequent analysis the number 
of checkmarks per trial was determined 
regardless of the correctness of the response. 


RESULTS 


In discussing the results of the 
acquisition data, two considerations 
should be borne in mind. First, there 
are two independent variables: the 
number of R and non-R attributes in 
a structure, and the comparison 
among structures with the same total 
number of attributes (or checkmarks). 
Second, certain structures are sym-
metrical with respect to the R and non-R
attributes. Thus, Structures 1 and 16
are identical except for the replace-
ment of R with non-R; the same holds 
for Structures 2 and 15, 3 and 11, 4 
and 12, 5 and 13, and 6 and 14. 

Number of attributes.—The predic-
tion was that structures will be learned
in the following order: Groups I and
VII, Groups II and VI, Groups III
and V, and Group IV in descending
order of discriminability of R and 
non-R attributes. The mean num- 
bers of correct responses (out of 90) 
for these four groupings were 87.83, 
77.17, 67.12, 61.12 in the predicted 
order. All differences are significant 
at the .01 level or better. 
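
The predicted ordering follows from the size of the rarer category among the six pairs, and the reported means can be put on a percentage scale for comparison. The sketch below is illustrative only: the group-to-checkmark mapping follows the description of the seven groups, and the function and variable names are assumptions.

```python
# Number of checkmarked pairs (out of six) for Groups I to VII.
checkmarks = {"I": 0, "II": 1, "III": 2, "IV": 3, "V": 4, "VI": 5, "VII": 6}

# Mean correct responses out of 90, keyed by the size of the rarer category.
mean_correct = {0: 87.83, 1: 77.17, 2: 67.12, 3: 61.12}

def minority_count(k, n_pairs=6):
    """Size of the smaller of the two categories (R or non-R) among the pairs."""
    return min(k, n_pairs - k)

for group, k in checkmarks.items():
    m = minority_count(k)
    print(f"Group {group}: rarer category = {m}, "
          f"{100 * mean_correct[m] / 90:.1f}% correct")
```

Groups that mirror each other in R and non-R (I and VII, II and VI, III and V) share a rarer-category size and therefore a predicted rank.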

Structural differences.—The major 
question to be asked about the struc- 
tural variables is whether there are 
significant differences in the acquisi- 
tion of different structures when the 
total number of R and non-R attri- 
butes is held constant. This problem 
is examined by looking at differences 
in acquisition within the groups which 
contain more than one structure 
(Groups III, IV, and V). It will be


recalled that Groups III and V are 
symmetric with reference to R and 
non-R.

Figure 2 shows the acquisition 
curves for all structures. Tables 1 
and 2 show the mean number of 
correct responses per trial for all 16 
structures and the relevant analysis 
of variance. The analysis shows 
highly significant effects for Struc- 
tures and Trials, and no significant 
interaction between these two effects. 
To examine the ordering of structures 
within groups Tukey’s gap test was 
used to segregate scores at the .05 
level of significance. 

For Group III the gap test on the 
scores of Table 1 segregates the struc-
tures into the following clusters,
with > indicating a .05 gap: 3 > 4,
5 > 6. For Group V the ordering is:
11 > 12 > 13, 14. In Group IV the
following clusters were obtained: 8 > 9,
7 > 10. Thus, significant differences
obtain among structures within groups. 

In comparing pairs of structures 
symmetric with respect to check- 
marks and noncheckmarks we note 
that for five such pairs of structures 
(Pairs 2 and 15, 3 and 11, 4 and 12, 
5 and 13, 6 and 14) the structure with 
the fewer checkmarks is learned faster 
and the difference between these 
pairs is significant at the .05 level for 
all except Pair 6 and 14. 

Probability matching.—The results
presented have only considered per- 
formance in terms of number of cor- 
rect responses, with correctness de- 
fined as a response appropriate to 
the information on the face of the 
cards, i.e., the degree to which Ss 
correctly learned the event informa- 
tion provided for them. In contrast 
to this analysis of event matching, 
the data can also be analyzed in terms 
of probability matching, i.e., the degree
to which Ss’ behavior conforms to
the frequency or percentage of responses
(checkmark or noncheckmark) required in
the task without any attention to the
“correctness” of their responses. The
data are combined for each of the seven
groups, which vary from zero to six in
the number of checkmarks required in
each trial of six cards. Figure 3
shows frequency of checkmark or
noncheckmark responses for all seven
groups. It is obvious that even from
the first block of three trials Ss’
behavior follows the probability structure,
and by Block 5 (Trials 5-7),
mean performance is at or near the
asymptote expected from sheer probability
matching behavior.

Fig. 2. Acquisition curves for all 16 structures, showing percentage correct responses as a function of blocks of three trials in moving averages.
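The distinction between the two scoring methods can be made concrete with a small sketch. The trial shown is hypothetical and the function names are illustrative only; nothing here is taken from the original scoring sheets. Event matching counts card-by-card agreement with the required response, while probability matching counts only how many checkmarks were emitted.

```python
from typing import Sequence


def event_matching_score(responses: Sequence[bool], required: Sequence[bool]) -> int:
    """Number of cards on which the response equals the required response."""
    return sum(r == q for r, q in zip(responses, required))


def probability_matching_score(responses: Sequence[bool]) -> int:
    """Number of checkmark responses emitted, regardless of correctness."""
    return sum(responses)


# Hypothetical trial: True = checkmark, False = noncheckmark.
# Two checkmarks are required among the six cards.
required = [True, True, False, False, False, False]
responses = [False, True, True, False, False, False]

correct = event_matching_score(responses, required)
emitted = probability_matching_score(responses)
```

In this invented trial S emits the required number of checkmarks (perfect probability matching) while misplacing one of them, so only four of the six cards are scored correct under event matching.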

When probability matching (fre- 
quency only) and event matching 
(correct responses) are compared for 




TABLE 1

MEAN NUMBER CORRECT PER TRIAL OF SIX RESPONSES

Structure    Mean Number Correct
   16              5.86
    1              5.85
    3              5.37
    2              5.26
   15              5.03
   11              5.00
    4              4.54
    5              4.46
    8              4.30
   12              4.27
   13              4.13
    9              4.04
    6              4.03
   14              4.02
    7              4.02
   10              3.82

Note.—Lines between structures indicate significant differences at the .05 level by Tukey's test for gaps and stragglers.


Groups II to VI (for Groups I and 
VII, probability and event matching 
curves are, of course, identical) it is 
evident that even when event matching
proceeds slowly, probability matching
reaches asymptotic or near asymptotic
behavior early in the task. If
the groups symmetrical with respect
to R and non-R relations are examined,
it appears that probability
matching is even more pronounced in 


TABLE 2

ANALYSIS OF VARIANCE OF NUMBER CORRECT PER TRIAL

Source of Variance       df       MS        F
Between Ss
  Structures (S)         15     82.62    11.36*
  [one row illegible in source]
Within Ss
  Trials (T)             14     69.95    75.22*
  T X S                 210      1.13     1.22
  Error                2464       .93

* P < .001.


Fig. 3. Mean number of responses per trial as a function of moving blocks of three trials. (The more frequent response is reported, i.e., checkmarks for Groups V, VI, and VII and noncheckmarks for Groups I, II, III, and IV. Responses were scored regardless of whether they were correct or not.)


those cases where event matching 
proceeds more slowly. Thus, event 
matching in Structure 2 is faster than 
in Structure 15, yet probability match- 
ing is at 90% of asymptote by Block 2 
in Structure 15 (Group VI), while it 
does not reach that level until Block 4 
for Structure 2 (Group II). Similarly,
Group V reaches 90% of asymp-
tote by Block 2, while Group III does 
not reach that level until Block 7. 
In all groups other than Group III,
however, the level of probability
matching reaches 90% of asymptote
by Block 4, i.e., after 6 trials or 36
presentations of individual cards.
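The block numbering used above (Block 5 covering Trials 5-7, Block 4 ending after 6 trials) implies overlapping blocks of three trials, so that 15 trials yield 13 blocks. A minimal sketch of that bookkeeping follows, assuming this reading of the block convention; the per-trial values are invented for illustration and are not data from the study.

```python
from typing import List, Optional, Sequence


def moving_blocks(per_trial: Sequence[float], width: int = 3) -> List[float]:
    """Mean of each overlapping block; block k covers trials k .. k + width - 1."""
    return [
        sum(per_trial[i:i + width]) / width
        for i in range(len(per_trial) - width + 1)
    ]


def first_block_reaching(blocks: Sequence[float], asymptote: float,
                         fraction: float = 0.9) -> Optional[int]:
    """1-based number of the first block at or above fraction * asymptote."""
    for number, value in enumerate(blocks, start=1):
        if value >= fraction * asymptote:
            return number
    return None


# Hypothetical group: 4 of the more frequent response are required per trial
# (asymptote = 4.0); values are mean responses per trial over 15 trials.
per_trial = [3.0, 3.2, 3.4, 3.5, 3.7, 3.8, 3.9, 4.0,
             4.0, 4.0, 4.0, 4.0, 4.0, 4.0, 4.0]
blocks = moving_blocks(per_trial)
block_at_90_percent = first_block_reaching(blocks, asymptote=4.0)
```

With these invented values the criterion of 90% of asymptote (3.6 responses) is first met in Block 4, that is, in the block made up of Trials 4, 5, and 6.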


DISCUSSION


The between-groups comparisons— 
differentiated by absolute number of R 
and non-R attributes—are clear-cut and 
predictable from a mere consideration 




of discriminability. Obviously, a task 
which requires S to assign one check- 
mark vs. five noncheckmarks to six 
stimuli (pairs) is easier than one which 
requires discrimination of two and four, 
or three and three checkmarks and 
noncheckmarks. The number of stimuli 
to be discriminated increases and so does 
the difficulty of the task. 

In accounting for the effects within 
groups, i.e., differences among structures 
of equal discriminability, we distin- 
guished among four relations that may 
be differentiated within any one struc- 
ture. These were symmetry (if ARB 
then also BRA), transitivity (if ARB and 
BRC, then also ARC), common origin 
(if ARB then also ARC), and common 
goal (if ARB then also CRB). The 
assumption is made that it is this kind 
of plausible logical and quasilogical 
reasoning (or use of transformation 
rules) that facilitates learning of these 
structures. In Table 3 are presented 
the number of these relations found with- 
in Structures 3 to 14, i.e., all those where 
there is more than one structure within 
a group. All relations, whether defined 
by the presence or the absence of the 
checkmark attribute, are included. For 
example, Structure 3 (cf. Fig. 1) has 
three symmetry relations: AB and BA 
(no checkmark), BRC and CRB (check- 
mark), AC and CA (no checkmark). 
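The four relations can be tallied mechanically for a three-element structure. The sketch below is one possible counting convention, offered as an assumption rather than as the authors' procedure: a pair, origin, goal, or triple is counted whenever all of its links share the same status, whether checkmark or no checkmark. The checkmarks assigned to Structure 3 follow the example in the text (BRC and CRB); the three-element cycle is a hypothetical structure added only for contrast.

```python
from itertools import permutations
from typing import Dict, FrozenSet, Tuple

ELEMENTS = ("A", "B", "C")
Pair = Tuple[str, str]


def status(checked: FrozenSet[Pair], x: str, y: str) -> bool:
    """True if the ordered pair x -> y carries a checkmark."""
    return (x, y) in checked


def count_relations(checked: FrozenSet[Pair]) -> Dict[str, int]:
    """Count relations defined by presence or by absence of the checkmark."""
    symmetry = sum(
        1
        for i, x in enumerate(ELEMENTS)
        for y in ELEMENTS[i + 1:]
        if status(checked, x, y) == status(checked, y, x)
    )
    transitivity = sum(
        1
        for x, y, z in permutations(ELEMENTS, 3)
        if status(checked, x, y) == status(checked, y, z) == status(checked, x, z)
    )
    common_origin = sum(
        1
        for x in ELEMENTS
        if len({status(checked, x, y) for y in ELEMENTS if y != x}) == 1
    )
    common_goal = sum(
        1
        for y in ELEMENTS
        if len({status(checked, x, y) for x in ELEMENTS if x != y}) == 1
    )
    return {
        "symmetry": symmetry,
        "transitivity": transitivity,
        "common_origin": common_origin,
        "common_goal": common_goal,
    }


# Structure 3 as described in the text: checkmarks on BRC and CRB only.
structure_3 = frozenset({("B", "C"), ("C", "B")})
# Hypothetical cycle (A -> B -> C -> A), used purely as an illustration.
cycle = frozenset({("A", "B"), ("B", "C"), ("C", "A")})

counts_3 = count_relations(structure_3)
counts_cycle = count_relations(cycle)
```

Under this convention Structure 3 yields three symmetry relations, as stated in the text, and the cycle yields none of the four relations.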

TABLE 3

NUMBER AND TYPE OF RELATIONS IN DIFFERENT STRUCTURES

                               Structures
Relations        3 & 11  4 & 12  5 & 13  6 & 14    7    8    9   10
Symmetry            3       1       ?       1      2    2    0    0
Transitivity        ?       ?       ?       ?      ?    ?    ?    0
Common origin       1       3       1       1      2    0    2    0
Common goal         1       1       3       1      0    2    2    0

Note. Entries marked ? are illegible in the source.

The order in which structures within
groups are learned as a function of number
and type of relations may now be
considered. For Groups III and V the
order of acquisition of structures complementary
with respect to R and non-R
is identical, i.e., 3, 4, 5, and 6 in III;
and 11, 12, 13, and 14 in V. From Table
3 it appears that symmetry is, and could 
reasonably be expected to be, the most 
powerful single relation. Structures 3 
and 11 are learned most quickly and 
significantly better than the others. In 
ordering the remaining three structures 
in each of these two groups, Structures 
6 and 14 may be assigned the lowest 
rank because of the fewer total number 
of relations appearing in them (only 
four), while the ordering of 4 and 12, 
and 5 and 13 could only be assigned to a 
more powerful influence of common 
origin than common goal. 

Considering the structures in Group 
IV from this point of view, it is quite 
reasonable for Structure 10 to show the 
slowest learning rate (having none of our 
relations embedded in it). However, 
the poor showing of Structure 7 does not 
simply fit this schema. Obviously, 
much further evidence will have to be 
adduced—probably with structures in- 
volving more than three elements—in 
order to arrive at a metric of structural 
relations. 

The appearance of probability match- 
ing in a task ostensibly designed for 
other purposes we find most interesting. 
The Ss, given an event matching task 
of moderate difficulty, respond to the 
probability structure of responses sur- 
prisingly quickly (cf. Grant, Hake, & 
Hornseth, 1951) and without necessary 
reference to the event structure. To 
what extent this behavior is a function 
of hypothesis formation in the Ss is 
difficult to estimate. The parallel be- 
tween probability matching in our situa- 
tion, with its low payoff, and probability 
matching in a two-choice situation (as 
against maximizing) seems obvious and 
deserves further examination. 

Finally, it is of note that probability
matching apparently is most marked
when event matching proceeds most
slowly. In the comparison between
Groups III and V, for example, the fact
that Group III learns more quickly may
be due to the stress in the instructions
on the detection of checkmarks; how-
ever, probability matching cannot easily
be assigned to this variable. The general
proposition that as event discrimination
becomes easier, rate of probability matching
decreases and vice versa, deserves
exploration.

SUMMARY 


acquisition of structures, based apparently 
on discriminability of the checkmark-non- 
checkmark dichotomy. (b) Within three 
groups of structures with the same number of 


checkmarks or noncheckmarks, logical struc- 
ture showed significant effects on acquisition. 
(c) The probability structure of the required 
responses (checkmarks) showed a striking 
effect on Ss’ behavior. The Ss exhibited 
probability matching, i.e., emission of the re- 
quired percentage of responses, in the 
absence of event matching, i.e., correct 
response to the stimulus information. 


REFERENCES 


DeSoto, C. B. Learning a social structure. 
J. abnorm. soc. Psychol., 1960, 60, 417-421.

Grant, D. A., Hake, H. W., & Hornseth,
J. P. Acquisition and extinction of a verbal 
conditioned response with different per- 
centages of reinforcement. J. exp. Psychol., 
1951, 42, 1-5. 

Harary, F., & Norman, R. Z. Graph theory 
as a mathematical model in social science. 
Ann Arbor: University of Michigan Insti- 
tute for Social Research, 1953. 


(Received July 21, 1961) 


Journal of Experimental Psychology 
1962, Vol. 64, No. 2, 184-191


SOME PROPERTIES OF SACCHARIN AS A REINFORCER¹


GEORGE COLLIER 


University of Missouri 


Saccharin appears to have only one 
property in common with the sugars: 
it tastes sweet. It differs from sucrose 
both in its other biological properties 
and in its physical properties. Sac- 
charin differs from sucrose in that it 
is not a sugar, it is nonnutritive (75- 
90% eliminated within 24 hr. in urine 
and the remainder in the feces, Carl- 
son, Eldridge, Martin, & Foran, 1923), 
it is hypotonic² at all concentrations
examined, Table 1 (approximately 
3.7% by weight would be isotonic), 
and it has a substantially lower prefer- 
ence threshold (of the order of .01 F, 
Stellar, 1960). Saccharin elicits sali- 
vary, gastric, and intestinal secretions, 
but to a lesser degree than sugar. It 
decreases intestinal absorption of
water in proportion to its concentra- 
tion by some nonosmotic mechanism. 
It appears in the blood, lymph, 
cerebrospinal fluid, tears, and mam- 
mary secretions following its ingestion 
in proportion to its concentration 
(Carlson et al., 1923). Saccharin 
intake does not cause a compensatory
decrease in food intake on an ad lib.
feeding schedule as does sugar (Haus-
mann, 1933), nor does it affect weight,
mortality, or state of organs (Fitzhugh,
Nelson, & Frawley, 1951), with the
possible exception that at high con-
centrations it may result in a slight
weight loss when fed in daily diet
(Thompson & Mayer, 1959). These
differences and similarities provide
a means of examining the possible
mechanisms involved in food rein-
forcement. The present study ex-
plores saccharin as a reinforcer, com-
paring the functions obtained with
those obtained from sucrose.

¹ This research was supported by Research Grant M-3328 from the National Institutes of Health, Bethesda, Maryland. Experiments 1, 3, and 4 were partially reported in a symposium paper entitled “Interaction of factors governing amount of reinforcement function,” Midwestern Psychological Association, Detroit, May 1958.

² The extracellular fluid concentration is of the order of 310 milliosmols for the rat.
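The statement that roughly 3.7% by weight would be isotonic can be checked with a back-of-envelope estimate. The sketch below rests on assumptions that are not stated in the article: that the soluble saccharin is sodium saccharin dihydrate (molar mass taken as 241.19 g/mol), that it dissociates completely into two particles, and that 1% by weight corresponds to about 10 g per liter. It is not the calculation used for the article's own calculated values and should not be expected to reproduce them exactly.

```python
# Assumed constants (not given in the article).
ASSUMED_MOLAR_MASS_G_PER_MOL = 241.19   # sodium saccharin dihydrate
ASSUMED_PARTICLES_PER_FORMULA_UNIT = 2  # complete dissociation
RAT_EXTRACELLULAR_MILLIOSMOLS = 310.0   # order of magnitude cited in the text


def estimated_milliosmols(percent_by_weight: float) -> float:
    """Ideal-solution estimate of osmolarity for a given % concentration."""
    grams_per_liter = percent_by_weight * 10.0  # assumes density near 1 g/ml
    moles_per_liter = grams_per_liter / ASSUMED_MOLAR_MASS_G_PER_MOL
    return moles_per_liter * ASSUMED_PARTICLES_PER_FORMULA_UNIT * 1000.0


estimates = {pct: estimated_milliosmols(pct) for pct in (0.1, 0.3, 0.9, 2.7, 3.7)}
all_tested_hypotonic = all(
    estimates[pct] < RAT_EXTRACELLULAR_MILLIOSMOLS for pct in (0.1, 0.3, 0.9, 2.7)
)
```

Under these assumptions every concentration used falls well below the 310-milliosmol level, and the estimate for 3.7% lands close to it, which is consistent with the claim in the text.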


METHOD 


Apparatus.—Eight Skinner boxes (Collier
& Myers, 1961) delivering liquid reinforce- 
ments were used. The solutions were pre- 
pared from soluble saccharin (Merck). 

Subjects.—The 20 Ss of Exp. 1, the 24 Ss
of Exp. 3, and the 12 Ss of Exp. 4 were 120- 
150-day-old female rats. They had been 
used in a previous experiment in the same 
apparatus with sucrose solutions. Equal 
numbers of Ss from each of the three con- 
centrations in the preceding experiment were 
assigned to the conditions in these experi- 
ments. They were maintained on a 23-hr. 
food privation schedule. 

The 64 Ss of Exp. 2 were 90-day-old naive 
female rats. They were maintained on a 10- 
gm. per 24-hr. food privation schedule. The 
daily ration was placed in S’s cage immediately
after running.

The 16 Ss of Exp. 5 were 180-day-old male 
rats. They had been used previously in a 
water reinforcement experiment. In the
“thirsty” portion of the experiment Ss were
maintained on 1 hr. water, immediately fol-
lowing a session, and free food. In the
“hungry” portion of the experiment Ss were
maintained on a 10-gm. per 24-hr. schedule
and free water.

TABLE 1

OSMOLARITY OF SACCHARIN CONCENTRATIONS USED

% Concentration    Osmolarity a (Milliosmols)    Calculated Values
      .1               [illegible]                      8.2
      .3                   27                          24.8
      .9                   77                          75.4
     2.7                  225                         229.6

a These values were obtained by freezing point determination.

All rats were of the Sprague-Dawley strain 
(Holtzman Company), and maintained on 
Purina chow and tap water. 

Procedure.—In Exp. 1, four groups of 5 Ss 
were run, one at each of four concentrations 
(.1, .3, .9, and 2.7%) on a 1-min. FI schedule
for nine 30-min. sessions. Each reinforce-
ment delivered .03 ml. Following a 3-day 
break, the same groups were run at the same 
concentrations for 7 days on a 4-min. FI 
schedule. 

Experiment 2 is a partial replication of 
Exp. 1 with order controlled. Four concentrations
(.1, .3, .9, and 2.7%) were com-
bined factorially with two fixed intervals 
between reinforcements, and two orders of 
presentation of the intervals (1 min.-4 min. 
and 4 min.-1 min.). Each S underwent both 
intervals, spending 6 consecutive days on 
each. A reinforcement delivered .1 ml. of 
solution. Sessions were of 20 min. duration. 
Two replications of 32 rats each were run. 
All Ss trained for 8 days on the saccharin 
concentration used subsequently. Some 
difficulty was experienced on training Ss to 
respond to the magazine at the higher con- 
centrations. Those Ss lost were replaced. 

In Exp. 3 two intervals, 1 and 4 min., and 
3 concentrations (.1, .3, and .9%) were com- 
bined factorially with 4 Ss assigned to each 
combination. The first 9 days of BP were 
for .1 ml. of saccharin solution per rein- 
forcement, the next 5 days of BP were for 
.3 ml. per reinforcement, and the final 9 days 
for .03 ml. per reinforcement. Sessions were 
30 min. in duration. 

In Exp. 4 three groups of 4 Ss each were 
run, one at each of the three concentrations, 
.3, .9, and 2.7%. A 1-min. FI schedule was
used. The sessions were 1 hr. in length. In 
the first six sessions, .03 ml. reinforcement was 
used, followed by a 4-day break, 4 days on .1 
ml. reinforcement, a 4-day break, and then 
finally 4 days on .3 ml. reinforcement. 

Experiment 5 is a partial replication of 
Exp. 4 with the effect of order controlled. 
Two concentrations (.1 and 2.5%) were 
combined factorially with four volumes per 
reinforcement (.04, .08, .16, and .32 ml.) and
four orders of presentation of the volumes. 
Each S spent 8 consecutive days at each 
volume. A 1-min. FI schedule and a 20-min. 
session were used. During the first cycle of 
32 days, Ss were thirsty. During the second 
cycle of 32 days, Ss were hungry. 
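The arithmetic of the Exp. 5 design (four volumes at 8 days each making one 32-day cycle) can be laid out explicitly. The article does not say which four orders of presentation were used; the cyclic rotation below is a hypothetical choice that places every volume once in every ordinal position, and is shown only to illustrate how such a schedule might be built.

```python
from typing import List, Sequence, Tuple

VOLUMES_ML = (0.04, 0.08, 0.16, 0.32)
DAYS_PER_VOLUME = 8


def cyclic_orders(items: Sequence[float]) -> List[Tuple[float, ...]]:
    """Hypothetical set of orders: each rotation of the item list."""
    n = len(items)
    return [tuple(items[(start + k) % n] for k in range(n)) for start in range(n)]


def daily_schedule(order: Sequence[float],
                   days_per_item: int = DAYS_PER_VOLUME) -> List[float]:
    """Volume per reinforcement in force on each day of one cycle."""
    return [volume for volume in order for _ in range(days_per_item)]


orders = cyclic_orders(VOLUMES_ML)
schedule = daily_schedule(orders[1])  # the order beginning with .08 ml.
```

Each of the four orders produces a 32-day cycle, matching the cycle length given in the text; the same cycle would then be run once under thirst and once under hunger.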

In Exp. 6, four concentrations (.1, .3, .9,
and 2.7% saccharin) were combined factori-
ally with two levels of deprivation. The high 
deprivation group received 8 gm. of Purina 


chow and the low deprivation group 16 gm. 
immediately following the experimental session.
A 1-min. FI schedule was used for the
eight 20-min. sessions. 


RESULTS 


The main results of Exp. 1 are 
presented in Fig. 1. The total num- 
ber of bar pressing responses (BP) 
averaged over the last 2 days on each 
interval is plotted against log con- 
centration for each FI interval. Anal- 
ysis of variance of these data for 
Concentration, Interval, and Ses- 
sions showed the main effects of 
Concentration and Interval significant 
(Concentration: F = 6.12, df = 3/16, P < .01;
Interval: F = 49.55, df = 1/16, P < .01). On the final
day of the 1-min. and 4-min. condi- 
tions 97% of the rewards possible were 
received and 100% of those received 
were consumed. Thus, all 1-min. 
groups and all 4-min. groups con- 
sumed equal volumes of solution. 
When the total BPs per session are 
compared with those from the previ- 
ous sucrose experiment in which Ss 
served, it is apparent that there was 
a large, significant decline from the 
rate for comparable sugar solutions
to a lower terminal rate for saccharin.
Experiment 1, using .03 ml. per reinforcement,
shows that rate of BP was
an increasing function of concentration
up to at least 2.7%, nine times
the highest reported free ingestion
preference value, on a 4-min. FI
schedule and an increasing function
of concentration up to at least .9%
on a 1-min. FI schedule.

Fig. 1. The number of BPs/session as a function of concentration and interreinforcement interval in Exp. 1.

Fig. 2. The number of BPs as a function of concentration and interval in Exp. 2.

TABLE 2

ANALYSIS OF VARIANCE OF THE BP DATA OF EXP. 2

Source                     df        MS          F
Between Ss                 63     10,936.7
  Conc. (C)                 3      3,359.5      3.05*
  Replication (R)           1      1,265.6      1.14
  R X C                     3        421.2       .38
  I X O (b)                 1         12.1       .01
  I X O X C (b)             3        469.0       .42
  I X O X R (b)             1        150.2       .13
  I X O X R X C (b)         3        617.9       .56
  Error b                  48
Within Ss                 576        136.0
  Interval (I)              1     26,368.2     94.91**
  Order (O)                 1      2,600.2      9.34**
  Minutes (M)               4      2,164.9     47.33**
  I X C                     3         89.6       .32
  O X C                     3        965.9      3.47*
  I X R                     1      1,351.4      4.85*
  C X R X O                 3        772.8      2.77*
  C X R X I X O X M (b)    12         19.1       .51
  Error w                 432         67.7
    e1                     48        278.1
    e2                    192         45.7
    e3                    192         37.1

* P < .05.
** P < .01.

The basic results of Exp. 2 are 
presented in Fig. 2. An analysis of 
variance of these data for the average 
of the last 2 days of each order, 
grouped into 4-min. totals, is pre- 
sented in Table 2. The rate of re- 
sponding was an increasing function 
of concentration at both intervals 
between reinforcements. The rate 
of responding within a session de- 
clined significantly, the level of re- 
sponding in the final 4 min. being 
about 68% of that of the initial 4 min. 
However, in Fig. 3 it is apparent that 
there were no differences in rate of 
decline over a 20-min. session. 
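The F ratios in Table 2 can be approximately recovered from the tabled mean squares. The sketch below assumes a particular pairing of effects with error terms (the first within-Ss error term for Interval and Order, the second for Minutes); that pairing is inferred from the arithmetic and is not stated in the article. Small discrepancies from the printed F values are to be expected, since the mean squares are rounded.

```python
def f_ratio(ms_effect: float, ms_error: float) -> float:
    """F is the effect mean square divided by its error mean square."""
    return ms_effect / ms_error


# Mean squares as read from Table 2; the effect-to-error pairing is assumed.
MS_ERROR_1 = 278.1  # assumed error term for Interval and Order
MS_ERROR_2 = 45.7   # assumed error term for Minutes

f_interval = f_ratio(26368.2, MS_ERROR_1)  # printed value: 94.91
f_order = f_ratio(2600.2, MS_ERROR_1)      # printed value: 9.34
f_minutes = f_ratio(2164.9, MS_ERROR_2)    # printed value: 47.33
```

Agreement within about a tenth of a unit for all three ratios lends some support to the assumed pairing, although it does not establish it.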

The major results of Exp. 3 are 
presented in Fig. 4. Rate of respond- 
ing was an increasing function over 
the range of concentrations examined 
for small volumes per reinforcement, 
and was an increasing then decreasing 
function of concentration for large 
volumes. At the larger volumes the 
highest rates were obtained for the
longer intervals. The difference in
level of responding between these
results, for .1 ml., and those of Exp. 2
is probably the result of the fact that
Ss in Exp. 2 were trained directly on
saccharin.

Fig. 3. The number of BPs in each 4-min. period as a function of concentration in Exp. 2.

Fig. 4. The number of BPs as a function of volume per reinforcement, concentration, and interval between reinforcements in Exp. 3.

The main results of Exp. 4 are 
presented in Fig. 5, in which the cumu- 
lative number of responses per minute 
for each combination of volume and 
concentration averaged over the last 
two sessions in each cycle is plotted 
against time. An analysis of variance
over Concentration, Volume, and Minutes
is presented in Table 3.

TABLE 3

ANALYSIS OF VARIANCE OF THE BP DATA OF EXP. 4

Source            df        MS         F
Between Ss        11     325.09
  Conc. (C)        2      21.02       .05
  Error b          9     392.67
Within Ss       2148       8.02
  Volume (V)       2     836.98     10.12**
  Minutes (M)     59      31.30      5.69**
  V X M          118       5.16       .81
  C X V            4     292.64      3.54*
  C X M          118       6.46      1.17
  C X V X M      236       5.46       .86
  Error w       1611
    e1            18      82.63
    e2           531       5.50
    e3          1062       6.37

* P < .05.
** P < .01.

An examination of Fig. 5 shows 
that at small volumes rate of BP is an 
increasing function of concentration, 
at large volumes it is a decreasing 
function, and that the rate of decline 
in responding is a function of concentration
but independent of the volume.

Fig. 5. The cumulative number of BPs and the number in the first 5 min. as a function of volume and concentration of reinforcement in Exp. 4.


TABLE 4

NUMBER OF REINFORCEMENTS RECEIVED (#R), PERCENTAGE CONSUMED (% C), AND VOLUME OF SOLUTION AND AMOUNT OF SOLUTE CONSUMED IN EXP. 4

                   .3 Concentration           .9 Concentration           2.7 Concentration
Vol. (ml.)     #R a  % C b  Vol.   Gm.     #R a  % C b  Vol.   Gm.     #R a  % C b  Vol.   Gm.
   .03          58    100    1.7   .005     60    100    1.8   .016     59    100    1.8   .048
   .1           52    100    5.2   .016     60    100    6.0   .054     60    100    6.0   .162
   .3           56     99   16.6   .048     44     93   12.4   .111     33     72    7.2   .194

a 60 possible.
b Percentage of reinforcements consumed when magazine operated.


This latter finding has been reported 
for sucrose (Collier & Myers, 1961) 
and for salt (Stellar, Hyman, & 
Samet, 1954). Total BPs per session 
do not show the decline observed in 
Exp. 1. With the longer sessions 
(60 min. vs. 30 min.) the decline takes 
place within sessions rather than 
between. 

Table 4 shows the number of the 
60 possible reinforcements received, 
the percentage taken, and the volume 
of solution and grams of solute 
consumed. 
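The volume and solute columns of Table 4 appear to follow directly from the number of reinforcements received, the percentage consumed, the volume per reinforcement, and the concentration. The sketch below states that relation under the assumption that 1 ml. of solution weighs about 1 gm.; the function names are illustrative and the relation is inferred from the tabled values rather than described in the article.

```python
def volume_consumed_ml(n_received: int, percent_consumed: float,
                       ml_per_reinforcement: float) -> float:
    """Volume of solution actually taken in a session."""
    return n_received * (percent_consumed / 100.0) * ml_per_reinforcement


def solute_consumed_gm(volume_ml: float, percent_concentration: float) -> float:
    """Grams of saccharin in that volume (assumes 1 ml. weighs about 1 gm.)."""
    return volume_ml * percent_concentration / 100.0


# Row for .1 ml. per reinforcement at the .9% concentration:
# 60 reinforcements received, 100% consumed.
volume = volume_consumed_ml(60, 100.0, 0.1)
solute = solute_consumed_gm(volume, 0.9)

# Row for .3 ml. per reinforcement at the 2.7% concentration:
# 33 received, 72% consumed; tabled entries are rounded.
volume_high = volume_consumed_ml(33, 72.0, 0.3)
solute_high = solute_consumed_gm(volume_high, 2.7)
```

The first row reproduces the tabled 6.0 ml. and .054 gm.; the second comes out slightly below the tabled 7.2 ml. and .194 gm., presumably because the printed percentage is rounded.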

The major data of Exp. 5 are presented
in Fig. 6. An analysis of the
data of the final 2 days at each condition
for Concentration, Volume, Deprivation,
and Order is presented in
Table 5. When Ss were hungry, rate
of responding was an increasing
function of volume at the low (.1%)
concentration and an increasing then
decreasing function of volume at the
high (2.5%) concentration. When
Ss were thirsty, number of responses
was an increasing then decreasing
function of volume at both concentrations
and a decreasing function
of concentration at all volumes.
These volume per reinforcement curves
for saccharin are similar to those obtained
with water from a comparable
group of Ss run under the same conditions
(Manaster, 1962). The addition
of a “small” amount of saccharin appears
to lead to an increase in rate
of responding above that for water
while the addition of a “large”
amount leads to a decrease.

Fig. 6. The number of BPs as a function of concentration, volume per reinforcement, and kind of deprivation in Exp. 5.

TABLE 5

ANALYSIS OF VARIANCE OF THE BP DATA OF EXP. 5

Source                   df         MS           F
Between Ss               15      23,853.52
  C                       1      66,703.78      5.11
  V X O (b)               3      29,792.73      2.28
  V X O X C (b)           3      32,438.05      2.49
  Error b                 8      13,050.84
Within Ss               112      16,285.29
  V                       3      47,811.59      7.96**
  O                       3       3,268.89       .54
  D                       1     536,130.12     22.64**
  V X O (w)               6       7,867.00      1.31
  V X D                   3      11,656.07      3.06*
  D X C                   1     195,781.53      8.27*
  O X V X C (w) X D       6       5,645.44      1.48
  Error w                56       7,588.42
    e1                   24       6,004.84
    e2                    8      23,677.34
    e3                   24       3,809.01

Within a session, both initial rate 
and the rate of decline were affected 
by concentration and deprivation. 
Hungry Ss showed no noticeable de- 
cline within sessions at any combina- 
tion of volume and concentration 
with the exception of the .04 ml.-
.1% group. The differences in total
number of bar presses were reflected 
in the initial rates, that is, the non- 
monotonicity of the rate-volume curve 
at the high concentration and the 
nonmonotonicity of the rate-concen- 
tration curves at the large volumes 
could not be attributed to postinges- 
tive effects. On the other hand, when 
Ss were thirsty, substantial within- 
session declines occurred, particularly 
at the high concentration. Here it 
is obvious that some postingestive 
effect was operative over the 20 min. 
of the session which was dependent 
upon the concentration of the load 
and apparently independent of the 
volume of the load. 

The Ss gained weight across cycles 
under the thirst schedule and lost 
weight across cycles under the hunger 
schedule. The average weight for 
the thirsty Ss was 355 gm.; for the 
hungry Ss it was 289 gm. 

The major data of Exp. 6 are pre- 
sented in Fig. 7. It shows that both 
the slope of the rate-concentration 
function and the rate of within-session 
decline were affected by deprivation, 
the steepest slope occurring for the 
8-gm. group, which averaged 199 gm. 
in weight, and the greatest within- 
session decline for the 16-gm. group 
which averaged 270 gm. in weight. 


Fig. 7. The cumulative number of BPs as a function of concentration and degree of food deprivation in Exp. 6.


DISCUSSION


Saccharin resembles sucrose in that 
similar functions are obtained when the 
parameters of reinforcement and depri- 
vation are manipulated and the course of 
responding within a session examined. 
It is clear from these data that the 
supposition that there are three inde- 
pendent loci of the events governing 
the rate of responding—the proximal 
reinforcing stimuli, the momentary post- 
ingestive load, and the nutritive condition 
of the animal—is correct for saccharin as 
well as for sucrose. 

Initial rates of responding for saccharin 
prove to be functions of the same dimen- 
sions of the proximal reinforcing stimulus 
as for sucrose, namely concentration, 
volume per reinforcement, and interval 
between reinforcements. No combina- 
tion of these variables proves to be addi- 
tive, both the slope and the maximum 
of any one function being themselves 
functions of the other two variables. 
Saccharin differs from sucrose in that 
it is responded to at much lower concen- 
trations; it produces a lower maximum 
rate; and the increment in log concen- 
tration necessary to produce an equal 
increment in rate of responding is larger 
(cf. Collier & Myers, 1961). This latter 
contrasts with the parallel rate-log 
concentration functions obtained when 
similar sugars are compared (Guttman, 
1954). Volume per reinforcement ap- 
pears to have two effects on initial rate 
of responding for saccharin. In the 
lower range of concentrations increased 




volume increases the slope of the rate- 
concentration function. This may merely 
represent the reduction of the effect 
of dilution by saliva or it may be an 
example of the classic intensity-area 
relation. In the upper range of concentra-
tion, increased volume per reinforcement 
for both sucrose and saccharin lowers 
the point at which the inversion in the 
rate-intensity relation occurs. If the 
assumption (Collier & Myers, 1961) is 
correct that amount of reinforcement 
is an increasing function of the intensity 
and volume of the proximal reinforcing 
stimulus, then some other processes 
must intervene at these values. Two 
nonexclusive possibilities are that (a) 
the quality of the stimulus changes 
(e.g., from sweet to bitter) at the large 
volume high concentration combinations, 
and (b) that there are unconditioned 
withdrawal responses to intense stimuli 
which compete with the reinforced 
response. Some indication for the latter 
is given by the fact that the latency of 
the magazine response is longer at very 
high concentrations (Collier, 1959). 

As in the preceding experiments (e.g., 
Collier & Myers, 1961), two sorts of 
within-session decrements are found, 
those occurring at minimal reinforce- 
ment values and those occurring for 
combinations of volume, interval, and 
concentration which result in concen- 
trated postingestive loads. Numerous 
authors have attributed this latter shut- 
off to the increase in the osmotic pressure 
of the gastric load. Amount consumed 
as a function of concentration studies 
typically show a peak intake at approxi- 
mately the point of isotonicity while 
loading studies have typically shown an 
increasing depression of intake as a 
function of hypertonicity. However, 
the present results show the rate of 
shutoff as an increasing function of con- 
centrations, all of which were hypotonic 
(Table 1), and two volume-consumed 
studies show peak intakes at approxi- 
mately .24% for a 1-hr. session (Stellar, 
1960) and .44% for a 55-min. session 
(Cockrell, 1952), which are considerably 
below the point of isotonicity.³ Thus,
we have a postingestive shutoff effect
in the hungry S which is proportional to
concentration but is not due to the
hypertonicity of the load. Similarly,
for the thirsty Ss we find a higher rate
of responding and a lower rate of decline
for the low concentrations than for
water, and a lower rate and a faster rate
of decline for the high concentration,
which is again still hypotonic. These
results lead to a suggestion either of
some peculiar property of saccharin or of
a rejection of a simple osmotic explana-
tion of satiation. Some support for the
former alternative is given by Carlson
et al. (1923), who report that saccharin
delays intestinal absorption by some
mechanism other than the osmotic factor.
The results further suggest that there
are two shutoff mechanisms, one for
hunger and one for thirst, which respond
differentially to the same load.

³ Interpolated from the curves presented.

The effects of nutritive condition on 
the rate of responding for saccharin 
are similar to those for sucrose (Collier 
& Willis, 1961). Higher rates of respond- 
ing and steeper slopes of the rate-log 
concentration function are obtained at 
higher deprivations. These results, in 
the light of the noncaloric character of 
saccharin and its sustained consumption 
over long periods of time (e.g., Hillix, 
1958), support the view that deprivation 
may exercise its effects on rate of re- 
sponding independently of the post- 
ingestive consequences of the reinforce- 
ment. However, it should be noted that 
saccharin simulates sugars and other 
nutrient materials in some of its non- 
nutritive postingestive consequences, in 
that it elicits similar gastric reflexes, etc. 
and that a large part of it is absorbed 
before it is eliminated (Carlson et al., 
1923). 

When the kind (e.g., thirst vs. hunger) 
rather than the degree of deprivation 
is varied the rate-intensity relation is 
again affected. For thirsty Ss the rate- 
intensity relation is essentially flat over 
the lower range of concentrations and 
then decreases for higher concentrations. 
The hypothesis that body concentration 
is being defended by means of an osmo- 
receptor at the gustatory level is not 




supported in the present study since 
all values of saccharin used were hypo- 
tonic. This interaction of taste and 
kind of deprivation again suggests that 
deprivation may in part exercise its 
effect on rate of responding independ- 
ently of the postingestive consequences 
of the reinforcement. 


SUMMARY


The relations between concentration, vol- 
ume per reinforcement, interval between 
reinforcement, degree and kind of depriva- 
tion, and the rate of responding for saccharin 
solutions in the Skinner box were explored. 

For hungry Ss initial rate of responding was
an increasing function of concentration and 
volume, and a decreasing function of interval. 
No combination of these variables proved to 
be additive. The slope of the initial rate vs. 
log concentration function was an increasing 
function of deprivation while the slope of 
the initial rate vs. log volume function was 
not. When Ss were thirsty the rate vs. log 
concentration function became flat at the 
low concentrations and decreased at the high, 
while the rate vs. log volume function retained 
the same shape. Rate of shutoff, within a 
session, was an inverse function of the mag- 
nitude of reinforcement at low levels of rein- 
forcement and a function of the magnitude 
of the load at high levels of reinforcement. 
The rate of shutoff did not appear to be 
greatly affected by deprivation. 

The relations found for saccharin were 
similar to those found for sucrose. The im- 
plications of the differences between sac-
charin and sucrose, e.g., osmotic and meta-
bolic, were examined for an account of these
relations in terms of the view that there is a 
threefold locus of events governing the rate 
of responding, the proximal reinforcing stim- 
uli, the momentary ingestive load, and the 
nutritive condition of the animal. 


REFERENCES 
CARLSON, M. J., ELDRIDGE, C. T., MARTIN,
H. P., & FORAN, F. L. Studies in the
physiological action of saccharin. J.
metabol. Res., 1923, 3, 451-477.


COCKRELL, J. T. Operant behavior in rela- 
tion to the concentration of a nonnutritive 
sweet substance used as a reinforcement. 
Unpublished doctoral dissertation, Indiana 
University, 1952. 

COLLIER, G. The loci of reinforcement.
Amer. Psychologist, 1959, 14, 398. (Ab-
stract)

COLLIER, G., & MYERS, L. The loci of rein-
forcement. J. exp. Psychol., 1961, 61,
57-66.

COLLIER, G., & WILLIS, F. Deprivation and
reinforcement. J. exp. Psychol., 1961, 62,
377-384.

FITZHUGH, O. G., NELSON, A. A., & FRAWLEY,
J. P. A comparison of chronic toxicities 
of synthetic sweetening agents. J. Amer. 
Pharmaceut. Ass., scient. Ed., 1951, 40, 
583-586. 

GUTTMAN, N. Equal reinforcement values for
sucrose and glucose solutions compared 
with equal sweetness values. J. exp. 
Psychol., 1954, 47, 358-361. 

HAUSMANN, M. F. The behavior of albino 
rats in choosing foods: II. Differentiation 
between sugar and saccharine. J. comp. 
Psychol., 1933, 15, 419-428. 

HILLIX, W. A. Volume ingested as a function
of deprivation, taste, and nutrition. Un- 
published doctoral dissertation, University 
of Missouri, 1958. 

Manaster, M. Volume per reinforcement 
as a parameter of amount of reinforcement 
functions. Unpublished master’s thesis, 
University of Missouri, 1962. 

STELLAR, E. Drive and motivation. In J.
Field (Ed.), Handbook of physiology. Vol.
3. Neurophysiology. Washington, D. C.:
American Physiological Society, 1960. 
Pp. 1501-1527. 

STELLAR, E., HYMAN, R., & SAMET, S.
Gastric factors controlling water and salt- 
solution drinking. J. comp. physiol. 
Psychol., 1954, 47, 220-226. 

THOMPSON, M. M., & MAYER, J. Hypo-
glycemic effects of saccharin in experi-
mental animals. Amer. J. clin. Nutr., 
1959, 7, 80-85. 


(Received July 22, 1961) 


Journal of Experimental Psychology 
1962, Vol. 64, No. 2, 192-197 


ADAPTATION IN THE PERCEPTION OF VISUAL VELOCITY 


V. R. CARLSON
National Institute of Mental Health 


The aftereffect due to prolonged 
viewing of a constantly moving stimu- 
lus pattern has been studied with (a) 
a stationary test pattern, (b) a pat- 
tern moving in the same direction 
slowly enough to appear stationary, 
and (c) a pattern moving in the same 
direction and at the same objective 
speed as the adapting pattern. If one 
views a pattern moving down for a 
time and then, within some number 
of seconds, looks at a stationary 
pattern, the latter will give an appear- 
ance of moving up (the so-called water- 
fall illusion); if the pattern is moved 
down at a slow rate instead of re- 
maining stationary, the apparent up- 
ward motion can be cancelled. If 
the original pattern continues and 
a second, identically moving pattern 
is presented in a different part of the 
field, the first will appear to be moving 
at a slower rate than the second. 

Gibson (1937, pp. 234-236; 1959, 
pp. 490-491) has reviewed these 
several effects and has pointed out 
that they are consistent with the 
proposition that adaptation involves
an oppositely directed decrement in 
apparent velocity along the dimension 
of the adapting velocity. As a 
purely empirical generalization, one 
would predict that a test pattern 
moving in the same direction, but 
faster than the adapting pattern, 
should appear to be moving more 
slowly than it would in the absence 
of the prior adaptation. Similarly, 
if the test pattern is moving in the 
direction opposite to that of the adapt- 
ing pattern and at a more or less 
comparable rate of speed, then the 
test pattern should appear to be 


moving faster than normal in that 
opposite direction. The present study 
is a test of these two hypotheses, 
utilizing a procedure for measuring 
perceived velocity which does not 
depend upon subjective awareness 
of the occurrence of any aftereffect. 


METHODS 


Perceptual velocity test.—Fixating binocu-
larly, S viewed an aperture (depicted in
Fig. 1) through which a bright test line rotated 
orbitally at constant speed. Two seconds 
before the appearance of the test line, a tone 
signal was presented and at the same time 
the designated target was lighted. The S
depressed a key and held it down until he 
judged that the test line had reached the 
target position beyond the point of disap- 
pearance of the test line. Response time was 
measured from the moment of disappearance 
of the test line to the instant S released the 
response key. Release of the key also ex- 
tinguished the target. 

This general kind of task has been used by 
a number of investigators in the study of 
tracking performance (reviewed by Brown, 
1961, pp. 99-101). Gerhard (1959) and Held 
and White (1959) have employed it more 
specifically as a means of measuring perceived 
velocity. The present version, although 
similar in principle, differs in many details 
from the particular tests used by those 
investigators.

Adaptation.—Prior to an adaptation test 
trial, a continuously rotating pattern was 
presented in the aperture for 45 sec. The S 
made no response during the adaptation 
period other than to maintain fixation. The 
pattern was moving at a constant rate both 
when it appeared and disappeared, so that 
S never saw it stationary. Three identical 
presentations of the test line then occurred 
at approximately 4, 13, and 22 sec. following 
disappearance of the adaptation pattern. Con-
trol trials were the same as the adaptation
trials except that no pattern appeared in the 
aperture during the adaptation period. One 
minute elapsed between successive test periods 
(whether adaptation or control). 


[Figure 1: diagram of the viewing aperture, with labels for the test line, a 5-mm. diameter, a 6-cm. dimension, and the fixation point.]

FIG. 1. Aperture used in perceptual
velocity test.


Schedule.—The speeds of rotation were
30°, 40°, and 53°/sec, with corresponding cor-
rect response times of .33, .25, and .19 sec.,
respectively. At the beginning of each ses-
sion S performed two practice test trials at
40°/sec, one clockwise and one counterclock- 
wise. The experimental schedule consisted 
of six control conditions interspersed among 
18 adaptation/test conditions on each of 4 
days (the nine possible speed combinations 
each occurred once with the adaptation and 
test rotations in the same direction and once 
in opposite directions). The order of occur- 
rence of these conditions was carefully dis- 
tributed and permuted so as to minimize 
possible sequential effects. Several days, 
usually 1 week, intervened between testing 
sessions. 
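
The correct response times in the schedule are what one would get by dividing the angular distance from the point of disappearance to the target by the speed of rotation. The distance itself is not stated in the text; the sketch below assumes about 10°, chosen only because it reproduces the three reported times. The function names and the sign convention of the error score (response time minus correct time) are likewise assumptions made for illustration.

```python
# Assumption (not stated in the paper): the target lay about 10 degrees of
# rotation beyond the point where the test line disappeared.
ASSUMED_TARGET_DISTANCE_DEG = 10.0


def correct_response_time(speed_deg_per_sec, distance_deg=ASSUMED_TARGET_DISTANCE_DEG):
    """Time in seconds for the hidden test line to reach the target."""
    return distance_deg / speed_deg_per_sec


def percentage_error(response_time_sec, speed_deg_per_sec):
    """Response-time error as a percentage of the correct response time.

    Sign convention assumed here: positive means S responded too late.
    """
    correct = correct_response_time(speed_deg_per_sec)
    return 100.0 * (response_time_sec - correct) / correct


if __name__ == "__main__":
    for speed in (30.0, 40.0, 53.0):
        print(speed, round(correct_response_time(speed), 2))
```

Under the assumed distance the three speeds give .33, .25, and .19 sec., matching the schedule. Expressing error as a percentage of the correct time is what allows scores from different test speeds to be pooled.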

Apparatus.—The S was positioned 2 m.
in front of a 59 X 76 cm. rear-projection, 
flashed opal-glass screen with the fixation 
point at eye level. The aperture, fixation 
point, and target spots were produced by cut- 
outs in a sheet of cardboard mounted im- 
mediately behind the glass. The test line 
and adaptation pattern were produced by 
open sectors in discs mounted on a rotator 
with the center of rotation in line with the 
fixation point and S’s eye. The test line con- 
sisted of a 2° sector, the adaptation pattern, 
of 18 2° sectors spaced 18° apart. The screen, 
its black frame, and the fixation point were 
always clearly visible, but the rest of the test 
stimuli appeared only at the appropriate 
times during an actual trial. 
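
The geometry of the adaptation pattern fixes how often a sector passed any given point of the aperture, which bears on the confounding of contour frequency with speed raised in the Discussion. The sketch below is a rough illustration that reads "spaced 18° apart" as the gap between adjacent 2° sectors, an assumption that makes the 18 sectors fill the full 360°. The names are hypothetical.

```python
SECTOR_WIDTH_DEG = 2.0   # from the text: each open sector was 2 degrees wide
GAP_DEG = 18.0           # assumption: "spaced 18 deg apart" is the gap between sectors
N_SECTORS = 18           # from the text


def pattern_period_deg():
    """Angular distance from the leading edge of one sector to the next."""
    return SECTOR_WIDTH_DEG + GAP_DEG


def sectors_per_second(speed_deg_per_sec):
    """Number of sectors passing a fixed point of the aperture each second."""
    return speed_deg_per_sec / pattern_period_deg()


if __name__ == "__main__":
    # Check that the assumed reading closes the circle.
    print(N_SECTORS * pattern_period_deg())
    for speed in (30.0, 40.0, 53.0):
        print(speed, round(sectors_per_second(speed), 2))
```

On this reading, frequency of passage is strictly proportional to speed, so the two cannot be separated within the adaptation pattern itself. Only the relative direction of the test line can distinguish a velocity effect from a frequency effect.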

Subjects. Fifteen male and 4 female
junior college students were paid to serve as
Ss in the main experiment. The data for
the females fell within the distributions for
the males and showed no consistent tenden-
cies toward deviation from the data of the
males. The two groups were therefore
treated as a single group of 19 Ss.

1 All designations of measure in degrees
in this paper refer to angular distance around
the fixation point in the frontal plane of the
fixation point, not to visual angle.

Preliminary work.—A variety of Ss have 
shown very high correlations between re- 
sponse time in this perceptual velocity test 
and presented speed of rotation of the test 
line, although absolute response time has 
varied appreciably from one individual to 
another. No relationship was found between 
response time in this task and simple reaction 
time to the cessation of a light, in agreement 
with Held and White (1959). 

In a preliminary adaptation experiment 
with 11 Ss (other than those used in the main 
experiment) only control trials were pre- 
sented on a first day, only adaptation trials 
on a second day, and only control trials again 
on a third day. Overall response-time level 
tended to shift toward generally increased 
response times after the first day, and it was 
for this reason that the control trials were 
distributed among the adaptation trials in 
the main experiment. The results of the 
preliminary experiment were otherwise es- 
sentially the same as those reported below. 


RESULTS 


Clockwise vs. counterclockwise ro- 
tation, as such, had no differential 
effect, so these conditions have been 
combined. Rotations in the Same 
direction thus refer to conditions in 
which both the adaptation pattern 
and the subsequent test line turned 
clockwise or both counterclockwise; 
rotations in Opposite directions refer 
to those conditions in which one 
turned clockwise and the other coun- 
terclockwise. 

TABLE 1

AVERAGE DIFFERENCE IN PERCENTAGE RESPONSE-TIME ERROR (CONTROL
MINUS ADAPTATION) FOR FIRST TRIAL FOLLOWING ADAPTATION

Adaptation vs.      Adaptation Speed Relative to Test Speed
Test Direction      Greater       Equal        Less

Same
  Mean               −10.5        −14.7         1.6
  SD                   9.0         23.3        20.9
  P a                <.001         <.02          ns
Opposite
  Mean                −0.6         −4.4        −8.7
  SD                   6.4         11.8        18.4
  P a                   ns         <.10        <.05

a Probability associated with the t value for the
difference of the mean from zero, df = 18.

An overall test for effect on the
first test-line presentation following
each adaptation period was made
according to whether speed of the
adaptation pattern was greater than,
equal to, or less than the speed of the
test line and whether rotations were
in the Same or Opposite directions
(Table 1). Since each of these six
categories combines three different
speed conditions (the appropriate
combinations of 30°, 40°, and 53°/
sec), response-time error expressed
as a percentage of the correct response 
time for each given test speed was 
used as the score. According to the 
empirical hypothesis under test, the 
control-minus-adaptation differences 
for rotations in the Same direction 
should be negative (relatively longer 
response time indicating slower ap-
parent speed following adaptation as
compared to control); the differences 
for rotations in Opposite directions 
should be positive (test line appearing 
to move relatively faster following 
adaptation to a pattern moving in 
the opposite direction). For only two 
of the six categories in the table do the 
results agree with such a prediction. 
These two (adaptation speed equal 
to, or greater than, test speed, rota- 
tions in Same direction) represent 
consistent effects, each of the three 
conditions within each category at 
least approaching a significant dif-
ference (P < .10) when tested sepa-
rately. No single within-category
condition for rotations in Opposite 
directions reached a significant effect 


by itself, although a tendency in the 
nonpredicted direction is suggested 
for two of these categories. If the 
average values for Same and Opposite 
rotations are compared with each 
other, instead of with the control 
values, the differences are generally 
significant (P <.001, <.05, and <.10, 
respectively, for adaptation speed 
greater than, equal to, and less than 
test speed).
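
The probabilities in Table 1 rest on a t test of each mean difference against zero with df = 18, that is, 19 Ss. A minimal sketch of that computation from the tabled means and SDs for the Same-direction row follows. It assumes the reported SD is the sample standard deviation of the 19 difference scores, which the text does not state, and the function name is hypothetical. Because the tabled values are rounded, the results are only approximate.

```python
import math

N_SUBJECTS = 19  # df = 18 in the table footnote


def one_sample_t(mean_diff, sd_diff, n=N_SUBJECTS):
    """t statistic for the difference of a mean from zero.

    Assumes sd_diff is the sample SD of the n difference scores.
    """
    standard_error = sd_diff / math.sqrt(n)
    return mean_diff / standard_error


# Same-direction row of Table 1: (mean, SD) for adaptation speed
# greater than, equal to, and less than test speed.
same_direction = {
    "greater": (-10.5, 9.0),
    "equal": (-14.7, 23.3),
    "less": (1.6, 20.9),
}

if __name__ == "__main__":
    for label, (mean_diff, sd_diff) in same_direction.items():
        print(label, round(one_sample_t(mean_diff, sd_diff), 2))
```

The large t for the "greater" category reflects its small SD as much as its mean: the "equal" category has a larger mean difference but far more variability across Ss.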

More detailed results for rotations 
in the Same direction are presented 
in Fig. 2. The corresponding curves 
for rotations in Opposite directions 
are not shown, since they were not, 
in any specific instance, significantly 
different from the control values. 
Each adaptation period was followed
by three successive, identical test
trials, occurring at approximately 4,
13, and 22 sec. after disappearance
of the adaptation pattern. A lower
value on the ordinate represents
longer response time, indicating rela-
tively slower perceived speed of the
test line.

[Figure 2: three panels, for test trials at 53°/sec, 40°/sec, and 30°/sec, each showing curves for adaptation speeds of 30°, 40°, and 53°/sec following adaptation together with a control curve; abscissa: test trials.]

FIG. 2. Response-time errors on three
successive test trials following adaptation
(test and adaptation rotations in the Same
direction).

The three control points for test 
trials at the middle speed, 40°/sec, 
do not differ from each other within 
the limits of error. For both 30°/sec 
and 53°/sec, the second trial tends 
to be different from the first (P <.05), 
effective speed of the test line shifting 
toward a relatively slower value for 
53°/sec and toward a relatively 
faster value for 30°/sec. These trends 
are generally discernible in the curves 
for the adaptation trials as well. 

The clearest aftereffect is a dis-
placement of the points downward
when the velocity of the adaptation 
pattern was greater than the velocity 
of the test line. A lesser effect, but 
in the same direction, occurred when 
the adaptation and test-line velocities 
were equal. In all of the present 
data the only instance of a significant 
increase in perceived speed due to 
prior adaptation was produced by 
adaptation at 30°/sec on test trials 
at 53°/sec, rotations in the Same
direction. All three points for this 
curve are significantly higher than 
the corresponding control points 
(P < .01). 

Since the preliminary experiment 
suggested an overall shift toward 
longer response times with succeeding 
days, the data were also analyzed for 
such a trend. The mean response
times for the control trials were .10,
.06, and .04 sec. longer for speeds 
30°, 40°, and 53°/sec, respectively, 
on the fourth as compared to the 
first day. These increases were pro- 
gressive over the 4 days, but there 
was no consistent trend from begin-
ning to end of testing sessions within
days.


DISCUSSION


Motion aftereffects of the waterfall 
type have represented a class of effects 
in which a decrement in effective velocity 
accrues to a test stimulus moving in the 
same direction as the adapting stimulus 
and at the same, or a slower, speed. 
The significant negative aftereffects which 
occurred in the present experiment fall 
within this class. The procedure used 
here is incapable of utilizing a stationary 
test stimulus, but Johansson (1956) has 
devised a similarly nonsubjective tech- 
nique representing the stationary test 
condition. His results also indicate a 
negative aftereffect of motion. The 
more objective procedure, therefore, 
appears to produce results which are 
consistent with previous subjective de- 
terminations of motion aftereffect, at 
least with respect to direction. 

Smith and Sherlock (1957) have 
pointed out that when a pattern moves 
through an aperture the apparent fre- 
quency of passage of contours is con- 
founded with apparent speed. Such 
confounding was the case here with 
respect to the adaptation pattern and 
presumably contributed to the impres- 
sion of its speed. Also, the time the test 
line was visible was confounded with its 
speed through the aperture, and it is 
possible that time-judgment is an im- 
portant aspect of this type of task (Ger- 
hard, 1959). But if the adaptation 
effects were due solely to the factors 
of frequency or time, then the direction 
of rotation of the test line relative to that 
of the adaptation pattern should not 
have made any difference. Since this 
relation was of major importance in the 
results, we can conclude, tentatively 
at least, that there was adaptation to 
velocity, not just to speed or a non- 
directional component such as frequency 
or time. One aspect of the results, 
however, does suggest some nondirec- 
tional adaptation. That was the tend- 
ency for the three successive test trials in 
a set, whether following the adaptation
pattern or not, to shift toward a higher
value for the slowest speed and toward 
a lower value for the fastest speed. This 
effect would be consistent with the notion 
that an Adaptation Level tended to 
become established at or near the middle 
speed (or response time), dependent 
upon a cumulative effect of all preceding 
trials.

One of the directional effects differen- 
tially supports Adaptation Level theory 
rather than the principle of negative 
aftereffect: Adaptation at 30°/sec pro- 
duced an apparent increase upon the 
test speed at 53°/sec, when both rota- 
tions were in the same direction. How- 
ever, the two other comparable condi- 
tions, 30° upon 40° and 40° upon 53°/ 
sec, did not. Moreover, the lack of 
negative aftereffect with rotations in 
opposite directions also appears incon- 
sistent with Adaptation Level theory. 
According to Helson (1959), “. . . ad- 
aptation has not only negative after- 
effect but positive and negative effects 
simultaneously: high AL sensitizes to 
negative qualities, low AL to positive 
qualities, and intermediate AL to both 
positive and negative qualities. . .”
(pp. 572-573). 

In any case, it is specifically with 
respect to Gibson’s generalization of 
negative aftereffect that the present 
findings do not agree. Gibson (1959) 
conceives of a single dimension as con- 
sisting of values varying from high in 
one direction through zero to high in the 
opposite direction. Adaptation to any 
particular value along the entire dimen-
sion reduces the difference between
that value and zero and simultaneously 
shifts all values along the dimension 
in the same algebraic direction (p. 490). 
If the implication of the present findings
is correct, however, the principle would 
appear not to apply to perceived velocity, 
whether one understands the pertinent 
variable of stimulation to be angular 
optical motion across the retina or the 
“shear” relation between a pattern and 
the edges of an aperture (Gibson, 1958, 
p. 168). 

On the other hand, the existence of
ganglion cells in the vertebrate retina
which are differentially sensitive to the 
direction of motion of a contour (Hubel 
& Wiesel, 1959, pp. 581-584; Maturana, 
Lettvin, McCulloch, & Pitts, 1960, p. 
159) suggests a possible basis for a lack 
of aftereffect when the adapting and 
test motions are in opposite directions. 
As far as such cells, or processes de- 
pendent upon them, would be concerned, 
the effects of stimulation in one direction 
would be expected to have little or no 
effect upon subsequent stimulation in 
a different direction. This is a highly 
speculative hypothesis at this point, 
but it seems likely to require considera- 
tion in future theorizing about motion 
perception. 


SUMMARY 


The effect of adaptation to an orbitally 
rotating pattern on a subsequently presented 
moving test stimulus was assessed using a 
procedure in which S is unaware of the oc-
currence of aftereffect. When adaptation
and test motions were in the same direction, 
results were generally consistent with already 
known aftereffects of the waterfall-illusion 
type. But little or no aftereffect occurred 
when adaptation and test motions were in 
opposite directions. This finding agrees 
neither with Adaptation Level theory nor 
with Gibson's principle of negative after- 
effect. It may, however, be related to the 
recent discovery of retinal units which are 
differentially sensitive to the direction of 
stimulus movement. 


REFERENCES 


BROWN, R. H. Visual sensitivity to differ-
ences in velocity. Psychol. Bull., 1961, 58,
89-103.

GERHARD, D. J. The judgment of velocity
and prediction of motion. Ergonomics,
1959, 2, 287-304.

GIBSON, J. J. Adaptation with negative
after-effect. Psychol. Rev., 1937, 44, 222-

GIBSON, J. J. Research on the visual per-
ception of motion and change. In, Pro-
ceedings second symposium on physiological
psychology. Washington, D. C.: Office of
Naval Research, (ONR symp. Rep. No.
ACR-30) 1958. Pp. 165-176.

GIBSON, J. J. Perception as a function of
stimulation. In S. Koch (Ed.), Psychology:
A study of a science. Vol. 1. New York:
McGraw-Hill, 1959. Pp. 456-501.

HELD, R., & WHITE, B. Sensory deprivation
and visual speed: An analysis. Science,
1959, 130, 860-861.

HELSON, H. Adaptation level theory. In
S. Koch (Ed.), Psychology: A study of a
science. Vol. 1. New York: McGraw-Hill,
1959. Pp. 565-621.

HUBEL, D. H., & WIESEL, T. N. Receptive
fields of single neurones in the cat's striate
cortex. J. Physiol., 1959, 148, 574-591.

JOHANSSON, G. The velocity of the motion
after-effect. Acta Psychol., Amst., 1956,
12, 19-24.

MATURANA, H. R., LETTVIN, J. Y., MC-
CULLOCH, W. S., & PITTS, W. H. Anatomy
and physiology of vision in the frog (Rana
pipiens). J. gen. Physiol., 1960, 43, 129-
17

SMITH, O. W., & SHERLOCK, L. A new ex-
planation of the velocity-transposition
phenomenon. Amer. J. Psychol., 1957, 70,
102-105.


(Received July 27, 1961)


Journal of Experimental Psychology 
1962, Vol. 64, No. 2, 198-199 


SUPPLEMENTARY REPORT: EFFECT OF MODE OF RESPONSE ON
JUDGMENT OF FAMILIAR SIZE 1


A. V. CHURCHILL
Defence Research Medical Laboratories, Toronto, Canada 


Recent studies have compared the accuracy 
of estimation of the size of familiar objects 
from memory, with the accuracy of estima- 
tions obtained when the same objects were 
presented visually. Bolles and Bailey (1956) 
had Ss give verbal estimates of the size of 54 
familiar objects from memory, followed by 
verbal estimates of the size of the same ob- 
jects when the objects were visible. The 
“familiar” objects were present in the immedi- 
ate environment and included items ranging 
from pencils and books to furniture and auto- 
mobiles. The procedure suggests that the 
objects were familiar in terms of the recency 
of their appearance in S's environment. 

McKennell (1960) had Ss draw lines to 
represent their estimates, from memory, 
of the size of nine common objects, followed 
by estimates of the size of the same objects 
when they were presented visually. The 
“familiar” objects in this study included 
such standard-sized items as a 9-in. rule and 
a cigarette, and such non-standard-sized items 
as a medicine bottle and a penknife. 

The present study was designed to examine 
the reliability of the results obtained under 
the two response conditions, and to deter- 
mine the effect of the mode of response, 
when estimating the size of familiar objects 
from memory. 

Method.—Six laboratory personnel served 
as Ss. The objects to be estimated were 

verbally identifiable as being of a specific size, 
and were familiar to Ss through daily ex- 
posure in the environment. The group of 
objects provided a series of 40 dimensions, 
ranging from x% in., the diameter of a govern- 
ment issue pencil, to 12 in., the length of a 
Province of Ontario automobile license plate. 

The S sat at a table in front of a screen 
which eliminated his visual reference to 
objects in the room which might have given 
cues to the size of the dimension being esti-
mated. Under the physical-response condi-
tion, S was required to separate two straight-
edges, which were mounted on tracks over
a continuous strip of paper, and to mark off 
his estimate with a nonstandard pencil. 
Each marked-off estimate was removed from
S's view before continuing with the series.
Under the verbal-response condition, S sat
in the same position, with his hands folded
to eliminate any tendency towards making
physical estimates between his hands, and
was asked to give a verbal estimate of the
specified dimension. The physical-response
condition preceded or followed the verbal-
response condition for alternate Ss. The
entire procedure was repeated in one session,
resulting in four sets of estimates of the 40
dimensions by each S, two physical and two
verbal. Each series of 40 dimensions was
presented in random order.

1 Defence Research Medical Laboratories Project
No. 164, DRML Report No. 164-12, PCO Ng Dyes
20.27, HR No. 196. EPA

Results.—Product-moment correlations 
were calculated, for each S, between the 40 
measured dimensions and each of the four 
sets of estimates. Correlations were calcu- 
lated between the two sets of physical esti- 
mates, between the two sets of verbal esti- 
mates, and between the means of the two 
sets of physical estimates and the means of 
the two sets of verbal estimates, for each S. 
The resultant 42 correlation coefficients
ranged from .92 to .99.
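
The total of 42 coefficients follows from seven correlations for each of the six Ss: four between the measured dimensions and each set of estimates, one between the two physical sets, one between the two verbal sets, and one between the physical and verbal means. The sketch below illustrates that bookkeeping with a plain product-moment correlation. The five dimensions and the estimates are made-up values for illustration only, not data from the study, and the function names are hypothetical.

```python
import math


def pearson_r(x, y):
    """Product-moment correlation between two equal-length sequences."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def correlations_for_subject(measured, phys1, phys2, verb1, verb2):
    """The seven correlations described in the Results, for one S."""
    phys_mean = [(a + b) / 2 for a, b in zip(phys1, phys2)]
    verb_mean = [(a + b) / 2 for a, b in zip(verb1, verb2)]
    return {
        "measured_vs_phys1": pearson_r(measured, phys1),
        "measured_vs_phys2": pearson_r(measured, phys2),
        "measured_vs_verb1": pearson_r(measured, verb1),
        "measured_vs_verb2": pearson_r(measured, verb2),
        "phys1_vs_phys2": pearson_r(phys1, phys2),
        "verb1_vs_verb2": pearson_r(verb1, verb2),
        "phys_mean_vs_verb_mean": pearson_r(phys_mean, verb_mean),
    }


# Made-up values in inches, for illustration only (not data from the study).
measured = [0.5, 1.0, 3.0, 6.0, 12.0]
phys1 = [0.6, 1.1, 2.8, 6.5, 11.0]
phys2 = [0.5, 1.2, 3.1, 6.2, 11.5]
verb1 = [0.5, 1.0, 3.0, 7.0, 12.0]
verb2 = [0.75, 1.0, 2.5, 6.0, 11.0]

if __name__ == "__main__":
    result = correlations_for_subject(measured, phys1, phys2, verb1, verb2)
    n_subjects = 6
    print(len(result) * n_subjects)  # seven coefficients for each of six Ss
```

A high correlation of this kind shows that estimates track the ordering and spacing of the true sizes. It does not by itself show that the estimates are free of constant or proportional error, which is why relative error is reported separately.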

The reproducibility of the physical re-
sponses was shown by correlations of .98 to
.99 between the first and second trials, and of
the verbal responses by correlations of .93
to .99. The relatively small mode-of-response
effect was shown by correlations of .96 to .99 
between the means of the two sets of physical 
responses and the means of the two sets of 
verbal responses. 

These results are consistent with those 
reported by Bolles and Bailey (1956), in that 
Ss achieved a high degree of accuracy when 
making verbal estimates of the size of familiar 
objects on the basis of memory alone, and 
with those of McKennell (1960), in that Ss
achieved a high degree of accuracy when 
making physical estimates of the size of 
familiar objects on the basis of memory alone. 

The correlations between the estimates
obtained under the two response conditions
suggest that the mode of response had little
effect on the accuracy of estimation of the
size of familiar objects when the estimates
were made from memory. The tendency
towards slightly higher correlations under the
physical-response condition was probably
due to the fact that Ss tend to verbalize their
estimations on a discrete scale, i.e., 4, è d
or 1 in., which would not apply to physical
estimations on a continuous scale. This
rounding-off effect, under the verbal-response 
condition, increased with the increase in the 
dimension to be estimated, thus, estimates 
of dimensions between 1 and 6 in. showed 
30% to be rounded off to an even inch, where- 
as, estimates of dimensions between 6 and 
12 in. showed over 80% rounded off to an 
even inch. 

That Ss apparently did not remember the 
numerical value of verbal estimates from 
trial to trial is suggested by the fact that only 
37% of the pairs of verbal estimates differed 
by yy in. or less, while 30% of the pairs of 
physical estimates differed by Ys in. or less. 

Relative error, i.e., error as a percentage 
of the dimension estimated, was 10% or 
greater in 64% of the verbal estimates, and 
in 57% of the physical estimates. All esti-
mates of dimensions less than }-in, dimen-
sion were less than 1 in.; approximately 50%
of the }{-in. dimension were less than 1 in.;
no estimate of dimensions of 1 in. or greater
was less than 1 in.

Estimates of some of the dimensions
suggested that Ss overestimate familiar
objects remembered on a small background
more than when these same objects are
remembered on a larger background, i.e., a
license plate on a small European automobile
is overestimated more than one remembered
on a standard American automobile.


REFERENCES


BOLLES, R. C., & BAILEY, D. E. Importance of object
recognition in size constancy. J. exp. Psychol., 1956,
51, 222-225.

MCKENNELL, A. C. Visual size and familiar size:
Individual differences. Brit. J. Psychol., 1960, 51,


(Received June 9, 1961)


Journal of Experimental Psychology
1962, Vol. 64, No. 2, 199-200


SUPPLEMENTARY REPORT: AN EXAMINATION OF AN ASPECT OF THE
GELB EFFECT 1


JACOB BECK 2
University of Pennsylvania


In his experiment with a hidden light 
source, Gelb (1929) reported that the intro-
duction of a small bit of white paper in front
of a spinning black disk changed the ap- 
pearance of the disk from a faintly illuminated 
white to a brilliantly illuminated black. 
Recently, Stewart (1959) showed that this 
change in disk lightness is consistent with the 
usual laws governing lightness contrast. 
Beck (1961) suggested that the corresponding 
change from a faintly to a brilliantly illumi- 
nated field is the consequence of the increased 
luminance of the area now seen as white. 
He (Beck, 1959, 1961) reported that the 
judgment of illumination of a visual field 
consisting of discriminable areas of differing 
but uniform luminances is strongly influenced 
by the luminance of the highest reflecting 
area, i.e., the brightness of the area seen as 
white. The present experiment tested this 
suggestion by obtaining judgments of light- 
ness and illumination for a situation which 
was in principle the same as that of Gelb's. 

1 This experiment was supported by a grant (B1876)
from the National Institute of Neurological Diseases
and Blindness, United States Public Health Service.

2 Now at Harvard University.

Method.—Beck's (1961) apparatus and
procedure were used. The Os adjusted the
illumination on a comparison surface until 
it appeared equal to that of the standard 
while viewing each with monocular vision and 
a motionless head in a completely dark room. 
The standard and comparison surfaces were 
8 X 13 in. white, gray, and black smooth 
matte papers. The reflectances of the papers 
were, respectively, 84%, 21%, and 1%. The 
standard surface was half white and half 
black. Two comparison surfaces were used: 
One was composed of two luminance levels, 
equal sections of gray and black; the second 
was composed of three luminance levels, equal 
sections of white, gray, and black. 

All luminance measurements were taken 
with a Spectra brightness spot meter. T he 
experimental conditions and data are pre- 
sented in terms of the incident illumination 
values which were computed from the lumi- 
nance measures and the reflectances of the 
papers. The incident illumination on the 
standard surface was 1.79 ft-c and corre- 
sponded to a luminance of the white area 
equal to 1.5 ft-L. Thus, to equate the 
maximum luminances of the comparison 
surfaces to that of the standard, Os should 


TABLE 1
MEDIANS OF Os' LIGHTNESS MATCHES

           Standard Surface    Two-Level Comp. Surface   Three-Level Comp. Surface
Match      Median   Q1-Q3      Median   Q1-Q3            Median   Q1-Q3
White      63%      63%-63%    63%      43%-63%          63%      63%-63%
Gray                                                     25%      25%-31%
Black      12%      8%-12%     22%      19%-28%          12%      8%-16%


have adjusted the incident illumination on 
the two-level comparison surface to 7.14 ft-c 
and on the three-level comparison surface to 
1.79 ft-c. For each comparison surface, O 
made 10 separate illumination matches with
E alternately setting the incident illumina- 
tion to a point either too high or too low. 
The order in which the two comparison sur- 
faces were presented was alternated. At the 
conclusion of the illumination matches for 
each comparison surface, the illumination on 
the standard surface remained at the value 
used in the experiment while the illumination 
on the comparison surface was set at the 
median of O's matches. The Os were then 
asked to match the lightnesses of the standard 
and comparison surfaces with a chart of 
Hering grays. Ten Os were used. All had 
normal vision and were naive regarding the 
experiment. 

Results and discussion.—Table 1 shows the 
medians and interquartile ranges of the 
reflectances of the samples on the Hering 
chart matched to the papers composing the 
standard and comparison surfaces. In agree- 
ment with Gelb's (1929) finding, the table 
indicates that the area of maximum reflec- 
tance on each surface was seen as light gray 
or white and the other areas were seen as 
gray or black depending upon their relative 
reflectances. On the two-level comparison 
surface, the medians of Os’ lightness matches 
of the gray and black papers were 63% and
22%. However, due to the white paper on
the three-level surface, the gray was now
matched to a sample reflecting 25% and the
black to a sample reflecting 12%.

Corresponding to the darkening of the 
papers on the three-level surface, the illumination
matches were lower. On the two-level
surface, the median illumination match was 
8.57 ft-c with an interquartile range of 7.62- 
9.52 ft-c while on the three-level surface the 
median illumination match was 2.38 ft-c with 
an interquartile range of 1.79-2.74 ft-c. 
However, the luminances of the maximum 
reflecting areas on the two- and three-level 
comparison surfaces corresponding to the 
median illumination settings were similar, 
1.8 ft-L and 2 ft-L. This is consistent with 


the previous findings that Os’ illumination 
judgments of a visual field consisting of sur- 
faces of different lightnesses (and placed so 
that neither shadows nor highlights are 
present) are greatly influenced by the bright-
ness of the area seen as white. On both the
two- and three-level comparison surfaces, the 
maximum luminances corresponding to Os’ 
median illumination matches were higher 
than the maximum luminance, 1.5 ft-L, of 
the standard surface. In part, this may be 
the result of an enhanced impression of 
illumination of the standard surface due to 
the juxtaposition on it of areas of great lightness
difference (Beck, 1959).
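The luminance figures quoted above can be checked with a short calculation. The sketch below assumes the usual relation for an ideal matte surface, luminance in ft-L equal to incident illumination in ft-c times reflectance, and uses the reflectances stated in the Method section; the function and variable names are illustrative only and do not come from the original report.

```python
# Reflectances of the matte papers as stated in the Method section.
REFLECTANCE = {"white": 0.84, "gray": 0.21, "black": 0.01}


def luminance_ft_l(illumination_ft_c, reflectance):
    """Luminance (ft-L) of an ideal matte surface under the given illumination (ft-c)."""
    return illumination_ft_c * reflectance


def illumination_for_luminance(target_ft_l, reflectance):
    """Illumination (ft-c) needed for a surface of this reflectance to reach target_ft_l."""
    return target_ft_l / reflectance


# Maximum luminance of the standard surface (white paper under 1.79 ft-c).
standard_max = luminance_ft_l(1.79, REFLECTANCE["white"])

# Setting that would equate the maximum luminance of the two-level surface
# (gray is its lightest paper) to the 1.5 ft-L of the standard.
two_level_setting = illumination_for_luminance(1.5, REFLECTANCE["gray"])

# Maximum luminances implied by the median illumination matches.
two_level_max = luminance_ft_l(8.57, REFLECTANCE["gray"])
three_level_max = luminance_ft_l(2.38, REFLECTANCE["white"])

print(round(standard_max, 2), round(two_level_setting, 2))
print(round(two_level_max, 2), round(three_level_max, 2))
```

Under this assumption the computed values agree with the 1.5 ft-L, 7.14 ft-c, 1.8 ft-L, and 2 ft-L reported in the text to within rounding.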

Katz (1935, p. 283) hypothesized that the 
judgment of illumination is based on the 
insistence of the field, i.e., the average lumi- 
nance of the field. The average luminances 
of the standard and comparison fields are 
equated when the incident illumination on 
the two-level comparison surface is set at 
5.71 ft-c and on the three-level comparison 
surface at 2.14 ft-c. On the two-level com- 
parison surface, Os’ illumination matches 
are closer to the setting which equates the 
maximum rather than the average luminances
of the standard and comparison surfaces.
On the three-level comparison surface,
the settings equating the average and maxi- 
mum luminances are too similar to be dis- 
tinguished. However, the earlier experi- 
ments by Beck (1959, 1961) showed that 
for the simplified visual field here considered
judgments of illumination are more
strongly influenced by the maximum luminance
reflected from the field than by the
average luminance.


REFERENCES 


Beck, J. Stimulus correlates for the judged illumination
of a surface. J. exp. Psychol., 1959, 58, 267-274.

Beck, J. Judgments of surface illumination and lightness.
J. exp. Psychol., 1961, 61, 368-377.

Gelb, A. Die "Farbenkonstanz" der Sehdinge. In A.
Bethe, G. v. Bergmann, G. Embden, & A. Ellinger (Eds.),
Handbuch der normalen und pathologischen Physiologie.
Vol. 12, Part I. Berlin: Julius Springer, 1929.

Katz, D. The world of color. London: Kegan Paul,
1935.

Stewart, E. C. The Gelb effect. J. exp. Psychol.,
1959, 57, 235-242.

(Received June 7, 1961) 


Journal of Experimental Psychology


Vol. 64, No. 3


SEPTEMBER 1962 


ON THE INHIBITORY EFFECTS OF A SECOND STIMULUS 
FOLLOWING THE PRIMARY STIMULUS TO REACT 


HARRY HELSON AND JOSEPH A. STEGER
Kansas State University 


Many conditions affecting simple 
spot reactions to visual, auditory, and 
tactile stimuli have been investigated 
(cf. Teichner, 1954) but, so far as we 
have been able to ascertain, the fol- 
lowing phenomenon has not been 
hitherto reported in the literature: if 
a second stimulus follows the primary 
stimulus to react, reaction time (RT) 
is lengthened compared with RT when 
only a single stimulus is employed. 
Todd (1912) found that RT to a 
signal (visual, auditory, tactile) was 
lengthened when one or two other 
stimuli preceded the stimulus to react. 
The findings reported here also appear 
to be unique in that the intervals after 
onset of the primary stimulus (S1)
during which the second stimulus
(S2) still exerts a significant influence
on RT are very long, considering the
magnitude of RT to a single stimulus.
In this study we shall be concerned
with the case where S1 and S2 are both
in the same sense modality (vision),
leaving the report of an investigation
of heteromodal effects to a later
publication.1

1 The phenomenon discussed in this
article was discovered by the senior author in
1925 when attempting to condition sensory
responses. In order to conceal the purpose
of the earlier experiment from Ss they were
required to react to a light (tone) which was
followed by a tone (light) 75 msec. later in
most of the trials. In some trials the second
stimulus was omitted and it was then noticed
that RTs were shorter when only a single
stimulus was presented than when the second
stimulus was given. We have now completed
an investigation with light and tone as
the stimuli with results similar to those reported
here for unimodal stimulation.


METHOD


Subjects.—The Ss were 15 male students
enrolled in an introductory psychology course
ranging from 18 to 21 yr. of age. They were
randomly divided into two groups, Group E
(experimental), consisting of 10 Ss, and
Group C (control), consisting of 5 Ss.

Apparatus and procedure.—The apparatus
consisted of a black panel board on which were
mounted three small neon lamps (GE 51)
spaced } in. apart horizontally with a small
red fixation light slightly above the center
lamp. A response button was conveniently
placed for S to press as soon as the primary
light (S1) went on. The E sat behind the
panel board and by pressing a single button
actuated S1, the Standard Electric clock
which measured the reaction times in .01 sec.,
and the Hunter interval timer which operated
the current on the second stimulus (S2) at
intervals ranging from 10 to 180 msec. in
steps of 10 msec. Both S1 and S2 stayed on
until S responded since it is known that
duration of a flashing visual stimulus affects


RT. The room in which the experiment was 
conducted was dimly lighted and shielded 
from external sounds. The luminances of the
stimuli were not determined but they were 
distinctly visible against the black panel in 
the dim light and were not noticeably different 
from one another. 

The three visual stimuli were employed 
for the purpose of counterbalancing position 
of S1 vis-à-vis S2, the interpolated stimulus.
S1 was thus either on the right or the left of
the middle stimulus, the latter always serving
as S2. The actual procedure can best be
understood from the instructions given to Ss: 


This is an experiment in simple reaction 
time. You are to respond to a light as 
quickly as you possibly can by pressing the 
button under your index finger. 

Note the lights facing you. I will tell 
you which end light you are to watch. If 
I say “left,” you will watch the light on 
your left and react to it as quickly as pos- 
sible. If I say “right,” you will watch the 
end light to your right and react to it as 
quickly as possible.

The procedure will be this: I will say 
“right” or “left” and then I will say 
“ready.” A short time after I say “ready” 
the light you are to watch will come on.

Are there any questions? 


The right and left presentations of S1 were
randomly distributed and appeared from .5 to
3 sec. after the ready signal. The intervals at
which S2 followed S1 were randomly sequenced.
Each S in Group E reacted a total
of 380 times, 20 times with S2 presented at
each of the 18 intervals and 20 times when S2
was omitted, the latter being randomly interspersed
among the trials in which S2 followed
the primary signal. The Ss in Group C reacted
360 times to S1 presented in random


TABLE 1

TREND ANALYSIS OF REACTION TIME × INTERVALS DATA

Source        df          MS         F
Linear         1      26,332
Quadratic      1      74,077
Cubic          1       2,533       .21
Intervals     17      44,653     3.703**
Ss             9   1,292,452
Ss × I       153      12,060
Residual      14      45,169     3.745**

* P = .05.
** P = .01.


order on the right or on the left and with no
S2. Two control sets of RTs were thus available
with which to compare the experimental
RTs: (a) the RTs of Group E without S2, and
(b) the RTs of Group C with only S1.
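The trial structure described above for Group E (18 intervals from 10 to 180 msec., 20 trials each, plus 20 trials without the second stimulus, in random order) can be sketched as follows. This is only an illustration: the function name, the dictionary keys, the use of a seeded random generator, and the assumption that side and foreperiod were drawn independently on every trial are not taken from the original report.

```python
import random


def build_group_e_schedule(seed=0):
    """Return a randomized list of 380 trial descriptions for one Group E subject."""
    rng = random.Random(seed)
    intervals_msec = list(range(10, 181, 10))  # 10, 20, ..., 180 (18 intervals)

    trials = []
    for interval in intervals_msec:
        # 20 trials with the second stimulus at each interval.
        trials.extend({"s2_interval_msec": interval} for _ in range(20))
    # 20 trials on which the second stimulus is omitted.
    trials.extend({"s2_interval_msec": None} for _ in range(20))

    for trial in trials:
        # Assumed: side and foreperiod are chosen independently per trial.
        trial["s1_side"] = rng.choice(["left", "right"])
        trial["foreperiod_sec"] = rng.uniform(0.5, 3.0)

    rng.shuffle(trials)
    return trials


schedule = build_group_e_schedule(seed=1)
print(len(schedule))
```

The count works out as 18 × 20 + 20 = 380, which matches the total given in the text.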

The Ss were given four trials with the 
primary stimuli to familiarize them with the 
procedure. There was a 5-sec. break between 
trials and a 2-min. break at the end of Trial
180. The Ss were not told the purpose of the
experiment or the role of S2. All parts of the
equipment were shielded from Ss except the 
light stimuli. 


RESULTS 


The results leave no doubt as to the 
inhibiting effect of S2 on RT to S1.
The results of a trend analysis (Grant, 
1956) of the data across interstimulus 
intervals in Table 1 show that the 
quadratic component is significant at 
the .05 level and that the linear and 
cubic components are not significant.2
This finding lends support to the expectation
that the effect of S2 should
be minimal at some very short interval
following S1, that it should increase
to some maximal value or values at 
certain interstimulus intervals, and 
then decline as the response is being 
consummated. Both this reasoning 
and the trend analysis suggest a para- 
bola as the proper type of function to 
fit the data in Fig. 1. A second find- 
ing from the trend analysis is that the 
overall influence of interstimulus in- 
tervals is significant at the .01 level, 
which is confirmed by individual t
tests of the differences between the
mean RTs of Group E with and without
S2: 17 of the 18 interstimulus
intervals yield significantly longer
RTs at the following levels: 9 beyond
.001, 4 beyond .01, 4 between .01 and
.05, and 1 not significant (with a
180-msec. interval). The t tests were
based on comparisons of means of 20 

2 We wish to express our gratitude to John
Gaito for aid with the trend analysis and to
John Gaito and to John Overall for discussion
of a number of statistical issues that arose in
connection with treatment of the data.


[Figure 1: reaction time in msec. (ordinate) plotted against interval of onset of the second stimulus in msec. (abscissa).]

FIG. 1. Reaction time to a primary
stimulus when a second stimulus follows at
intervals ranging from 10 to 180 msec.


RTs by 10 Ss under the two experi- 
mental conditions using 9 df at each 
of the 18 interstimulus intervals. A 
second control, the mean RT of Group 
C which was given only S1, was even
lower than the mean RT in the control
trials of Group E when S2 was omitted
(t = 2.82, P < .01), a finding that
shows that there was some carry-over
in Group E from the S2 trials to the
trials in which only S1 was presented.
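The F ratios of the trend analysis can be recomputed from the mean squares in Table 1. The sketch below assumes that each effect was tested against the Ss × Intervals mean square (153 df); that assumption is not stated in the text, but it reproduces the F values that are legible in the table (.21, 3.703, and 3.745). Variable names are illustrative.

```python
# Mean squares as printed in Table 1.
MEAN_SQUARES = {
    "Linear": 26_332,
    "Quadratic": 74_077,
    "Cubic": 2_533,
    "Intervals": 44_653,
    "Residual": 45_169,
}

# Assumed error term: the Ss x Intervals interaction (153 df).
ERROR_MS = 12_060

f_ratios = {source: ms / ERROR_MS for source, ms in MEAN_SQUARES.items()}

for source, f_value in f_ratios.items():
    print(f"{source:<10} F = {f_value:.3f}")
```

On this assumption the linear and quadratic ratios come out near 2.18 and 6.14, which is at least consistent with the statement that only the quadratic component reaches the .05 level.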

The curve in Fig. 1 represents a 
parabola made to fit the 10-, 90-, and 
180-msec. intervals since it is at these 
times that the minimal and maximal 
inhibiting effects of S2 are found under
the conditions of this experiment. 
The RTs at 80, 100, and 130 msec. 
depart quite widely from the fitted 
curve and could undoubtedly have 
been approximated more closely if it 
had been fitted according to the least 
squares criterion which minimizes the 
total sum of squared differences be- 
tween all observed and theoretical 
points. The continuous line in Fig. 1 
is given by the equation: 


$$Y = -\frac{17}{6400}\,(X - 90)^2 + 241$$


where Y is RT and X is the interval 
between S1 and S2. Solving this
equation for 0 and 180 msec. yields
a value of 220 msec. which is higher
than the control mean of Group E
(213 msec.). However, since some Ss
still gave significantly longer RTs 


when S2 appeared 180 msec. after S1
and in view of the lower mean RT of
Group C as compared with the control
RTs of Group E, it is likely that there
is still some effect of S2 even as long as
180 msec. after the onset of S1. Taking
the curve as a whole we find that
the maximal effects are obtained from
40 to 140 msec. following S1 for it is
in this region that the parabola is
fairly flat. Variations in luminance,
hue, or other characteristics of S2
and/or S1 would undoubtedly change
the region of maximal effect. 
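The fitted parabola given above is easy to evaluate directly. The following sketch simply codes that equation, with Y the predicted RT in msec. and X the interval between the two stimuli in msec.; the function name is an illustrative choice.

```python
def fitted_rt(interval_msec):
    """Predicted RT (msec.) from the parabola through the 10-, 90-, and 180-msec. points."""
    return (-17 / 6400) * (interval_msec - 90) ** 2 + 241


for x in (0, 10, 40, 90, 140, 180):
    print(x, round(fitted_rt(x), 1))
```

The curve peaks at 241 msec. for a 90-msec. interval, gives 224 msec. at 10 msec., and about 219.5 msec. at both 0 and 180 msec., which the text rounds to 220. Between 40 and 140 msec. the predicted RT stays within about 7 msec. of the peak, which is the sense in which the parabola is fairly flat in that region.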

The inhibiting effect of S2 appears
even more clearly when RTs of
Groups E and C are plotted as a
function of trials in Fig. 2. The wide
separation between the curves for
Group E when S2 is given and Group
C for all blocks of trials attests the
strength of the inhibiting effect of S2.
Indeed, the two curves are more 
widely separated on the whole during 
the last 180 trials than during the 
first 180 trials. Analysis of variance 
showed these two blocks to be signifi- 
cantly different (P < .05). On the 
other hand, the plot of the trials in 


[Figure 2: reaction time in msec. (ordinate) plotted against trials (abscissa); curves labeled Experimental Group (10 Ss), Experimental, Control, and Control Group (5 Ss).]

FIG. 2. Decrease in RT in Groups E and
C as a result of practice. (Group C improves
to a greater extent than does Group E when
S2 is present showing that repetition does not
counteract the inhibiting effect of the interpolated
stimulus, but the RTs of Group E
when S2 is omitted, as shown by the "Experimental,
Control" curve, are identical with
those of Group C after 360 trials.)


which only S1 was given to Group E
(intermediate curve in Fig. 2) shows 
that with practice Ss in Group E were 
able to react as quickly to a single 
stimulus as were the Ss in Group C 
toward the end of the 360 trials. 
There is, therefore, a differential effect 
of practice in the single-stimulus 
condition and in the two-stimulus 
condition; the carry-over from double 
stimulation to single-stimulus condi- 
tion is counteracted by practice, but 
practice does not overcome the inhibiting
effect of S2 in the two-stimulus
condition. These findings point
to nonattitudinal processes as the
basis for the inhibiting effects of S2.
Finally, it should be noted that the
inhibiting effect of S2 is negated to
some extent because of the drop in the 
Group E curve with the two-stimulus 
condition from Trial 36 to Trial 144 
after which there seems to be no 
further practice effect. 

Individual differences in the extent 
to which Ss are affected by the inter- 
polated stimulus are striking. Group 
E divides into two subgroups, one of 
which is markedly and definitely 
affected by S, while the other, on the 
whole, is not. Overall t tests of differences
between the experimental and
control trials of the first subgroup
were all significant at or beyond the
.01 level; those for the other subgroup
were not significant. When the 180 
differences (18 intervals X 10 Ss) be- 
tween the two conditions of stimula- 
tion are tested by means of correlated 
t tests for each S, it is found that 3 Ss 
had only two significant differences 
and 1 S had only three Significant 
differences whereas the other 6 Ss had 
ELLS, ele 16, and 17 significant 
differences out of a possible 18. It
thus appears that some Ss are very 
much more affected by the inter- 
polated stimulus than are others.
Comments made by some of the Ss 




support this view. One S, for ex- 
ample, remarked again and again that 
“something was wrong with the ap- 
paratus” when the second stimulus 
appeared. These results do not imply 
that some Ss may be impervious to 
the effects of all secondary stimuli: 
Had the second stimulus been more 
intense than the first or had it been 
presented in another sense modality 
or otherwise made more impressive, it 
is highly probable (indeed certain, in 
our minds) that they would have 
given significantly longer RTs. Even 
these 4 Ss, as pointed out above, did 
have significantly longer RTs on some 
of the interpolated intervals. 


DISCUSSION


The data of this experiment seem to 
establish very clearly that a stimulus 
following the primary signal to react has 
the effect of lengthening RT according to 
a parabolic function, the effect being 
minimal 10 msec. after the onset of the 
primary stimulus, increasing to about 40 
msec., and having maximal effect from 40 
msec. to 140 msec., after which RT approaches
the single-stimulus condition
and is not statistically different from RT 
with one stimulus at the 180-msec. in- 
terval. The inhibiting effect of the 
second stimulus is found even after 360 
repetitions although learning has clearly 
occurred in the single-stimulus condition 
as shown by the decline in the RT 
X Trials curve (Fig. 2) for both Group 
C and Group E. It appears unlikely 
that higher order, attitudinal, or other 
perceptual-cognitive processes can be re- 
sponsible for the effect described here. 
Rather it seems more probable that lower 
order, automatically acting mechanisms 
not under voluntary control, are at work. 
The difficulty in explaining this phe- 
nomenon springs from the fact that a 
stimulus coming when the response is 
almost completed can exercise an in- 
hibiting effect. At first sight it might 
appear that we are dealing with a higher 
order mechanism in that 4 of the 10 Ss 


were not significantly affected by S2 but
we saw that even these Ss had signifi- 
cantly longer RTs on 2 or 3 of the 18 
intervals than on their control trials. 
We confidently expect that such Ss would 
be influenced by a second stimulus if it 
were made more intense than the first 
or if a more compelling condition were 
used than was the case in this experiment. 

An explanation, in physiological terms, 
at this time must be purely ad hoc and 
speculative. Two hypotheses, among 
the many we have considered, may be 
ventured as follows: (a) The first hy- 
pothesis involves the assumption of in- 
hibitory fibers that are faster than the 
ordinary motor fibers innervating volun- 
tary responses. Such fibers might be 
analogous to slow and fast afferent fibers 
that are known to function in mediating 
slow and fast pain or to differences in 
speed of afferent conduction associated 
with differences in fiber size. Whether
or not such fast, inhibitory efferent fibers
exist we do not know and if none have 
been discovered these data warrant a 
search for them by electrophysiological 
means. (b) The second hypothesis, while 
less suggestive of specific neurological or 
physiological mechanisms, has behav- 
ioral implications and may be stated as 
follows: Let us assume that the reflex arc 
is a total, ongoing process such that a 
disturbance in any part of it disrupts the 
ongoing activity with the result that a 
new integration is required for the activ- 
ity to be resumed or completed. On this 
basis S is set to react to a single stimulus 
and when the second stimulus appears, 
even though he has not been instructed 
to react or attend to it, nevertheless it 
breaks into the ongoing response with 
consequent lengthened RT. It is evi- 
dent that an hypothesis that explains the 
inhibiting effect of a second stimulus 
after a very short interval, e.g., 10-40
msec., may not explain its effect after a
comparatively long interval, e.g., 170
msec., or vice versa. An hypothesis 
must explain lengthened RTs found with 
all intervals used in this study. 


SUMMARY


An earlier finding that a light following a 
sound or a sound following a light after an 
interval of 75 msec. lengthened RT to the 
stimulus first presented was verified in the 
present experiment in which the stimuli were 
both visual. RT to S1 was significantly increased
when S2 followed S1 at intervals
ranging from 10 to 170 msec. with the maximum
effect occurring from 40 to 140 msec.
Trend analysis showed the quadratic component
in the Intervals × RT data to be
significant. While RT with S2 present decreases
with practice, the greater improvement
of control Ss as compared with experimental
Ss shows that the inhibitory effect of
S2 on S1 is not completely negated by 360
repetitions. Four of the 10 experimental Ss 
failed to give significantly lengthened RTs 
over all 18 intervals although they did have 
significantly higher RTs at some of the in- 
tervals. While the individual differences and 
results of practice seem to argue in favor of 
attitudinal factors as responsible for the 
effect, other facts argue against this explana- 
tion. Two hypotheses were discussed but 
neither seems completely satisfactory to ex- 
plain all the facts. 


REFERENCES 


Grant, D. A. Analysis-of-variance tests in 
the analysis and comparison of curves. 
Psychol. Bull., 1956, 53, 141-154. 

Teichner, W. H. Recent studies of simple
reaction time. Psychol. Bull., 1954, 51, 
128-149. 

Todd, J. W. Reaction to multiple stimuli.
Arch. Psychol., N. Y., 1912, Whole No. 25. 


(Received July 27, 1961) 


Journal of Experimental Psychology 
1962, Vol. 64, No. 3, 206-214 


RESPONSE SUPPRESSION IN PERCEPTUAL DEFENSE 1


ROBERT B. ZAJONC 
University of Michigan


Recent theorizing maintains that 
the phenomenon of perceptual defense 
can be accounted for in terms of 
response processes. Elevated thresh- 
olds to taboo words are now generally 
regarded as reflecting a response bias 
deriving from either previous experi- 
ence (Goldiamond & Hawkins, 1958), 
set (Postman, Bronson, & Gropper, 
1953), or conflict (Brown, 1961), 
rather than a defensive perceptual 
blocking. Although the core of the 
issue deals with the relative contribu- 
tions of the stimulus and response 
to the perceptual defense effect, 
studies attempting to evaluate such 
relative contributions have been rather 
few (Matthews & Wertheimer, 1958; 
Neisser, 1954). It is the purpose of 
this experiment to determine the 
extent to which both recognition 
threshold and the galvanic skin response
(GSR) are influenced by the
stimulus and to what extent by the 
response, using a procedure first 
suggested by Garner, Hake, and
Eriksen (1956). First, threshold and
GSR data for a set of taboo and neu- 
tral words were obtained by means 
of standard methods. Secondly, a 
paired-associate list was constructed 
using the previously exposed words 
as stimulus terms and a new set of 
taboo and neutral words as response 
terms. Some taboo stimuli were 
paired with taboo, others with neutral 
response terms. Neutral stimuli too, 


1 This work was supported by Grants 
AF 49(638)-367 and G-4951 from the Air
Force Office of Scientific Research and the 
National Science Foundation. I wish to 
thank James Taylor for his assistance in this 
study, and Dorwin Cartwright and Arthur 
Platz for reading the manuscript. 


were sometimes paired with neutral 
and sometimes with taboo response 
terms. All Ss learned the paired-associate
list to a criterion. The third
step consisted of a repeated threshold 
and GSR assessment of the original 
stimuli. Now, however, one group 
was required to indicate recognition 
as before, i.e., by reading out loud 
the word presented tachistoscopically, 
and another by saying the appropriate 
response term which they had learned
in the previous paired-associate task. 
Thus, the second group was given an 
opportunity to indicate recognition 
by means of responses whose emo- 
tional significance was either positively 
or negatively correlated with the 
emotional significance of the stimulus. 


METHOD 


Subjects.—Forty male Ss, all enrolled at
the University of Michigan, participated 
in the experiment. They were randomly 
assigned to two experimental groups con- 
sisting of 20 Ss each. The Ss were paid $1.25 
per hour for participation in the experiment. 

Apparatus.—Gerbrands' transparent mir- 
ror tachistoscope with an instant start 
fluorescent lamp circuit was employed to 
present stimuli. Skin resistance changes were
observed by means of a Lafayette psycho- 
galvanometer Model 603-A. 

Materials.—Stimulus words were printed 
in black 2-in. block letters and presented in 
the center of the exposure field on gray 10 × 12
in. cards (54.5% reflectance). Stimulus-
response pairs in the paired-associate task
were shown in the same manner. Twelve
taboo and 12 neutral words were selected
from McGinnies' (1949) original list of 18
words to which equivalent neutral and taboo
stimuli were added. Half of the taboo and
half of the neutral words were used as stimuli 
in the threshold assessment and as stimulus 
terms in the paired-associate training task. 
The remainder of the list was used as response 




terms in the paired-associate task. Three 
taboo stimuli were paired with three taboo 
responses, three taboo stimuli were paired
with three neutral responses, three neutral 
stimuli were paired with three taboo re- 
sponses, and three neutral stimuli with three 
neutral responses. These sets of words will 
be referred to as the TT, TN, NT, and NN 
sets. The 24 words were APPLE, BROOM, 
CANDY, CHAIR, CHILD, FLOOR, MUSIC, RAINS, 
RIVER, SHELF, STOVE, TRADE, BALLS, BELLY, 
BLEED, FAIRY, FILTH, HYMEN, KOTEX, PENIS, 
PUBIC, RAPED, VOMIT, WHORE. 

Procedure.—As briefly outlined above, the 
procedure consisted of two recognition thresh- 
old and GSR assessment sessions separated 
by an intervening paired-associate learning 
task. The Ss were divided into two groups 
of 20 Ss each, one of which was required 
during the second threshold assessment 
session to indicate recognition in terms of the 
stimuli presented (Group S), the other in 
terms of the response terms paired with the 
stimuli (Group R). 

Thresholds were obtained by the ascending 
method of limits in .01-sec. steps beginning 
with .05 sec. below S's threshold to a neutral 
training word. The intertrial intervals were 
approximately 30 sec. Some of the words 
were shown more than once in order to 
eliminate prerecognition guesses during the 


analysis. The criterion of threshold was the 
first correct recognition of the word in Group S 
and the first emission of the correct response 
term in Group R. 
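The ascending method of limits described above can be sketched in a few lines. Exposure durations are kept in hundredths of a second so that the .01-sec. steps are exact integers. The function name, the `recognizes` callback standing in for S's response, and the upper limit on exposure are all hypothetical; repeated presentations of the same word, which the procedure also used, are not modeled.

```python
def ascending_threshold(start_csec, recognizes, step_csec=1, max_csec=100):
    """Return the first exposure duration (in .01-sec. units) at which the word is reported.

    start_csec: starting exposure, e.g. 5 units below the training-word threshold.
    recognizes: callable taking a duration and returning True on a correct report.
    """
    duration = start_csec
    while duration <= max_csec:
        if recognizes(duration):
            return duration
        duration += step_csec  # lengthen the exposure by one .01-sec. step
    return None  # no correct report within the allowed range


# Hypothetical example: training-word threshold of .20 sec., so the series
# starts at .15 sec.; the simulated S first reports the word at .22 sec.
training_threshold_csec = 20
threshold = ascending_threshold(training_threshold_csec - 5, lambda d: d >= 22)
print(threshold / 100)
```

For Group R the same loop would apply, with a correct report defined as the first emission of the paired response term rather than the stimulus word.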

The S was seated with his head against 
the eyepieces and with his hand to which 
electrodes were affixed lying relaxed on the 
table. A rest period of 1 min. was given after 
the first threshold assessment session and 
after the paired-associate training. 

The tachistoscope was operated by an 
adult male and the psychogalvanometer by 
an adult female. The GSR readings were 
taken in terms of reduction in resistance 
from the basal resistance level, which was 
adjusted for each stimulus exposure. Only
those reactions which occurred within 5 sec. 
following stimulus exposure, and only those 
for which the resistance returned to the 
immediate neighborhood of the pre-exposure 
level were recorded. GSR readings were 
taken on every presentation of the stimulus 
word. Since some Ss recognized the word
on the fourth exposure, only two prerecognition
trials and the recognition trial were
considered. Thus, for each S three GSR
scores were computed for each set of stimuli,
and for the purposes of analysis all were
converted to standard scores with a mean of
50 and SD of 10 for all 40 Ss.


TABLE 1

MEAN RECOGNITION THRESHOLDS (SEC.)
BEFORE PAIRED-ASSOCIATE TRAINING

         Later PA Conditions          Words
Group    TT     TN     NT     NN      Taboo   Neutral
S        .219   .228   .193   .198    .223    .196
R        .234   .233   .206   .213    .234    .210
Both     .227   .231   .200   .206    .229    .203
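The conversion of GSR readings to standard scores with a mean of 50 and an SD of 10 can be illustrated as below. The report does not say whether the SD was computed with N or N - 1 in the denominator; the sketch assumes the population form (N), and the readings in the example are invented purely for illustration.

```python
from statistics import mean, pstdev


def to_standard_scores(raw_scores, target_mean=50.0, target_sd=10.0):
    """Linearly rescale raw scores so that they have the target mean and SD."""
    m = mean(raw_scores)
    sd = pstdev(raw_scores)  # assumed: population SD over all pooled scores
    if sd == 0:
        raise ValueError("all scores are identical; standard scores are undefined")
    return [target_mean + target_sd * (x - m) / sd for x in raw_scores]


# Hypothetical resistance reductions pooled over Ss (arbitrary units).
raw = [1.0, 2.0, 3.0, 4.0, 5.0]
scores = to_standard_scores(raw)
print([round(s, 1) for s in scores])
```

A rescaling of this kind leaves the ordering and relative spacing of the readings unchanged and only puts all 40 Ss on a common scale.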


The paired-associate task was conducted
using a 2-sec. interval for the presentation
of the stimulus and a 2-sec. interval for the
presentation of the pair, with 20 sec. between
trials. All terms were presented tachistoscopically.
The order of the stimuli was
randomly altered from trial to trial. Three
consecutive correct anticipations of the
entire list were used as the criterion.


RESULTS 


Recognition Threshold and GSR before
Paired-Associate Training


Mean recognition thresholds obtained
before paired-associate training
are shown in Table 1. The analysis
of variance for these results showed 
that the only significant effect is due 
to the difference between taboo and 
neutral words (F = 17.08, P < .001). 
Although the mean recognition thresh- 
olds for Group R are somewhat higher 
than those for Group S, this difference 
is not significant. Also, no significant 
differences were obtained between 
taboo words to be later used with 
taboo responses (TT) and taboo 
words to be later used with neutral 
responses (TN). Nor was there any 
difference between neutral words to 
be later used with taboo responses 
(NT) and neutral words to be later 
used with neutral responses (NN). 


[Figure 1: GSR plotted against trials before recognition for TT, TN, NT, and NN stimuli; legible panel titles: R-Group, Both Groups.]

FIG. 1. GSR before paired-associate training.


The GSRs are shown in Fig. 1.
Again no difference between the
experimental groups was found. It
is evident from the results that on
all trials taboo words exceed neutral
words in GSR (F = 30.99, P < .001).
It is also clear that there is a considerable
rise in the GSR on the recognition
trial (F = 32.94, P < .001). No
significant differences between TT
and TN words as well as between NN
and NT words were found for either
of the two groups.


Paired-Associate Learning


The average numbers of trials to learn
the four sets of associations are
presented in Table 2. The means
represent the number of trials which
Ss required to learn a given association
to a criterion of three correct
anticipations, averaged for the three
items in each set. Shown in Table 2
is also the average number of errors
for each set of pairs. The results
indicate that the four sets of associations
were not learned at the same
rate (see Table 3). In particular,
the TN pairs seem to be the most
difficult, and the TT easiest. The
analysis of variance presented in
Table 3 shows a significant effect due
to differences between word sets,


which is primarily due to the type of 
response. In general, pairs with a 
taboo response require fewer trials 
and lead to fewer errors than pairs 
with neutral responses, Of particular 
importance to the present experiment 
is the difference between the TN and 
NT pairs. If speed of learning and 
number of errors reflect the degree 
to which a given response has become 
attached to the stimulus word, the 
TN stimuli should, during the sub- 
sequent threshold and GSR assess- 
ment of Group R, be more handi- 
capped than NT stimuli. The dif- 
ferences between these pairs in both 
trials to criterion and average number 


of errors are Significant at the .001 
level. 


Recognition Threshold and GSR after
Paired-Associate Training


Group R.—Table 4 shows recogni- 
tion thresholds for the four sets of 
words for the condition in which Ss 
indicated recognition by means of 
the response term acquired during 
the paired-associate training. It is 
apparent that, compared with those 
obtained before the paired-associate 
training, the thresholds to all the 
words are considerably reduced. It is 




RESPONSE SUPPRESSION IN PERCEPTUAL DEFENSE 209 


TABLE 2

MEAN TRIALS AND ERRORS TO CRITERION IN PAIRED-ASSOCIATE LEARNING

                       Pairs                            Words
Group       TT     TN     NT     NN     Taboo     Neutral   Taboo       Neutral
                                        Stimuli   Stimuli   Responses   Responses
Group S
  Trials   4.56   5.62   4.37   5.13    5.09      4.75      4.47        5.38
  Errors   4.55   7.10   3.70   6.10    5.83      4.90      4.13        6.60
Group R
  Trials   4.21   4.95   4.48   4.28    4.58      4.38      4.35        4.62
  Errors   2.90   5.00   3.80   3.55    3.95      3.68      3.35        4.28
Both
  Trials   4.39   5.29   4.43   4.71    4.84      4.57      4.41        5.00
  Errors   3.73   6.05   3.75   4.83    4.89      4.29      3.74        5.44


also clear that no longer does the recognition threshold totally
depend on the stimulus. There is a considerable effect due to the
response which S utilizes in indicating recognition. It should be
pointed out that S's ability to give evidence of recognition, not
by means of a word which is presented but by means of a response
previously learned, depends on the degree to which these responses
were fixated. It will be recalled that the learning of the four
types of associations was not uniform. In particular there was a
considerable difference between the TN and the NT pairs, in favor
of the latter. Moreover, the examination of the results on
paired-associate learning disclosed a significant effect due to
individual differences. The F ratios evaluating the individual
difference effect were 8.79 for trials to criterion, and 11.41 for
errors, which for the degrees of free-


TABLE 3
ANALYSIS OF VARIANCE FOR DATA IN TABLE 2

                               Trials to Criterion          Errors
Source                  df       MS           F           MS          F
Treatments (A)           1      8.06         .83         96.09    [illegible]
Words                    3      6.69     [illegible]     48.54      6.23***
  S component (B1)       1   [illegible]    3.27         14.40      2.68
  R component (B2)       1     13.40        8.93**      115.60    [illegible]
  B1 X B2                1   [illegible]    4.12         15.62      1.92
Treatments X Words       3      1.78        1.60         13.45      1.73
  A X B1                 1       .24         .25          4.23       .79
  A X B2                 1      3.93        2.62         24.03      2.44
  A X B1 X B2            1      1.18        1.37         12.10      1.49
Error (b)               38      9.76                     88.91
Error (w)              114      1.11                      7.79
  Ss X B1               38       .95                      5.38
  Ss X B2               38      1.50                      9.85
  Ss X B1 X B2          38       .86                      8.13

** P = .01.
*** P = .001.




TABLE 4
MEAN RECOGNITION THRESHOLDS (SEC.) IN GROUP R AFTER PAIRED-ASSOCIATE LEARNING

                          PA Pairs                        Words
Ss                  TT     TN     NT     NN     Taboo     Neutral   Taboo       Neutral
                                                Stimuli   Stimuli   Responses   Responses
Rapid learners     .151   .138   .145   .138    .145      .142      .148        .138
Slow learners      .178   .180   .174   .172    .179      .173      .176        .176
All Ss             .165   .159   .160   .155    .162      .158      .163        .157
Adjusted means
  for all Ss       .168   .153   .160   .157    .161      .159      .164        .155


dom given are significant well beyond 
the .001 level. We would expect a 
more reliable test of the relative 
contributions of the stimulus shown 
and of the response given from Ss 
who learned these responses well. 
Group R was therefore divided at the 
median number of trials to criterion, 
and the recognition thresholds for 
the rapid and slow learners are shown 
in Table 4, and the analysis of vari- 
ance in Table 5. It is clear from 
Table 4 that slow learners manifest 
considerably higher recognition thresh- 
olds for all the words. The difference 
between groups is significant at better 


TABLE 5
ANALYSIS OF VARIANCE FOR DATA IN TABLE 4

Source                           df       MS        F
Groups (Rapid vs. Slow) (A)       1     24,945     6.72*
Words                             3        338     2.54
  S component (B1)                1        466     2.13
  R component (B2)                1        546     5.00*
  B1 X B2                         1          1    <1.00
Groups X Words                    3        131    <1.00
  A X B1                          1         35    <1.00
  A X B2                          1        536     4.91*
  A X B1 X B2                     1        122     1.67
Error (b)                        18      3,711
Error (w)                        54        133
  Ss X B1                        18        218
  Ss X B2                        18        109
  Ss X B1 X B2                   18         73

* P = .05.


than the .05 level. It appears that 
the slow learner’s recognition thresh- 
old depends primarily on the type 
of stimulus presented, while that 
of rapid learners on the response 
which they were required to make. 
However, the Groups X Stimulus X Response
interaction was not significant.
The overall results, however, indicate 
that the effects due to the stimulus 
component were not significant while 
those due to the response were signifi- 
cant. Further support for the con- 
clusion that recognition threshold 
depends primarily on the type of 
response required is obtained when 
the data are adjusted for differences 
in learning the four types of associa- 
tions. The mean recognition thresh- 
olds, adjusted by means of the regres- 
sion equation relating the former 
to the number of trials, are shown 
at the bottom of Table 4. Analysis 
of covariance performed on these re- 
sults disclosed a significant effect due 
to the response component (F = 8.78,
df = 1/17) and no effects due to
stimulus.

The GSR data shown in Fig. 2 
follow a similar pattern. Again, as 
compared with the results obtained 
before paired-associate training, the 
GSRs are weaker. The analysis of 
variance (Table 6) shows a significant 
effect due to the differences between 




[Figure 2. Panels: rapid learners; slow learners; entire R-group. Curves: TT, TN, NN, and NT stimuli. Ordinate: GSR in standard scores; abscissa: trials before recognition.]

Fig. 2. GSR after paired-associate training (Group R).


words which seems to be a function
of the stimulus and of the response
component as well. The results of
both groups combined indicate that
on the second prerecognition trial the
GSRs do not follow any particular


TABLE 6
ANALYSES OF VARIANCE FOR DATA IN FIG. 2 AND 3

                                             Group R               Group S
Source                             df      MS         F          MS        F
Groups (Rapid vs. Slow) (A)         1     522.4      1.19       164.7    <1.00
Words                               3     171.1      6.35**      78.6     3.48*
  S component (B1)                  1     149.6      7.03*      227.0     7.73*
  R component (B2)                  1     356.5     10.57**       2.5    <1.00
  B1 X B2                           1       7.3     <1.00         6.3    <1.00
Trials (C)                          2   2,235.6     28.97***    942.5    12.57***
Groups X Words                      3       9.6     <1.00        26.1     1.16
  A X B1                            1   [illegible] <1.00        16.9    <1.00
  A X B2                            1      27.8     <1.00        51.4     1.92
  A X B1 X B2                       1       .38     <1.00        10.1    <1.00
Words X Trials                      6     136.7      5.44***     38.6     3.08**
  B1 X C                            2      87.8      4.27*       79.0     4.71*
  B2 X C                            2     319.6     12.47***      6.6    <1.00
  B1 X B2 X C                       2       2.8     <1.00        30.4     4.94*
Groups X Trials                     2      25.3     <1.00        39.4    <1.00
Groups X Words X Trials             6      32.6      1.30        22.2     1.77
  A X B1 X C                        2      77.5      3.71*       36.7     2.19
  A X B2 X C                        2       5.7     <1.00         4.8    <1.00
  A X B1 X B2 X C                   2      14.7     <1.00        25.1     4.09*
Error (b)                          18     439.1                 441.1
Error (w)1: Ss X Words             54      27.0                  22.6
  Ss X B1                          18      21.3                  29.4
  Ss X B2                          18      33.7                  26.8
  Ss X B1 X B2                     18      25.9                  11.6
Error (w)2: Ss X Trials            36      77.2                  75.0
Error (w)3: Ss X Words X Trials   108      25.1                  12.6
  Ss X B1 X C                      36      20.6                  16.8
  Ss X B2 X C                      36      25.6                  14.7
  Ss X B1 X B2 X C                 36      29.2                   6.2




TABLE 7
MEAN RECOGNITION THRESHOLDS (SEC.) IN GROUP S AFTER PAIRED-ASSOCIATE TRAINING

                          PA Pairs                        Words
Ss                  TT     TN     NT     NN     Taboo     Neutral   Taboo       Neutral
                                                Stimuli   Stimuli   Responses   Responses
Rapid learners     .153   .151   .143   .141    .152      .142      .148        .146
Slow learners      .167   .175   .154   .162    .171      .158      .160        .168
All Ss             .160   .163   .149   .152    .161      .150      .154        .157


pattern. However, the curves for the rapid learners show a pattern
of particular interest. On the second prerecognition trial the GSRs
seem to depend primarily on the stimulus component; their order is
TN, TT, NN, and NT. As the Ss approach recognition the stimulus
effect is gradually replaced by the response effect and the GSRs
are ordered according to the response. One may interpret this
result to mean that stimulation present two trials before
recognition is probably too weak to call out strong anticipatory
partial responses. As soon as the stimulation gains in strength and
becomes capable of evoking some parts of the learned response, the
autonomic reactions lose their dependence upon the stimulus and
begin to be dominated by the response component.


TABLE 8
ANALYSIS OF VARIANCE FOR DATA IN TABLE 7

Source                                    df       MS        F
Groups (Rapid vs. Slow Learners) (A)       1     10,160     3.12
Words                                      3        896     3.03
  S component (B1)                         1      2,532     5.34*
  R component (B2)                         1        157    <1.00
  B1 X B2                                  1          0    <1.00
Groups X Words                             3        150    <1.00
  A X B1                                   1        131    <1.00
  A X B2                                   1        419     1.68
  A X B1 X B2                              1          1    <1.00
Error (b)                                 18      3,088
Error (w)                                 54        296
  Ss X B1                                 18        474
  Ss X B2                                 18        250
  Ss X B1 X B2                            18        164

* P = .05.

The mean GSR reaction for rapid 
learners was 47.55 and for slow
learners 50.53, but as is evident from 
Table 6 this difference was not 
significant. 

Group S.—The principal purpose 
of the paired-associate learning task 
was to enable Ss to give evidence of 
recognition of the stimulus words 
without having to say them. How- 
ever, it is possible to argue that the 
training simultaneously produced tem- 
porary changes in the emotional 
quality of the stimulus words. Thus,
taboo stimuli which were paired with 
neutral responses could, by virtue of 
the repeatedly reinforced association, 
have become emotionally “neutralized.”
Similarly, conditioning a taboo
response to a neutral stimulus
word might have affected the emo- 
tional quality of the latter. These 
eventualities are of course quite 
remote because of the small number 
of conditioning trials involved. If 
conditioning of the type suggested 
has in fact taken place then the recog- 
nition thresholds and the GSR data 
should show the same patterns in 
Groups S and R. The average recog- 
nition thresholds for Group S are 




effects only due to the stimulus 
component. It is of interest to note 
that as was the case in Group R slow 
learners in Group S also showed 
somewhat higher recognition thresh- 
olds than rapid learners. However, 
this difference failed to reach an 
acceptable level of significance. 

Neither do the GSR results shown 
in Fig. 3 suggest any conditioning 
effect. Besides the increase in reac- 
tions over trials, the only significant 
effect is that due to the stimulus 
component. The analysis of variance 
in Table 8 shows an F ratio signifi- 
cant at the .05 level for the stimulus 
component. On the trials immediately 
preceding recognition there is a slight 
but not significant response effect 
for rapid learners. Also, as observed 
before, the GSRs of rapid learners 
are somewhat less than those of slow 
learners (45.69 and 47.16, respec- 
tively), but this difference is decidedly 
not reliable. 


Discussion 


The evidence presented failed to dis- 
close perceptual effects of any signifi- 
cance. The recognition threshold was 
found to be a function not of what S
saw but of what he had to say. Moreover,
GSR data follow an identical pattern. 


The GSRs were found to be produced 
not by the stimulus alone, but depended 
primarily on the response required of S. 
The results are best accounted for by 
Brown’s (1961) competing response 
theory. Irrespective of the stimulus, 
if the responses were in conflict with an 
inhibitory tendency, that is, if S had to 
make a vulgar response, both recogni- 
tion threshold and GSR were elevated. 
Stimuli arousing no response conflict 
failed to produce differential thresholds 
and GSRs irrespective of their “emo- 
tionality.” Further support for the 
response competition hypothesis is seen 
in the GSR data. In general, the dif- 
ferences in the GSRs were found to 
increase over trials, reaching their peak 
upon recognition. To the extent that 
the GSRs reflect response conflict, one 
would expect that with increasing exposure
time both the positive and the
negative tendencies increase, thus gen- 
erating a stronger conflict. 

There is evidence in the data that recognition threshold and GSR
are also subject to variation as a result of not only a conflict
between a positive and negative tendency, but also as a result of a
conflict between competing excitatory tendencies. First we note
that both are markedly reduced after familiarization with the
stimuli. Before paired-associate learning the response alternatives
available to Ss are many, and all of these are in competition. The
training reduces them to 12, thus reducing the extent of response
competition involved. Secondly, consistent differences in the
overall recognition threshold and GSR reactions between the rapid
and slow learner were found. If one views the speed of the
paired-associate learning and the mean number of errors as an index
of the amount of response competition present, these results become
quite meaningful.

[Figure 3. Panels: rapid learners; slow learners; entire S-group. Curves: TT, TN, NT, and NN stimuli. Ordinate: GSR in standard scores; abscissa: trials before recognition.]

Fig. 3. GSR after paired-associate training (Group S).

It is not claimed here that the per- 
ceptual defense phenomenon has been 
disproven. But if the phenomenon is 
empirically demonstrable its proof must 
be established by experimental methods 
other than those commonly used. Per- 
haps Blum’s (1954) forced-choice tech- 
nique of threshold assessment holds 
best promise since it eliminates possible 
effects due to the response process. 


SUMMARY 


The role of stimuli and responses in per- 
ceptual defense was examined by first ob- 
taining recognition thresholds and GSRs to 
taboo and neutral words. Subsequently, 
Ss learned a paired-associate list with the 
original words serving as stimulus terms and 
a new set of words as response terms. Half
of the neutral stimuli were paired with neutral 
and half with taboo responses. The same 
was true of taboo stimuli. Following training, 
recognition thresholds and GSRs were again 
measured with one group required to indicate 
recognition by means of response terms and 


another by means of stimulus terms. Both 
recognition threshold and GSR were found 
to depend primarily on the response required 
of the Ss in indicating recognition.


REFERENCES 


Blum, G. S. An experimental reunion of
psychoanalytic theory with perceptual
vigilance and defense. J. abnorm. soc.
Psychol., 1954, 49, 94-99.

Brown, J. S. The motivation of behavior.
New York: McGraw-Hill, 1961.

Garner, W. R., Hake, H. W., & Eriksen,
C. W. Operationism and the concept of
perception. Psychol. Rev., 1956, 63, 149-159.

Goldiamond, I., & Hawkins, W. F. Vexierversuch:
The log relationship between
word frequency and recognition obtained
in the absence of stimulus words. J. exp.
Psychol., 1958, 56, 457-463.

Mathews, A., & Wertheimer, M. A
“pure” measure of perceptual defense
uncontaminated by response suppression.
J. abnorm. soc. Psychol., 1958, 57, 373-376.

McGinnies, E. Emotionality and perceptual
defense. Psychol. Rev., 1949, 56,
244-251.

Neisser, U. An experimental distinction
between perceptual process and verbal
response. J. exp. Psychol., 1954, 47, 399-402.

Postman, L., Bronson, W. C., & Gropper,
G. L. Is there a mechanism of perceptual
defense? J. abnorm. soc. Psychol., 1953,
48, 215-224.


(Early publication received January 5, 1962) 


Journal of Experimental Psychology
1962, Vol. 64, No. 3, 215-226


FACTORS IN THE RETENTION AND RELEARNING
OF PERCEPTUAL-MOTOR SKILL ¹


EDWIN A. FLEISHMAN
Yale University

AND JAMES F. PARKER, JR.
Psychological Research Associates, Arlington, Virginia


In several previous reports, Parker 
and Fleishman have described studies 
of complex tracking performance. 
The first of these (Parker & Fleish- 
man, 1959, 1960) attempted to predict 
performance at different stages of 
learning a complex tracking skill. 
Special efforts were made to predict 
high levels of proficiency after exten- 
sive practice (17 sessions distributed 
over 6 weeks) with this task. The
second study (Parker & Fleishman,
1961) made use of information about 
the components of tracking skill 
to facilitate the learning of this skill. 
The present study is an investigation 
of factors in the retention and re- 
learning of this same skill. 

Previous studies of motor skill 
retention (e.g. Ammons, Farr, Bloch, 
Neumann, Dey, Marion, & Ammons, 
1958; Battig, Nagel, Voss, & Brogden, 
1957; Bell, 1950; Jahnke, 1958; Jones 
& Bilodeau, 1953; Leavitt & Schlos- 
berg, 1944; Mengelkoch, Adams, & 
Gainer, 1958; Reynolds & Bilodeau, 
1952) all present evidence that contin- 
uous control, perceptual-motor skills 
are well retained over fairly long 
periods of no practice. What loss oc- 
curs appears to be quickly regained. 
The present study is a more compre- 
hensive study of factors in retention 
and relearning, using a highly com- 
plex continuous control task requir- 
ing considerable practice for initial 
learning. 

1 This study was performed under Con- 
tract Nonr 3065 (00) between Psychological 
Research Associates, Incorporated, and the 
Office of Naval Research. 


While essentially a laboratory study,
the task was designed to simulate
a complex skill, i.e., that of a pilot
flying a radar intercept mission. The 
problem of retention over extended 
periods without practice is especially 
critical here. Furthermore, where 
the skills are of such complexity, 
the problem of finding the optimum 
conditions for retraining becomes 
even more critical. 

Specifically, the following questions 
were investigated. How well is such 
a skill retained without practice? 
What is the relation between the 
length of the “no practice” interval 
and level of retention? If there is a
loss in proficiency, how much practice 
is required to regain proficiency? 
What is the relation between reten- 
tion and level of proficiency after the 
original learning? Is the type of 
initial training related to retention? 
What is the relative effectiveness of 
a distributed vs. massed retraining 
schedule? Does the type of retraining 
schedule affect later performance as 
well as performance during retraining? 


METHOD 


Task.—The criterion task consisted of a 
tracking device constructed so as to simulate 
roughly the display characteristics and con- 
trol requirements of an air-borne radar inter- 
cept mission. The task of S was to maintain 
the target dot at the center of the oscillograph 
display, while at the same time nulling a 
sideslip indicator. That is, S envisioned himself
to be flying the attack phase of an airborne
radar intercept mission. Thus, if the
target was to the right, S made appropriate 
control movements to steer the craft to the 
right. These movements would bring him 




on target and the dot would return to the 
center i 


and rudder controls, 

Three identical tracking devices were constructed
especially for purposes of this study.
Photographs and complete schematics of all
components are presented elsewhere (Parker
& Fleishman, 1959). These devices and
related scoring consoles allowed for the testing
of from 1 to 3 Ss simultaneously under the
control of a single test administrator.

The S’s instrument panel contained two 
i The first consisted of a target dot 


In. Zero-center voltmeter termed a “side. 
slip indicator,” 


eft and a consequent left deflection on the 
sideslip indicator, 


Control of the 


u h ir was directly 
proportional to stick displacement, 


EDWIN A. FLEISHMAN AND JAMES F, PARKER, JR. 


The time constant of this lag network is 1 sec.,
i.e., it requires 1 sec. to achieve approximately
two-thirds (1 - 1/e) of the final signal resulting
from a given stick displacement.
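
The two-thirds figure follows from the step response of a first-order exponential lag. The sketch below is illustrative only: it assumes the lag network behaves as an ideal first-order system, and the function name and unit step input are hypothetical, not taken from the apparatus description.

```python
import math


def lag_step_response(t, time_constant=1.0, final_value=1.0):
    """Output of an ideal first-order exponential lag t seconds after a step input.

    ASSUMPTION: the network is treated as an ideal first-order lag; the
    actual circuit is not specified in the text.
    """
    return final_value * (1.0 - math.exp(-t / time_constant))


# Fraction of the final signal reached after one time constant (1 sec.)
fraction_at_tau = lag_step_response(1.0)
print(round(fraction_at_tau, 3))  # 0.632
```

After one time constant the output has reached 1 - 1/e of its final value, which is the "approximately two-thirds" quoted above.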

Control of the target dot in azimuth 
(envisioned as turning the aircraft) and 
centering the sideslip indicator (coordination 
display) were both affected 
displacement as well as by the control stick, 
This rudder control of the sideslip indicator 
involves 


through two exponential lag networks, 
Movement of 


Scoring.—The primary score was the
integrated absolute error score. This was
recorded at the conclusion of every trial and
was produced by summing algebraically the
three absolute error part scores in accordance
with this relationship: T=1/2X+1/2Y+2Z, 
where T = integrated absolute error score, 
X = absolute azimuth error, Y = absolute 


1959, 1960) 


This group 
of Ss will be referred to as Group I, 

In the second study (Parker & Fleishman, 
l ed to as Group II, spent 
an identical period of time mastering the 
These Ss were adminis- 


at different stages 
The second study 
training program. 
the present study, was the 
that experimental training 


RETENTION OF PERCEPTUAL-MOTOR SKILL 


followed by critiques with each S after Ses- 
sions 7, 11, and 15. As would be expected, 
terminal proficiency for the group trained 
under this program was significantly superior 
to that of the group which had no formal 
training. 

Retention testing—Seven groups of 10 Ss 
were brought back for retraining following 
various intervals of no practice. Intervals 
since training for Group II were 1, 5, 9, and 
14 mo. Intervals since training for Group I 
were 9, 14, and 24 mo. The fact that it was 
possible to study the 9- and 14-mo. intervals 
for both groups allows a comparison of 
retention for the same intervals for two types 
of original training. 

Each retention group was split into two 
subgroups of 5 Ss each. One subgroup was 
retrained during four intensive, continuous 
retraining sessions³ during the same day.
The other subgroup was retrained during 
four sessions, each scheduled 1 day apart. 
The purpose of this experimental breakdown 
was to allow an evaluation of the relative
effectiveness of these two retraining schedules 
where one involved massed and the other 
distributed practice. 

One week following the 
session all Ss were again tested for one addi- 
tional session. The purpose of this additional 
testing was to allow a more adequate evalua- 
tion of the two types of retraining. This 
fifth retraining session was included to show 
whether any differences which might be 
found occurred only during the retraining 
program or whether these differences persisted
in later performance. If transfer to
later performance could be demonstrated, 
this could be attributed to differential learning 
during the course of retraining rather than 
to temporary performance factors (e.g.,
fatigue, inhibition) during the retraining.

Matching of retention groups.—It will be 
recalled that the seven retention groups were 
drawn from two groups of original trainees. 
One group (Group I) had been trained without
benefit of specific guidance while the
other group (Group II) had been trained 
with supplementary instruction and guidance. 
Accordingly, it was found (Parker & Fleish- 
man, 1961) that Group II was superior in 
terminal proficiency although the number 
of practice sessions was identical for each 
group. For the present retention study an 
attempt was made to match the different 
retention samples on the basis of final per- 
formance level during original learning. This 
was done separately for the three retention 
samples drawn from Group I and for the
four retention samples drawn from Group II.


3 As before, a “session” includes 21 1-min.
trials.


TABLE 1
FINAL PERFORMANCE LEVELS OF THE SEVEN RETENTION SAMPLES AFTER
INITIAL LEARNING

Retention           Group I                  Group II
Interval       No Formal Training         Formal Guidance
(Months)       N    Mean a    SD          N    Mean a    SD
    1                                    10     246      82
    5                                    10     232      57
    9          9     294     119          9     232      59
   14          7     263      88          9     230      71
   24          8     260     117

a Integrated absolute error score.


Scores (integrated error) attained by each 
S during the final session of original training 
were used as a basis for matching. These 
were converted to standard scores (stanines). 
An attempt was made to assign a proportion- 
ate number from each stanine level to each 
retention group in order to obtain a normal 
distribution representative of the original 
learning population. It soon became ap- 
parent that certain Ss needed to fulfill these 
requirements were not available for retest- 
ing. However, this procedure was followed 
as closely as possible. A preliminary analysis 
of variance for the first four samples tested 
indicated that they were not homogeneous. 
An adjustment was made by eliminating a 
few Ss whose final scores during the original 
training were extremely poor. This left a 
total of 62 Ss in the seven retention samples. 
Table 1 indicates the number of Ss in each 
retention group and the means and SDs of 
their final scores after initial learning. It can
be seen that within each of the original learn- 
ing groups adequate matching was achieved. 
Analyses of variance performed for each origi- 
nal training group confirmed that the reten- 
tion samples could be considered comparable 
(Group I: F = .22, df = 2/21; Group II:
F = .09, df = 3/34).
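
The stanine conversion used for matching can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' procedure: it assigns stanines from z-scores with the conventional half-standard-deviation bands, and the score list is hypothetical. Because the measure is an error score, a low stanine here corresponds to good performance.

```python
from statistics import mean, pstdev

# Upper z-score boundaries of stanines 1 through 8; anything above is stanine 9.
STANINE_CUTS = [-1.75, -1.25, -0.75, -0.25, 0.25, 0.75, 1.25, 1.75]


def to_stanines(scores):
    """Convert raw scores to stanines (1-9) via z-scores within the sample."""
    m, sd = mean(scores), pstdev(scores)
    stanines = []
    for x in scores:
        z = (x - m) / sd
        # Count how many band boundaries the z-score exceeds.
        stanines.append(1 + sum(z > cut for cut in STANINE_CUTS))
    return stanines


# Hypothetical final-session error scores, for illustration only.
example_scores = [180, 210, 232, 246, 260, 263, 294, 350]
print(to_stanines(example_scores))
```

Drawing a proportionate number of Ss from each stanine level, as described above, would then amount to sampling within each of the nine bands.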

The comparability of these retention 
groups becomes especially apparent when 
one considers the range of possible scores 
from early to late learning. The curves in 
Fig. 1 and 2, for example, illustrate this. 


RESULTS 
Magnitude of retention.—A primary 
concern is the extent to which the 
developed performance capability de- 
teriorates through time. However, 


[Figure 1. Curves: 9-month (N = 9), 14-month (N = 7), and 24-month (N = 8) groups, and initial learning of combined groups (N = 24). Ordinate: integrated absolute error (arbitrary units); abscissa: time segments, original learning period and first retraining session.]

Fig. 1. Performance of Group I (no formal training) during original learning
and following varying periods without practice.


a simple comparison of an S's score 
following some period of no practice 
with his final score in initial training 
will not provide a complete understanding
of performance loss. One
must have information concerning 
the course of initial learning and the 
extensiveness of the training required 
to develop the skill. Figures 1 and 2 
present, for Groups I and II, curves 
illustrating the course of initial learning
followed by the results of the first
session retention measurements. Each
point in the initial learning curve is 
based on the average of the combined 
Ss of the retention samples. Points 
on the abscissa are directly com- 
parable for original training and 
retention and represent 6-min. periods 
within a practice session. Each 
session consisted of 21 min. of prac- 
tice. The first three 1-min. trials 
were not scored to remove warm-up 
effects. 

One of the primary conclusions
drawn from Fig. 1 and 2 is that by 


comparison with the original learning 
of this skill there is little decrement 
in performance even for no practice 
periods of up to 24 mo. There is 
obviously somewhat more decrement 
in the 24-mo. group but recovery is 
rapid even during this first 21-min. 
retraining session. 

Retention and length of interval.—In 
order to obtain a more precise descrip- 
tion of performance at the beginning 
of retraining, the results of the first 
retraining session were plotted on 
a trial-by-trial (minute-by-minute) 
basis. Figure 3 shows that for Group
I (no formal training) the major part 
of performance capability was re- 
gained during the first 2 or 3 min. of 
retraining. It can also be seen that 
the 24-mo. group is consistently
poorer during this first retraining 
session than are the other two groups. 
However, even this group is improv- 
ing to the level of the other groups 
within this first 21-min. retraining 
session. A much smaller loss occurs 




[Figure 2. Curves: 1-month (N = 10), 5-month (N = 10), 9-month (N = 9), and 14-month (N = 9) groups, and initial learning of combined groups (N = 38). Abscissa: time segments, original learning period and first retraining session.]

Fig. 2. Performance of Group II (formal guidance) during original learning
and following varying periods without practice.


for the 9- and 14-mo. intervals and
this is regained in just a few minutes.
Differences between these latter two
groups are negligible.


[Figure 3. Curves: 9-, 14-, and 24-month groups. Ordinate: integrated absolute error (arbitrary units); abscissa: matching score (original learning) followed by one-minute trials.]

Fig. 3. Performance of Group I (no formal training) subgroups during
the first retraining session.


[Figure 4. Curves: 1-, 5-, 9-, and 14-month groups. Ordinate: integrated absolute error (arbitrary units); abscissa: matching score (original learning) followed by one-minute trials.]

Fig. 4. Performance of Group II (formal guidance) subgroups during the
first retraining session.


Figure 4 indicates that Group II 
(formal guidance procedures) showed 
practically no deterioration without 
practice. It should be kept in mind 
that the maximum period without 
practice for this group was 14 mo. 
Our findings with this group are 
consistent with the findings for Group 
I in finding no differences in retention 
level for periods of no practice of 9 
to 14 mo. The findings with Group II 
also indicate that the 9- and 14-mo. 
groups show no greater losses than do 
the groups with only 1 and 5 mo. of no 
practice. It is also shown that these 
groups, which exhibit essentially no 
forgetting, do not improve much dur- 
ing this first retraining session. 

Figure 5, which is based on the 
first 1-min. trial of retraining, illus- 
trates no performance loss as a func- 
tion of longer intervals of no practice 
up to 14 mo. 

Retention and original learning level. 
—Next an examination was made of 
the correlation between final per- 


formance level at the conclusion of 
the original learning period and per- 
formance level during the first retrain- 
ing session. For this, an attempt was 
made to obtain as stable measures 
of performance as possible. Thus,
the original learning measure is based
upon an average score for the last
three original practice sessions, or 54
min. of performance. The retention
score represents the entire 18 min.
which were scored during the first 
retraining session. (As in previous 
analyses, the first 3 min. of this session 
were not scored in order to avoid 
possible need for warm-up effects.) 


Table 2 presents the obtained
correlations between final level of 
original learning and performance 


after different intervals of no practice. 
It is readily apparent that all correla- 
tions are exceptionally high (.80 to 
.98), and all are statistically significant
beyond the .01 level. Thus,
there is virtually no change in the 
ordering of Ss in any group with the 




passage of time without practice. In 
order to obtain a single estimate of 
the relationship between retention 
and original learning level, all these 
cases were pooled together with 
scores of 40 other additional Ss not 
shown in Table 2. These last 40 Ss 
were Ss who learned initially under 
Group I procedures (no formal train- 
ing) and who were brought back on a 
random basis for a single retraining 
session. Their retention intervals 
ranged from 9 to 95 mo. For this 
combined group (N = 109), having 
as it did a wide range of no practice 
intervals, the zero-order correlation 
between original learning level and 
retention score was found to be .80. 
A partial correlation coefficient between
original learning and retention,
with the effect of retention interval
held constant, was .79. The loss of
one point can be attributed to rounding
error in the computational process.

The zero-order correlation between 
retention interval and retention score, 
for this combined sample of 109 Ss 
was .30; when initial learning is 




TABLE 2

CORRELATIONS BETWEEN ORIGINAL LEARNING LEVEL AND RETENTION TEST PERFORMANCE

             Retention Interval (Months)
Group      1      5      9     14     24
I          —      —    .84    .93    .80
II       .89    .85    .98    .93      —

Note.—All r's are based on Ns of 10, except for Group I, 9-mo. entry, where N is 9.


partialed out this drops to .23. This 
underscores the small amount of 
variance in the retention score at- 
tributable to the retention interval 
relative to the large amount of vari- 
ance in retention due to initial learning 
level. 

An important question is whether 
the effect of initial learning upon 
retention performance is more im- 
portant following short periods of no 
practice as opposed to longer inter- 
vals. In other words does the relation 
between initial learning level and 
retention level dissipate through time? 


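[Figure 5: integrated absolute error (mean first-trial score) plotted against months since completion of initial training, for Group I (no formal training) and Group II (formal guidance).]

Fig. 5. The effect of retention interval upon first-trial retraining performance.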


EDWIN A. FLEISHMAN AND JAMES F. PARKER, JR.

[Figure 6: integrated absolute error (arbitrary units) over the one-minute trials of the first retraining session, for the 9- and 14-month groups of Group I (no formal training) and Group II (formal guidance).]

Fig. 6. Comparison of groups with same retention interval, illustrating the
importance of prior proficiency level.


The correlations presented in Table 2 
offer no evidence in support of this. 
The relationship between original 
learning level and retention perform- 
ance appears to remain relatively 
high and constant through periods 
from 1 mo. to 24 mo. of no practice. 

Retention and type of initial training. 
—The no-practice intervals of 9 and 
14 mo. were common to Groups I and 
II. In Fig. 5 the differences between
the two retention performance curves 
at these points reflect differences in 
the final learning levels of these 


TABLE 3

ANALYSIS OF RETENTION SCORES (INTEGRATED ABSOLUTE ERROR) TO DETERMINE IMPORTANCE OF TYPE OF INITIAL TRAINING

                Final Session,           First Retention
                Initial Training         Session
Group    N      Mean      SD             Mean      SD
I       10      743.3     198.11         857.2     237.13
II      10      749.2     206.20         825.9     248.64

Note.—Subjects matched on retention interval and
proficiency at conclusion of initial training.


groups. As described earlier, the 
differences in final learning level result 
from two types of initial training 
procedures. It still remains to be 
shown if the type of initial training, 
independent of final learning level, 
is related to retention performance. 
An analysis was made to separate 
the contribution of these two factors. 

Figure 5 shows the performance 
levels of the 9- and 14-mo. Ss during 
the first minute of retraining. Figure 
6 compares their performance during 
the entire 21-min. initial retraining 
session. It can be seen that the Group 
I Ss are consistently poorer than the 
Group II Ss with the same retention 
intervals.

Table 3 presents the results of an 
analysis designed to answer the ques- 
tion concerning the importance of 
type of initial training vs. level of 
proficiency at the conclusion of initial 
training, as determiners of perform- 
ance retention. It was possible to 
select Ss from Groups I and II who 
were matched in terms of retention 




interval (9 or 14 mo.) as well as in 
terminal proficiency at the conclusion 
of initial training. Table 3 presents 
the matching scores (terminal pro- 
ficiency after initial training) and 
the mean scores obtained during the 
first retention session. No significant 
difference was found between scores 
(t = .32, df = 9). This indicates that
the differences among our Groups 
I and II retention samples following 
periods of no practice of 9 and 14 mo. 
are a function of level of proficiency 
at the end of initial training rather 
than the type of initial training used 
in this study. 
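
The reported df of 9 for 10 matched pairs is consistent with a matched-pairs t test, although the article does not name the exact procedure. A minimal sketch under that assumption follows. The function name and the scores in the usage lines are hypothetical and are not data from the study.

```python
import math
from statistics import mean, stdev


def paired_t(scores_a, scores_b):
    """Matched-pairs t statistic and its degrees of freedom (n - 1)."""
    if len(scores_a) != len(scores_b):
        raise ValueError("matched samples must have equal length")
    diffs = [a - b for a, b in zip(scores_a, scores_b)]
    n = len(diffs)
    standard_error = stdev(diffs) / math.sqrt(n)
    return mean(diffs) / standard_error, n - 1


# Hypothetical retention scores for four matched pairs (illustration only).
group_one = [860.0, 720.0, 910.0, 805.0]
group_two = [845.0, 735.0, 890.0, 815.0]
t_value, df = paired_t(group_one, group_two)
print(round(t_value, 2), df)
```

Because each pair is matched on retention interval and terminal proficiency, the test operates on within-pair differences, which is why the degrees of freedom equal the number of pairs minus one.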

Comparison of retraining⁴ procedures.
—Each retention group was
split into two subgroups of 5 Ss each.⁵
Assignment to a particular group was 


4 To be more accurate, technically, the
term “refresher practice” might be used 
here since no formal training was involved 
in these practice sessions. For ease of dis- 
cussion, however, “retraining” is used. 

5 Since certain cases later were dropped 
in order to obtain matched retention samples, 
the subgroups which were used in the data 
analyses ranged in size from 3 to 5 Ss. 


[Figure 7: integrated absolute error (arbitrary units) across retraining sessions 1 to 4 and a final session one week later, for the massed practice and distributed practice groups.]

Fig. 7. Effect of different retraining programs during retraining and after a further 1-wk. rest.


on the basis of stanine score at the
completion of initial training. This
was used as a means of making the
subgroups approximately comparable
in tracking ability. One group was
retrained during four 21-min. sessions
with 10 min. rest between sessions;
the other during four 21-min. sessions
scheduled 1 day apart. Figure 7
presents average performance curves
for the retention groups retrained
under these two conditions. The
varying periods of no practice are
equated for the two curves. The two
retraining procedures do not result
in substantially different performances
through the third retraining session,
but in the fourth retraining session
the distributed practice group appears
decidedly superior to the massed
practice group.
Apparently, there may be some 
critical period beyond which perform- 
ance under massed practice begins 
to deteriorate. As a means of further
evaluating this, the fourth (terminal) 
session scores for the two retraining 
procedures were compared statis- 




TABLE 4

GROUP MEANS AND SDs OF INTEGRATED ABSOLUTE ERROR SCORES DURING FINAL TRAINING SESSION, RETRAINING SESSIONS, AND FINAL RETRAINING SESSION

                          Massed Practice          Distributed Practice
                          (N = 30)                 (N = 32)
Session                   Mean          SD         Mean       SD
Last initial training     [illegible]   241.9      773.4      220.0
First retraining          [illegible]   300.6      858.5      308.6
Fourth retraining         801.4         365.4      650.2      232.8
Final (1 wk. later)       [illegible]   251.0      693.8      225.0


tically. Due to the limited number 
of cases in each subgroup and the 
consequent difficulty of matching 
such scores, an analysis of covariance
was conducted which compared fourth
session scores while removing the
effect of first session scores as a source
of variance. In effect, this procedure
equates the subgroups statistically
and increases the efficiency of the
comparison procedure. The results
verify the superiority of distributed
practice over massed practice as a
retraining procedure at the end of
four retraining sessions (F = 10.75,
df = 1/59, P < .01).


One of the assumptions underlying the 
use of analysis of covariance concerns the 


I s practice, 
is both to improve tracking proficiency and
to reduce inter-S variability.

Relative permanence of retraining
benefit.—One week following the final
retraining session all Ss were again
tested for one additional session.
Table 4 presents the means and SDs
for both the massed and distributed
groups for the first and the final
retraining sessions as well as for the
session held 1 wk. later. Whereas
group performances initially are approximately
equal, at the conclusion
of the retraining period the distributed
practice group is considerably more
proficient. However, 1 week later
the two groups again are performing
at an approximately equal level and
this level is closer to that attained
by the distributed practice group at
the end of retraining than it is to
that of the massed practice group.
These differences were evaluated statistically;
again, an analysis of covariance
procedure was used comparing
the two groups for the final
session with a control on individual
variation during the first retraining
session. An F of .005 (df = 1/59)
clearly indicated that during the final
session there was no significant difference
between the groups. It appears
that the differences observed
during retraining are not due to differential
learning, but to temporary factors
affecting performance. Thus the
same “massed practice, postrest recovery”
phenomena are found to occur
in relearning as have been found repeatedly
in studies of original motor
learning (see Bilodeau & Bilodeau,
1961).

It is also interesting to note that 
the performance of both groups during 
the later session was superior to that 
attained at the conclusion of the 
initial training period. Tests were 
conducted to evaluate this effect. 
Results indicated a significant im- 
provement for the distributed practice 
group (t = 2.57, df = 31, P = .02) 
and similar though not significant 
improvement for the massed practice 
group (t = 1.56, df = 29, P = .12). 
Apparently the five sessions com- 
prising the retraining and the later 
test trial not only recovered the 
initial performance capability for 
these Ss, but produced additional 
improvement.




Predicting retention from ability measures.
—Those Ss in Group I had been
administered a battery of 44 printed 
and psychomotor aptitude tests in an 
earlier study (Parker & Fleishman, 
1959, 1960). A subsequent factor analy- 
sis of the correlations among these tests 
identified 15 ability factors, but only 2 
of these (Spatial Orientation and Multi- 
limb Coordination) were found related 
to performance on the tracking task 
during initial learning. And these two 
factors, jointly, never contributed more 
than 25% of the variance in performance 
at any stage of practice with this task. 
Nevertheless, it was thought useful to 
see if measures of these factors were 
related to performance after periods 
of no practice. 

From their loadings on the two 
factors (see Parker & Fleishman, 1960), 
the Stick and Rudder Orientation 
(printed) Test and the Rudder Control 
(apparatus) Test were chosen to repre- 
sent the Spatial Orientation and Multi- 
limb Coordination factors, respectively. 
Correlations between these tests and 
performance during the first retention 
session were computed based on an N 
of 69 (the Group I Ss represented in 
Table 3, plus the 40 Ss brought back for 
a single session of retention testing). 
These zero-order correlations with reten- 
tion performance were .21 for the Spatial 
test and .18 for the Coordination test; 
these coefficients are significant at the 
.10 but not the .05 level of confidence.
To hold the effects of initial learning 
level and retention interval constant, 
second-order partial correlations were 
computed. With these factors partialed 
out the Spatial test correlated .21 and 
the Multilimb Coordination test cor- 
related .20 with performance in the 
retention session. Again these coeffi- 
cients are significant only at the .10 
level, not at the .05 level, for second- 
order partials. 

Thus, for this particular skill, a 
negligible to insignificant portion of 
retention performance is attributable 
to Ss’ abilities as measured prior to initial 
learning.⁶ This is true when retention

6 The distinction between the constructs
“ability” and “skill” has been elaborated
elsewhere (Fleishman, 1959, 1962; Gagné &


is defined in terms of performance after 
no practice, as well as when this perform- 
ance is residualized with respect to 
initial learning level and retention in- 
terval. 

Performance on this task during early 
stages of initial learning was shown to 
be uncorrelated with performance during 
late stages of initial learning (e.g., as
late as Trial 8 the correlation with Trial 
50 was only .13); however, practice 
sessions late in original learning cor- 
related .70 with each other (Parker & 
Fleishman, 1960). The communality of 
the final initial learning trial attributable 
to independently measured ability fac- 
tors was only .24. Taken together, 
these findings suggested that proficiency 
at the end of training was mainly a 
function of specific habits and skills 
acquired during the 6 wk. of practice 
with the task and only to a small extent 
a function of Ss’ abilities prior to his 
experience with this task. 

The present findings indicate this is 
also true of retention performance after 
prolonged periods of no practice. This 
is especially apparent when we recall 
the high correlations (in the .80s and
.90s) between proficiency at the conclusion
of training and retention performance,
relative to the negligible
correlations of retention with the independent
ability measures.


SUMMARY 


Two groups of Ss were given extended 
training on a highly complex tracking task. 
Practice extended over 17 sessions distributed 
over 6 weeks. The two groups differed only 
in the amount of verbal guidance provided 
in initial training. Within each group, sub- 
groups of Ss matched for final proficiency 
were retested following various no-practice 
intervals of up to 24 mo. These retention 
samples were further divided into two subgroups,
each of which was given four additional
retraining sessions; in one group this
relearning practice was massed in 1 day and 


Fleishman, 1959). Briefly, “ability” refers 
to a more general, stable trait of the indi- 
vidual inferred from response consistencies 
on a given range of tasks. Skill refers to 
proficiency on a specific task. Some portion 
of the variance in a given skill can be ac- 
counted for in terms of particular component 
abilities. 




for the other group it was distributed over 
4 days. One week following the retraining 
all Ss were retested as a means of evaluating 
the persistence of the effects of these two 
relearning schedules. 

1. The retention of proficiency in a complex,
continuous control, perceptual-motor
skill is extremely high, even for no-practice 
intervals up to 24 mo. For Ss trained ini- 
tially to high levels of proficiency (Group II), 
virtually no loss was observed for periods 
up to 14 mo. What small losses did occur 
were recovered in the first few minutes of 
relearning. With 24 mo. of no practice, rapid 
recovery still occurred during the first 20 
min. of relearning. 

2. Variations in retention interval from 
1 to 14 mo. are shown to be unrelated 
to retention performance, even during the 
first 1 min. of relearning. The function has 
zero slope until the loss in performance shown 
by the 24-mo. retention group.

3. The most important factor in retention 
is the level of proficiency achieved by the 
Ss during initial learning. This effect is 
shown to be just as important following long 
and short periods of no practice. 

4. The type of initial training (amount
of verbal guidance) is unrelated to retention
performance when proficiency level after
original learning is held constant.

5. Retraining administered under conditions
of distributed practice proved to be
superior to that administered under massed
practice based upon a measure of performance
during the final retraining session. However,
on retesting 1 week later no difference was
noted between the two retraining procedures.
Thus, in terms of transfer to later performance
there was no “permanent” disadvantage in
[massed retraining]; both groups
improved [beyond] their original learning
[proficiency].

6. Predictions of individual differences in 
retention from independent ability measures 
were negligible. Retention appears more a 
function of specific task habits acquired, than 
of Ss’ ability traits developed prior to training. 


REFERENCES 


Ammons, R. B., Farr, R. G., Bloch, E.,
Neumann, E., Dey, M., Marion, R.,
& Ammons, C. H. Long-term retention
of perceptual-motor skills. J. exp.
Psychol., 1958, 55, 318-328.

Battig, W. F., Nagel, E. H., Voss, J. F.,
& Brogden, W. J. Transfer and retention
of bidimensional compensatory tracking
after extended practice. Amer. J. Psychol.,
1957, 70, 75-80.

Bell, H. M. Retention of pursuit rotor
skill after one year. J. exp. Psychol., 1950,
40, 648-649.

Bilodeau, E. A., & Bilodeau, I. McD.
Motor skill learning. Annu. Rev. Psychol.,
1961, 12, 243-280.

Fleishman, E. A. Abilities and the learning
of psychomotor skills. In P. H. Dubois,
W. H. Manning, and C. J. Spies (Eds.),
Factor analysis and related techniques in the
study of learning. St. Louis: Washington
University, 1959.

Fleishman, E. A. The description and
prediction of perceptual-motor skill learning.
In R. Glaser (Ed.), Training research
and education. Pittsburgh: Univer.
Pittsburgh Press, 1962.

Gagné, R. M., & Fleishman, E. A. Psychology
and human performance. New York:
Holt, 1959.

Jahnke, J. C. Retention in motor learning
as a function of amount of practice and
rest. J. exp. Psychol., 1958, 55, 270-273.

Jones, E. I., & Bilodeau, E. A. Retention
and relearning of a complex perceptual-motor
skill after ten months of no practice.
HumRRO res. Bull., 1953, No. 53-15.

Leavitt, H. J., & Schlosberg, H. The retention
of verbal and of motor skills. J.
exp. Psychol., 1944, 34, 404-417.

Mengelkoch, R. F., Adams, J. A., & Gainer,
C. A. The forgetting of instrument flying
skills as a function of the level of initial proficiency.
USN Train. Dev. Cent. tech. Rep.,
1958, No. 71-16-18.

Parker, J. F., & Fleishman, E. A. Prediction
of advanced levels of proficiency in a
complex tracking task. USAF WADC
tech. Rep., 1959, No. 59-255.

Parker, J. F., & Fleishman, E. A. Ability
factors and component performance measures
as predictors of complex tracking
behavior. Psychol. Monogr., 1960, 74(16,
Whole No. 503).

Parker, J. F., & Fleishman, E. A. Use
of analytical information concerning task
requirements to increase the effectiveness
of skill training. J. appl. Psychol., 1961,
45, 295-302.

Reynolds, B., & Bilodeau, I. McD. Acquisition
and retention of three psychomotor
tests as a function of distribution of practice
during acquisition. J. exp. Psychol., 1952,
44, 19-26.


(Early publication received February 8, 1962) 




Journal of Experimental Psychology
1962, Vol. 64, No. 3


DISCRIMINATION OF THE REWARD IN LEARNING WITH 
PARTIAL AND CONTINUOUS REINFORCEMENT ¹


STEWART H. HULSE 
Johns Hopkins University 


Over the past 20 years, psychologists
have spent much time observing
and manipulating the things that
happen on nonreinforced trials in
partial reinforcement learning situations.
The result has been a massive
array of information, and an equally
massive array of theory, concerning
the phenomena that partial reinforcement
produces (Jenkins & Stanley,
1950; Lewis, 1960). The nonreinforced
trial is the hallmark of partially
reinforced learning, and the
study of the stimulus and response
correlates of nonreinforcement is,
consequently, of unquestioned importance.
But the nonreinforced
trial is not the only place to look for
information about the effects partial
reinforcement produces.

It is quite conceivable that new
information could be gained about
partial reinforcement from a study
of the behavior used to ingest the
reward on reinforced trials. This
notion is particularly appealing, first
of all, in view of recent studies which
have shown that consummatory responding,
i.e., the rat’s licking rate,
varies quite systematically as a function
of certain variables such as
sweetness of reward, size of drop
delivered from the drinking tube,
and so on (… & Bacon, 1962;
Hulse, Snyder, & Bacon, 1960).

Consummatory behavior has an- 
other unique property in partial 


1 This research was supported by National 
Science Foundation Research Grants G8712 
and G18125. The author wishes to thank 
W. E. Bacon and H. L. Snyder for collecting
the data. 


reinforcement situations which makes 
its study potentially important. Prac- 
tice of the learned response, running 
in an alley for example, is unavoidably 
confounded with number of rein- 
forced and nonreinforced trials, but 
practice of the consummatory response
in the goal box is not. This
is true since the consummatory re- 
sponse cannot occur in overt and 
nonfractional form on nonreinforced 
trials. The reward is not there. 
Generally speaking, then, consum- 
matory behavior takes place only 
as a function of number of reinforced 
trials. It follows that, while an 
examination of the development of 
the running response as a function 
of number of reinforcements is not 
logically justifiable, a similar exami- 
nation of consummatory behavior 
is justifiable and might be quite 
interesting. 

While the above is true in theory,
in fact, some experiments (e.g., Marx,
1958) permit confounding of consummatory
behavior and running in
the sense that their procedures call
for leaving a food cup, a drinking
tube, or some other stimulus closely
tied to the consummatory response,
in the goal box on nonreinforced
trials. Presumably, the rat makes
abortive licks at the drinking tube,
bites the food cup, or otherwise shows
fractional components of the consummatory
response. Other experimental
procedures (e.g., Hulse, 1958;
Weinstock, 1954) do, indeed, completely
unconfound consummatory behavior
from practice of the running
response by deliberately removing



the food cup or drinking tube on 
nonreinforced trials. In both cases, 
the rationale is generally that of 
controlling secondary reinforcement 
or some other process presumed to 
transpire in the goal box. In neither 
case has an attempt been made to 
examine the specific behavior which 
the goal-box stimuli elicit on rein- 
forced and nonreinforced trials. 

The purpose of the present experi- 
ment was to examine consummatory 
behavior on both reinforced and non- 
reinforced trials as a function of three 
percentages of reinforcement of a 
running response in an alley. The 
confounding of consummatory be- 
havior with practice of running in 
the alley was deliberately manipu- 
lated by making a drinking tube 
either available to S or unavailable 
to S on nonreinforced trials. 


METHOD 
Subjects and Apparatus 


The Ss were 50 male naive albino rats of 
the Sprague-Dawley strain obtained from 
Sprague-Dawley, Incorporated, Madison, 
Wisconsin. The Ss were 70 to 80 days of age 
at the time they started the experiment. 

The apparatus was a 15-ft. U-shaped enclosed
runway. The sides of the U were 6 ft.
long, and the base of the U was 3 ft. long. 
The start box was 7 in. long and 5} in. wide, 
and the goal box was 20 in. long and 4 in. 
wide. The goal box was attached at right 
angles to the end of the runway such that 
S made a left-hand turn into it. Inside height 
of the runway was 4 in. throughout. The 
runway was covered with hinged pieces of 
Plexiglas. The goal-box floor was covered 
with brass shim stock. The apparatus was 
painted flat black throughout. 

Two guillotine doors separated the start 
and goal boxes from the alley. The goal-box 
door was located 4 in. before the turn into 
the goal box proper. Photocells, used to 
facilitate the timing of running behavior, 
were located 2 in. past each door. 

Reinforcements were provided from a 
brass drinking tube (2 mm. inside diameter) 
located behind a Plexiglas shield at the far 
wall of the goal box. The tube was centered 


in a ł X $in. vertical slot cut into the 
Plexiglas. A piece of Masonite could be 
placed over the entire far wall of the goal box 
such that S could not see or reach the drinking 
tube on a particular trial. 

Each lick on the drinking tube operated 
a pump system which delivered a drop of 
water of specified volume to the tip of the 
tube. The pump system, described in detail 
elsewhere (Hulse, 1960), consisted of an 
infusion pump operated by an electronic 
relay (Otis & Boenning, 1959). The elec- 
tronic relay also operated a counter. 


Procedure 


Experimental design.—Three percentages 
of reinforcement of the running response were 
used: 33%, 66%, and 100%. In addition, 
for the partial groups, the drinking tube was 
available to S on nonreinforced trials (T) 
or it was not available to S on nonreinforced 
trials (NT). An NT condition could not be 
included for the 100% Ss, since this per- 
centage of reinforcement required the pres- 
ence of the drinking tube on all trials. Ten 
Ss were used in each of the five groups called 
for by the design. 

Taming.—On each of 10 taming days, the 
Ss were handled freely and placed in groups 
of 5 or 6 into a large wooden box. Six water 
bottles were clipped to the outside of the box 
with their drinking tubes projecting through 
holes in the walls of the box. The Ss were 
permitted to explore and to drink from the 
tubes for 15 min. This was the only water 
available during taming. Purina lab chow 
pellets were available in the individual home 
cages at all times. 

Training.—Following taming, Ss were 
given 60 training trials, 1 trial per day. On 
each trial, S was placed in the start box, and 
after a 2- to 3-sec. delay, the start-box door
was raised. The start-box and goal-box 
doors were lowered after S had passed under 
them. After S was removed from the goal 
box, it was returned to its home cage. Fifteen 
to 30 min. later, a water bottle was attached 
to the cage, and S drank for 30 min. The 
Ss were thus approximately 23 hr. thirsty 
at the time trials began each day. 

All Ss were reinforced on each of the first 
3 days of training. Reinforcement consisted 
of 600 licks on the drinking tube with the 
pump set to deliver .0053 cc of water with 
each lick. On Day 1, the drinking tube 
projected through the slot into the goal box. 
On Days 2 and 3, the tube was gradually 
withdrawn so that on Day 3, its tip was } in. 
behind the slot. The tube remained in this 
position for the rest of the experiment. 
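
The reinforcement parameters just given imply a fixed volume of water per reinforced trial. As a quick worked check (simple arithmetic on the two stated values, not a figure reported in the article):

```latex
\[
600~\text{licks} \times 0.0053~\text{cc per lick} = 3.18~\text{cc per reinforced trial}
\]
```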




DISCRIMINATION OF REWARD 


On Day 4, the first nonreinforced trial 
was introduced for the partial Ss. There- 
after, reinforced and nonreinforced trials were 
determined randomly according to the per- 
centages of reinforcement called for by the 
experimental design. The only other restric- 
tion on randomization of reinforcement was 
that the last training trial was reinforced 
for all Ss. 
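
The scheduling constraints described above can be sketched as follows. This is an illustrative sketch only: the article does not say how the random sequences were actually drawn, and the assumption that the stated percentage applies to all 60 trials (including the three initial reinforced days) is ours. The function and parameter names are hypothetical.

```python
import random


def reinforcement_schedule(n_trials=60, proportion=1 / 3,
                           n_forced_start=3, seed=None):
    """Return a list of booleans (True = reinforced trial).

    Assumed constraints: the first `n_forced_start` trials and the last
    trial are always reinforced; the remaining reinforced trials are
    placed at random so the overall proportion matches `proportion`.
    """
    rng = random.Random(seed)
    n_reinforced = round(n_trials * proportion)
    n_fixed = n_forced_start + 1              # forced start trials plus last trial
    n_free = n_trials - n_fixed               # trials open to randomization
    n_free_reinforced = n_reinforced - n_fixed
    if not 0 <= n_free_reinforced <= n_free:
        raise ValueError("proportion incompatible with the forced trials")
    middle = [True] * n_free_reinforced + [False] * (n_free - n_free_reinforced)
    rng.shuffle(middle)
    return [True] * n_forced_start + middle + [True]


schedule_33 = reinforcement_schedule(proportion=1 / 3, seed=1)
print(sum(schedule_33), len(schedule_33))
```

Under these assumptions a 33% S receives 20 reinforced trials in 60, which would match the figures that plot the first 20 reinforcements.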

On nonreinforced trials for the partial Ss 
tested under the T condition, the drinking 
tube was present in its usual location behind 
the slot, but the pump system was emptied 
of water. The S was thus free to lick from 
the tube, but no water was obtainable. On 
nonreinforced trials for Ss tested under the 
NT condition, the far wall of the goal box 
was covered with a piece of Masonite so that 
S could neither see nor lick from the tube. 
Goal-box confinement on nonreinforced trials 


Measures of performance.—Three measures
of running behavior were recorded: start 
time, alley time, and goal-box time. Start 
time began when the start-box door went up 
and ended when S passed the photocell out- 
Alley time was the 
time S required to run from the first photo- 
cell to the second photocell at the goal-box 
door. Goal-box time was the time S required 
to run from the second photocell to the 


timers. The time scores were transformed 
to reciprocals and multiplied by 100 for 
purposes of the statistical analyses. 
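
The transformation just described (reciprocal of the time score, multiplied by 100) can be written as a one-line function. This is a minimal sketch. The function name and the input check are our own additions, and the unit of the recorded times is assumed to be seconds.

```python
def speed_score(time_sec):
    """Transform a time score into a speed score: (1 / time) * 100."""
    if time_sec <= 0:
        raise ValueError("time score must be positive")
    return 100.0 / time_sec


# Hypothetical alley times in seconds (illustration only).
for t in (1.0, 2.0, 4.0):
    print(t, speed_score(t))
```

The reciprocal turns long, skewed time scores into speeds, so slower runs are compressed toward zero.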

Licking [time was]
determined from a clock which started with
the first lick on the drinking tube and stopped 
when S had completed its 600-lick allotment. 
These times were transformed to rates of
licking, in licks per second. The number of 
licks Ss in the T groups emitted on nonrein- 
forced trials was also recorded. 
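
The conversion from clock time to licking rate can be sketched as below, using the 600-lick allotment stated in the Procedure. The function name and the example time are hypothetical and are not data from the experiment.

```python
LICKS_PER_REINFORCEMENT = 600  # allotment stated in the Procedure section


def licking_rate(elapsed_sec, licks=LICKS_PER_REINFORCEMENT):
    """Licks per second, from the time between the first and last lick."""
    if elapsed_sec <= 0:
        raise ValueError("elapsed time must be positive")
    return licks / elapsed_sec


# Hypothetical example: an S that takes 120 sec. to complete its allotment.
print(licking_rate(120.0))
```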


RESULTS 


Licking.—Figure 1 shows that if 
licking rates are plotted as a function 
of number of reinforcements, the 
rates for different percentages of rein- 
forcement reach approximately the 
same asymptote at the last five 
reinforced trials. However, the figure 
also shows that the rate of increase 


[Figure 1: licking rate plotted by blocks of reinforcements (1-5, 6-10, 11-15, 16-20, last 5) for the 33% (N = 20), 66% (N = 20), and 100% (N = 10) groups.]


Fic. 1. The development of licking rates 
as a function of ordinal number of reinforced 
trials. (The scores have been combined for 
the partial groups, since the statistical analy- 
ses indicated the groups did not differ accord- 
ing to the T and NT variable.) 


of licking over the first 20 reinforce- 
ments is quite different depending 
upon percentage of reinforcement. 
The curve for the 33% groups ap- 
pears negatively accelerated, that 
for the 66% groups almost linear with 
a slight suggestion of negative ac- 
celeration, and that for the 100% 
group positively accelerated. 

There is abundant statistical evi- 
dence to support the significance of 
the differences shown in Fig. 1. A 
simple F test for percentage of rein- 
forcement based on means for the 
last 10 of the first 20 trials shown 
in the figure yields an F of 7.92 
(df = 2/47, P<.01). An analysis 
of variance of 5-trial means for the 
first 20 reinforcements for the 33% 
and 66% T groups and the 100% 
group, i.e. for those groups which 
had comparable goal-box conditions 
on all training trials, yields an F for 
percentage of 4.44 (df=2/27, P<.05). 
This analysis also shows a significant 
Percentage X Blocks of Trials inter- 
action (F=2.50, df=6/81, P <.05) 
which indicates that the three groups 
are progressing towards their final 
asymptote at different rates. A simi- 
lar analysis based on 5-trial means 




for the 33% and 66% T and NT 
groups also shows a significant Per- 
centage X Blocks of Trials inter- 
action (F=3.12, df=3/108, P<.05). 
No analysis shows means on the last 
5 reinforced trials to be significantly 
different; the analysis for the com- 
bined 33%, 66%, and 100% groups, 
for example, yielded an F of 3.00 
(df = 2/47, P > .05).

The T condition did not produce 
different licking behavior on rein- 
forced trials than the NT condition. 
The analysis of variance of 5-trial 
means for the 33% and 66% T and 
NT groups provides no evidence that 
licking rates differ as a function of 
the availability of the tube in the 
goal box on nonreinforced trials 
(F=1.66, df=1/36, P>.05). This 
fact is equally true for the last 5 
reinforced trials of the 60 training 
trials (P > .05). Moreover, the T
and NT condition does not interact 
with percentage of reinforcement 
(P > .05). 

Figure 2 shows that the 66% T
group emits more licks on nonreinforced
trials than the 33% T group
(F = 6.02, df = 1/18, P < .05), that number
of licks decreases for both groups
across blocks of nonreinforced trials
(F = 10.92, df = 3/57, P < .01), but

[Figure 2: number of licks plotted by blocks of nonreinforced trials (1-5, 6-10, 11-15, last 5).]


Fic. 2. Number of licks on nonreinforced 
trials for the 33% and 66% T groups as a 
function of ordinal number of nonreinforced 
trials. 


the difference in number of licks 
between the groups across blocks of 
trials does not change in size (P >.05). 
Of particular importance is the fact 
that the absolute number of licks for 
both groups is small, and most of the 
decrease in number of licks occurs 
during the course of the first 5 to 10 
nonreinforced trials. 

Many of the Ss in the T groups 
failed to lick at all on some nonrein- 
forced trials. This happened more 
frequently for the 33% T group than 
for the 66% T group where, out of the 
200 trials represented by the first 
15 and last 5 nonreinforced trials, 
the frequencies are 73 and 60 nonlick 
trials, respectively. A Wilcoxon T 
test shows this difference to be 
significant (P < .05).

Running.—We noted that separate 
analysis of running behavior for rein- 
forced and nonreinforced trials is not 
logically justifiable, since practice 
of running is confounded with the 
number of such trials. Strictly speak- 
ing, however, this is not true for 
goal-box speeds, since this measure 
reflects behavior which occurs after 
Ss enter the goal box and expose them- 
selves directly to the different stimu- 
lus conditions correlated with rein- 
forcement and nonreinforcement. 

Although an analysis of variance of 
blocks of 5 trials for the first 20 rein- 
forced trials shows a significant Blocks 
X Percentage interaction (F = 2.64, 
df = 6/81, P < .05), goal-box speeds 
differ only for the first 5 reinforced 
trials. For this block of trials, speeds 
are greater the higher the percentage 
of reinforcement. The groups do not 
differ on the last 5 reinforced trials 
of the 60 training trials (P>.05). 

An analysis of variance of goal-box 
speeds on the first five and last five 
nonreinforced trials for the 33% and 
66% T groups shows that the 66% 
group ran to the tube faster than 


DISCRIMINATION OF REWARD 231 


the 33% group (F = 7.47, df = 1/18,
P <.05), but that goal-box speeds 
for the two groups decrease from the 
beginning to the end of training 
(F = 6.70, df = 1/18, P < .05). The 
interaction between the variables is 
not significant. 

The differences obtained for be- 
havior in the goal box are indeed 
unique to the goal box and did not 
result from some extraneous factor 
such as differential handling by E at 
the beginning of reinforced as com- 
pared with nonreinforced trials. This 
is substantiated by a control analysis 
run on alley speeds for Reinforced 
Trials 16 to 20 which showed a significant
effect for percentage of reinforcement
(F = 4.78, df = 2/47, P < .05).
Partially reinforced Ss run faster 
than continuously reinforced Ss, as 
we would expect, since the former 
have had many more trials in the 
alley. 

An analysis of start and alley speeds 
at the end of 60 training trials shows 
little except that a 15-ft. U shaped 
alley produces a great deal of response 
variability. Analyses of variance 


based on means for the last 20 training


trials for start speeds show no sig- 
nificant differences due to any of 
the variables. The same is true for 
alley speeds, except for an analysis 
which compared means for the 33% 
and 66% condition and the T and NT 
condition. Here, a significant percentage
effect was obtained (F = 4.27,
df = 1/36, P < .05) which indicated 
that 66% reinforcement produced 
higher speeds than 33% reinforce- 
ment. This effect was not significant, 
however, for an analysis based on 
means from the 100%, 66% T, and 
33% T groups. 


Discussion 


Rats quickly learn to detect whether 
or not the drinking tube will produce 


water, and their licking rates rapidly 
increase as a function of number of rein- 
forcements. However, the rate with 
which licking rates increase over rein- 
forced trials is a function of the number 
of intervening nonreinforced trials. 

We can account for this phenomenon 
on the assumption that a stimulus-dis- 
crimination process takes place in the 
goal box for the partial groups. If we 
think of the drinking tube and fluid 
as discriminative stimuli for approaching 
the tube and drinking, the data suggest 
that rats quickly learn to attend to 
these stimuli and to do the appropriate 
thing when they are present or absent.

First, licking on nonreinforced trials 
occurs, but it rapidly extinguishes. The 
Ss in the T condition learned to take 
only a lick or two on the dry drinking 
tube on nonreinforced trials, and they 
learned to do this early in training over 
the first 5 to 10 nonreinforced trials. 
Also, after a given number of nonreinforced
trials, the Ss in the 66% T group
have had more reinforced trials than Ss 
in the 33% T group and have, presum- 
ably, developed a stronger consumma- 
tory response. We might, therefore, 
expect a greater tendency for them to 
generalize licking on reinforced trials 
to licking on nonreinforced trials. This 
apparently occurred, since the 66% T 
group emitted more licks on nonrein- 
forced trials than the 33% T group. 
Finally, in this connection, Ss in the 
partial groups were apparently so set 
to discriminate cues associated with 
reinforcement from cues associated with 
nonreinforcement that they were able 
to outwit E in their ability to identify 
a nonfunctional drinking tube. Under 
the T condition, all Ss failed to lick on 
some nonreinforced trials. Possibly, 
since the pump system was drained on 
nonreinforced trials, the Ss could dis- 
criminate after they entered the goal 
box whether or not water was visible 
at the opening of the tube. Possibly, 
for the same reason, they discriminated 
an odor difference in the goal box; given 
the chemical content of local laboratory 
tap water, this is not an inconceivable 
proposition. 




Second, goal-box speeds on nonrein- 
forced trials decrease as a function of 
the number of such trials. Since the 
licking data show that partial Ss could 
sometimes detect a dry drinking tube 
before they took a lick, approaching 
the drinking tube on a nonreinforced 
trial extinguished to some extent. More- 
over, the effects of this process appear 
to have generalized to suppress the speed 
with which partial Ss approached the 
drinking tube on reinforced trials. Thus, 
goal-box speeds on reinforced trials 
increase at approximately the same rate 
regardless of percentage of reinforce- 
ment. This is to be contrasted with 
the usual finding, obtained for alley 
speeds in the present experiment, that 
increments in response strength as a 
function of number of reinforced trials 
will be much greater the lower the per- 
centage of reinforcement of the response. 

Finally, the consummatory response 
develops over reinforced trials as Ss 
learn to discriminate the special sig- 
nificance of the drinking tube and fluid 
as stimuli for consummatory responding. 
For a fixed number of reinforcements, 
the rate with which this discrimination 
develops will increase as the number 
of intervening nonreinforcements in- 
creases. We would therefore expect 
the 33% groups to learn the consumma- 
tory response faster than the 66% groups. 
Since the 100% group received no dis- 
crimination training with respect to the 
drinking tube, we would expect this 
group to develop the consummatory 
response slowest of all. The data 
clearly support these conclusions. 

It seems clear that partial reinforce- 
ment provides discrimination training 
for reward stimuli in the goal box, but 
continuous reinforcement does not. Dur- 
ing partial reinforcement, in effect, be- 
havior is critically focused on the re- 
ward and its stimulus properties because 
of the contrast in goal-box conditions 
on reinforced as compared with nonrein- 
forced trials. It follows that, if partial 
reinforcement is used to condition a 
response, the development of response 
strength may be more critically deter- 
mined by stimuli correlated with the 




reward, such as its sweetness, than if 
continuous reinforcement is used. We 
might expect, for example, that after 
partial reinforcement resistance to ex- 
tinction would increase as a function 
of the sweetness of a reward. After 
continuous reinforcement, however, there 
should be much less correlation between 
reward sweetness and resistance to ex- 
tinction. Hulse and Bacon (1962) 
showed these predictions to hold follow- 
ing training with different concentra- 
tions of saccharin. Hulse (1958) showed 
much the same thing for different-sized 
food rewards. 

Some additional support for these 
hypotheses comes from another source. 
Jenkins (1961) found that resistance to 
extinction in the presence of a stimulus 
was greater if that stimulus had been 
paired with reinforcement during dis- 
crimination training as compared with 
continuous reinforcement training. Jenkins’
stimulus was a light pattern projected
on S's response key. The analogous
stimulus in the present approach
is the reinforcing stimulus itself, and 
this is not presented, of course, during 
extinction. If the effect Jenkins noted 
is to occur in straightforward fashion, 
it would have to operate through some 
mechanism other than the reward stim- 
ulus per se. There is a clear parallel 
between Jenkins’ approach and the 
approach outlined here, and the cor- 
respondence between Jenkins’ data and 
the data discussed here is interesting. 
This apparent correspondence must re- 
main suggestive for the present, however. 


SUMMARY 


Fifty albino rats were given 60 training 
trials in a 15-ft. U shaped alley. They were 
reinforced with 600 licks of water from a 
drinking tube on 33%, 66%, or 100% of the 
trials. Half the partial Ss could lick on the 
dry drinking tube on nonreinforced trials; 
for the other half, the tube was blocked such 
that the Ss could neither see it nor lick it. 
Start speeds, alley speeds, and goal-box 
speeds, licking rates on reinforced trials, and 
number of licks on nonreinforced trials were 
recorded. 

The results show that the partial groups 




developed licking rates faster in the goal box, 
as a function of number of reinforcements, 
than the continuous group. This happened 
whether or not the tube was available on 
nonreinforced trials. Licking rapidly extin- 
guished, and goal-box speeds decreased, on 
nonreinforced trials for the partial groups. 
Start speeds and alley speeds did not vary as 
a function of any of the experimental vari- 
ables; this may have been due to excessive 
response variability produced by the very 
long runway. 

The data suggest that partial reinforce- 
ment produces a very powerful discrimination 
of reward stimuli in the goal box. This 
process may be a factor in experiments which 
have shown that stimulus variables corre- 
lated with the reward, such as its sweetness 
or size, have different effects if they are used 
with partial as compared with continuous 
reinforcement. 


REFERENCES 


Hulse, S. H. Amount and percentage of
reinforcement and duration of goal confinement
in conditioning and extinction.
J. exp. Psychol., 1958, 56, 48-57.

Hulse, S. H. A precision liquid feeding
system controlled by licking behavior.
J. exp. Anal. Behav., 1960, 3, 1-3.

Hulse, S. H., & Bacon, W. E. Supplementary
report: Partial reinforcement and
amount of reinforcement as determinants
of instrumental licking rates.
J. exp. Psychol., 1962, 63, 214-215.

Hulse, S. H., Snyder, H. L., & Bacon,
W. E. Instrumental licking behavior as a
function of schedule, volume, and concentration
of a saccharine reinforcer.
J. exp. Psychol., 1960, 60, 359-364.

Jenkins, H. M. The effect of discrimination
training on extinction. J. exp. Psychol.,
1961, 61, 111-121.

Jenkins, W. O., & Stanley, J. C., Jr.
Partial reinforcement: A review and
critique. Psychol. Bull., 1950, 47, 193-234.

Lewis, D. J. Partial reinforcement: A
selective review of the literature since 1950.
Psychol. Bull., 1960, 57, 1-28.

Marx, M. H. Resistance to extinction as a
function of continuous or intermittent
presentation of a training cue.
J. exp. Psychol., 1958, 56, 251-255.

Otis, L. S., & Boenning, R. A. A transistorized
circuit for recording contact responses.
J. exp. Anal. Behav., 1959, 2, 280-291.

Weinstock, S. Resistance to extinction
of a running response following partial
reinforcement under widely spaced trials.
J. comp. physiol. Psychol., 1954, 47, 318-322.


(Early publication received March 16, 1962) 


Journal of Experimental Psychology


MEDIATED ASSOCIATION IN A PAIRED-ASSOCIATE
TRANSFER TASK


DAVID S. PALERMO 
Institute of Child Development, University of Minnesota 


A reliable condition for producing 
negative transfer in paired-associate 
learning occurs in the A-B, A-C 
paradigm. In this paradigm, one 
set of responses is learned to a set of 
stimuli and subsequently, new re- 
sponses are learned to the same stim- 
uli. Another approach to an under- 
standing of the learning of successive 
lists of paired associates has used 
an A-B, B-C, A-C paradigm for an 
experimental group and an A-B, D-C,
A-C paradigm for a control group. 
In such mediated association studies 
(e.g., Norcross & Spiker, 1958), it 
has been assumed that the A-C list 
is learned more rapidly by the experi- 
mental group because the learning 
of the B-C list provides a mediating 
link between A and C of the following 
nature: 


S_A → r_B-s_B → R_C


The s_B → R_C association learned in the
B-C list provides the associative or 
mediational link assumed to facilitate 
the learning of the A-C list. 

The present experiment was de- 
signed to determine whether mediated 
association effects can be demon- 
strated within an A-B, A-C paradigm. 
The usual paradigm was modified to 
the extent of presenting lists in which 
each response was paired with two
stimuli rather than one response with 
each stimulus. Therefore, there were 
only half as many different responses 
as stimuli in both the A-B and the 


A-C lists. Upon reaching criterion 
on the A-B list, each response and its 
associated stimulation had been conditioned
to two stimuli of the list.
By appropriate pairing of the new 
responses with the old stimuli in a 
“mixed” A-C list (Twedt & Under- 
wood, 1959), it was possible to ar- 
range stimuli and responses which, 
according to the mediational hypoth- 
esis, would lead to facilitation or im- 
pairment of parts of the A-C list 
during the learning of the list. Thus, 
the learning of a particular pair in 
the A-C list was assumed to mediate 
responses which would affect the 
learning of another pair within the 
list. 


METHOD 


Apparatus.—The apparatus was a modi- 
fied Card Master. Basically, it consisted of a 
box 13 X 13 X 94 in. The front face of the 
apparatus contained a 3 X6 in. aperture 
covered by clear plastic. A mechanical 
arrangement delivered to the aperture a 
3} X 6 in. plastic card from the bottom of a 
stack of cards. Mounted on each card was a 
pair of Stanford-Binet Picture Vocabulary 
pictures reproduced by a Thermofax process 
for this purpose. Gray metal doors served as 
shutters to expose independently the left and 
right pictures on the cards. Following a
presentation, the doors closed simultaneously
and the card was released from the aperture 
and returned to the top of the stack by a 
conveyor belt. A system of electronic timers 
controlled the rate of presenting the cards 
and the exposure times of the stimulus and 
response picture. 

Experimental design.—All Ss learned two lists, each of which was
composed of six stimulus pictures and three response pictures paired
so that each response was learned to two stimuli. Table 1 presents
the design of the experiment, with the picture names, which provides
that each S serve under each of three experimental conditions. For
two pairs in List 2 the mediational chain, established during the
learning of the pairs, is expected to facilitate learning (Cond. I).
For two other pairs the mediational link is expected to elicit
incorrect responses and, thus, interfere with learning (Cond. II).
The other two pairs involve two new stimuli as well as the two new
responses and thus have no experimentally established mediating
tendencies (Cond. III). In the case of Cond. I and II, it is assumed
that when a stimulus is presented in List 2, the List 1 response and
attendant stimulation will occur either overtly or covertly. It is
also assumed that the response of List 2 will be conditioned to the
attendant stimulation of the List 1 response. For example, in List 2
of Table 1 the learning of the pair STOOL-HOUSE (S_1 → R_D) in
Cond. I is expected, through the establishment of the association
scissors-HOUSE (s_A → R_D), to facilitate the learning of the
response HOUSE (R_D) to SHOE (S_2). Similarly, the learning of
SHOE-HOUSE (S_2 → R_D) is expected to facilitate the learning of
STOOL-HOUSE (S_1 → R_D). In the case of Cond. II, it is expected
that the learning of GLASSES-BASKET (S_3 → R_E), in List 2, will
interfere with the learning of CUP-KNIFE (S_4 → R_F) because of the
establishment of the association of bed-BASKET (s_B → R_E).
Similarly, the establishment of the association bed-KNIFE
(s_B → R_F) during the learning of CUP-KNIFE (S_4 → R_F) will
interfere with the learning of GLASSES-BASKET (S_3 → R_E). The
occurrence of BASKET (R_E) to CUP (S_4) or KNIFE (R_F) to GLASSES
(S_3) would be considered a mediated error. In the case of
Cond. III, the stimuli in List 2 have no experimentally established
mediating tendencies since they have not been previously presented.


TABLE 1

DESIGN OF THE EXPERIMENT

        List 1                       List 2
S             R              S             R            Cond.
S_1 STOOL     R_A SCISSORS   S_1 STOOL     R_D HOUSE    I
S_2 SHOE      R_A SCISSORS   S_2 SHOE      R_D HOUSE    I
S_3 GLASSES   R_B BED        S_3 GLASSES   R_E BASKET   II
S_4 CUP       R_B BED        S_4 CUP       R_F KNIFE    II
S_5 CLOCK     R_C HAND       S_7 FORK      R_E BASKET   III
S_6 TABLE     R_C HAND       S_8 TREE      R_F KNIFE    III


Subjects.—The Ss were drawn from Grades 3-6 (Ns = 12, 12, 15, and
18, respectively) of the University of Minnesota Elementary
School.¹ All Ss were naive with respect to verbal learning except
the Grade 6 children who had participated in a paired-associate
learning experiment 1 yr. previously. Seven Ss had to be discarded
due to apparatus failure and 19 were discarded because they failed
to reach criterion on List 1 within 27 [...] and 1 in Grades 3-6,
respectively. [...] the same form of List 1. [...] groups designed
to control for possible differential difficulty of the
stimulus-response pairs in List 2. The groups differed with respect
to which of the three forms of List 2 they received. The three
forms were constructed so that a given set of two List 2 pairs was
used in each of the three conditions.
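The logic by which the List 1 pairings assign each List 2 pair to a condition can be made explicit in a few lines. The following is a minimal sketch, not the author's procedure: the picture names are those of Table 1, while the function names (`classify_pairs`, `mediated_errors`) and the "unclassified" label are hypothetical. A List 2 pair is treated as Cond. I when the other stimulus that shared its List 1 response takes the same List 2 response, as Cond. II when that partner takes a different List 2 response, and as Cond. III when its stimulus did not appear in List 1.

```python
# Picture pairings as given in Table 1 (stimulus -> response).
LIST1 = {"STOOL": "SCISSORS", "SHOE": "SCISSORS",
         "GLASSES": "BED", "CUP": "BED",
         "CLOCK": "HAND", "TABLE": "HAND"}
LIST2 = {"STOOL": "HOUSE", "SHOE": "HOUSE",
         "GLASSES": "BASKET", "CUP": "KNIFE",
         "FORK": "BASKET", "TREE": "KNIFE"}


def partners(stimulus, list1):
    """Other List 1 stimuli that were paired with the same List 1 response."""
    return [s for s, r in list1.items() if r == list1[stimulus] and s != stimulus]


def partner_responses(stimulus, list1, list2):
    """List 2 responses that become attached to the shared List 1 response."""
    return {list2[p] for p in partners(stimulus, list1) if p in list2}


def classify_pairs(list1, list2):
    """Assign each List 2 stimulus to Cond. I, II, or III."""
    conditions = {}
    for stimulus, response in list2.items():
        if stimulus not in list1:
            conditions[stimulus] = "III"       # new stimulus, no mediator
            continue
        mediated = partner_responses(stimulus, list1, list2)
        if not mediated:
            conditions[stimulus] = "unclassified"
        elif mediated == {response}:
            conditions[stimulus] = "I"         # mediator elicits the correct response
        else:
            conditions[stimulus] = "II"        # mediator elicits a competing response
    return conditions


def mediated_errors(list1, list2):
    """Responses that would count as mediated errors for each old stimulus."""
    errors = {}
    for stimulus, response in list2.items():
        if stimulus in list1:
            wrong = partner_responses(stimulus, list1, list2) - {response}
            if wrong:
                errors[stimulus] = sorted(wrong)
    return errors


print(classify_pairs(LIST1, LIST2))
print(mediated_errors(LIST1, LIST2))
```

With the Table 1 pairings this classification reproduces the condition column of the table, and the predicted mediated errors are KNIFE to GLASSES and BASKET to CUP, as described in the text.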

Procedure.—The Ss were given two experimental
sessions separated by 1-4 days. Each
S was brought by E from the classroom to the 
experimental room, and seated facing a 
37 X 49 in. panel with a 3 X 6 in. aperture 
through which S viewed the stimulus cards. 
A shaded 40-w. light directly above the 
aperture provided illumination. The ap- 
paratus was located behind the panel in an 
adjoining room. On the table was a Webster 
Teletalk two-way speaker. The E gave the 
instructions through the speaker system and 
recorded the verbal responses as they were 
given by S. Thus, S was alone during both 
experimental sessions. 

At Session 1 instructions explaining the
task were read to S and a pair of pictures, not
subsequently used in the experiment, were
shown to familiarize him with the apparatus
and procedure. The S was not required to
name the stimulus picture but was asked to
anticipate the response picture by naming it.
No S had any difficulty in giving the appropriate
name to the stimuli. List 1 was presented
immediately and continued until S
reached a criterion of three successive errorless
trials.


1 The author wishes to express his appreciation
to James R. Curtin, Principal of the
University Elementary School, for his cooperation
in making Ss available for the
experiment, and Ernest Washington, who
ran the Ss.


At Session 2 each S relearned List 1 to 
three successive errorless trials and then was 
told that another list would be presented. 
After approximately 2 min., necessary to 
change lists, List 2 was presented for nine 
trials or three successive errorless trials, 
whichever was fewer. If the criterion was 
reached prior to the nine trials, it was assumed 
for the analysis that no additional errors 
would have been made. 
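The stopping and scoring rule for List 2 can be sketched as follows. This is an illustration only: it scores the whole six-pair list rather than each condition separately, the function name `score_list2` is hypothetical, and the per-trial counts in the examples are invented for demonstration rather than taken from the data.

```python
def score_list2(correct_per_trial, n_pairs=6, max_trials=9, criterion_run=3):
    """Apply the List 2 rule: stop after nine trials or after three
    successive errorless trials, whichever comes first, and credit any
    trials not presented as errorless.

    correct_per_trial: number of correct anticipations on each trial.
    Returns (trials actually presented, total correct out of max_trials).
    """
    total = 0
    streak = 0
    presented = 0
    for n_correct in correct_per_trial[:max_trials]:
        presented += 1
        total += n_correct
        streak = streak + 1 if n_correct == n_pairs else 0
        if streak == criterion_run:
            break
    if streak == criterion_run:
        # Remaining trials are assumed to contain no additional errors.
        total += (max_trials - presented) * n_pairs
    return presented, total


# Hypothetical S who reaches criterion on Trial 5.
print(score_list2([2, 4, 6, 6, 6]))              # (5, 48)
# Hypothetical S who never strings together three errorless trials.
print(score_list2([3, 5, 6, 5, 6, 6, 4, 6, 6]))  # (9, 47)
```

Crediting the unpresented trials keeps every S on the same nine-trial scale, so that scores from Ss who met criterion early remain comparable with those from Ss who did not.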

The pictures in List 1 were randomly 
designated as stimuli and responses and 
randomly paired together with the restriction 
that no obviously highly associated pictures 
were paired together. All lists were presented 
in three varying orders to control for serial 
learning. The orders were randomly deter- 
mined with the exception that no response 
picture was presented twice in succession. 
A 3-sec. anticipation period, a 3-sec. joint 
presentation of stimulus and response, and a 
3-sec. interval between pairs was used through- 
out the experiment. All verbal responses 
occurring in the anticipation interval were 
recorded verbatim throughout the experiment. 
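The ordering restriction and the timing of a trial can be illustrated with a short sketch. It assumes simple rejection sampling (reshuffle until no response picture repeats in succession), which is one way to satisfy the stated restriction and is not necessarily how the original orders were drawn. The names `presentation_order` and `trial_seconds` are hypothetical.

```python
import random

# List 1 pairs from Table 1 as (stimulus, response).
PAIRS = [("STOOL", "SCISSORS"), ("SHOE", "SCISSORS"),
         ("GLASSES", "BED"), ("CUP", "BED"),
         ("CLOCK", "HAND"), ("TABLE", "HAND")]


def presentation_order(pairs, rng):
    """Shuffle the pairs until no response picture occurs twice in succession."""
    order = list(pairs)
    while True:
        rng.shuffle(order)
        if all(a[1] != b[1] for a, b in zip(order, order[1:])):
            return order


def trial_seconds(pairs, anticipation=3, joint=3, interval=3):
    """Duration of one pass through the list at the stated 3-sec. intervals."""
    return len(pairs) * (anticipation + joint + interval)


rng = random.Random(0)          # seeded only to make the sketch repeatable
orders = [presentation_order(PAIRS, rng) for _ in range(3)]
for order in orders:
    print([response for _, response in order])
print(trial_seconds(PAIRS), "sec. per trial")
```

With six pairs and three 3-sec. periods per pair, one trial through the list takes 54 sec.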


RESULTS 


Table 2 presents the means and 
SDs for the number of correct antic- 
ipations in each condition for each 
grade. An evaluation of the differ- 




ences among these groups employed 
an analysis of variance in which the 
main effects of Control List Groups 
and of Grade were between-S factors
while the main effect of Conditions 
was an intra-S factor (Lindquist, 
1953, p. 281, Type III). The means 
for the groups designed to control for 
possible differential difficulty of the 
picture pairs did not differ significantly 
(F = 1.68, df = 2/45, Error_b = 14.54),
nor did the four grades (F = 2.56,
df = 3/45, .05 < P < .10). The means
for the three experimental conditions
differed beyond the .001 level
(F = 14.88, df = 2/90, Error_w = 3.32).
Individual t tests for related measures
showed that the differences between
Cond. I and III and between Cond. 
II and III were significant (P < .01), 
but the difference between Cond. I 
and II was not (.05 < P < .10). 
None of the interactions was sig- 
nificant. 

Although the analysis of variance 
indicated no significant interaction 
between Grade and Conditions, it 
may be seen in Table 2 that there 
appears to be an interaction between 
Cond. I and II and grade level. Per- 
formance of Ss in Grades 3 and 4 
shows clearly the effects of the two 
conditions, but for Grade 5 the effect 
is much less pronounced and for 


TABLE 2


MEANS AND SDs OF TRIALS TO LEARN LIST 1 AND FOR CORRECT RESPONSES ON
LIST 2 UNDER EACH CONDITION FOR EACH GRADE


      |    | List 1 Trials |                         List 2
      |    | to Learn      | Cond. I       | Cond. II      | Cond. III     | Cond. I, II, III
Grade | N  | Mean  | SD    | Mean  | SD    | Mean  | SD    | Mean  | SD    | Mean  | SD
3     | 12 | 19.42 | 7.64  | [...] | [...] | 10.33 | 3.40  | 12.50 | 2.81  | 11.47 | 3.24
4     | 12 | [...] | [...] | [...] | [...] | 10.83 | 2.11  | 14.33 | 2.28  | 12.69 | 2.72
5     | 15 | [...] | [...] | [...] | [...] | 12.27 | 2.52  | 13.27 | 2.77  | 12.71 | 2.89
6     | 18 | [...] | 3.27  | [...] | [...] | 13.44 | 2.09  | 14.67 | 1.20  | 13.74 | 1.95
Total | 57 | 17.16 | 6.96  | [...] | [...] | 11.93 | 2.82  | 13.77 | 2.44  | 12.77 | 2.80
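The Total row of Table 2 can be checked against the grade rows, since each total mean should be the N-weighted mean of the four grade means. The sketch below uses the grade Ns given under Subjects and the three legible List 2 columns. Reading those columns as Cond. II, Cond. III, and the combined Cond. I, II, III score is an assumption about the column layout, and the helper name `weighted_mean` is hypothetical.

```python
NS = [12, 12, 15, 18]                      # Grades 3, 4, 5, 6

# Grade means as printed in the three legible List 2 columns (assumed labels).
COND_II = [10.33, 10.83, 12.27, 13.44]
COND_III = [12.50, 14.33, 13.27, 14.67]
COMBINED = [11.47, 12.69, 12.71, 13.74]


def weighted_mean(means, ns):
    """Mean over all Ss, weighting each grade mean by its number of Ss."""
    return sum(m * n for m, n in zip(means, ns)) / sum(ns)


for label, column in [("Cond. II", COND_II),
                      ("Cond. III", COND_III),
                      ("Cond. I, II, III", COMBINED)]:
    print(label, round(weighted_mean(column, NS), 2))
```

The three weighted means come to 11.93, 13.77, and 12.77, which match the printed Total row and support the reading of the grade rows used here.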




Grade 6 the means are actually in 
the opposite direction of that pre- 
dicted. Consequently, an analysis 
of variance was conducted on the 
data for Cond. I and II alone to 
permit a more sensitive test for the 
interaction indicated in Table 2. 
The difference between the condi- 
tions was significant at the .05 level 
(F = 4.14, df = 1/45, Error_w = 3.22),
but the Grade × Conditions interaction
was not significant by usual
standards (F = 2.48, df = 3/45,
.05 < P < .10).

An analysis of the errors made in
response to stimulus members of the
pairs in Cond. II and III was made by
the procedure used by Norcross and
Spiker (1958). The mean number of
reversals per S for Cond. II was 1.33
and for Cond. III [...] (P [...] .10,
related t test). For Grades 3 and 4,
where the mediation effect was more
apparent, there were 1.58 reversals in
Cond. II and 1.00 in Cond. III
(P = .05; related t test). However,
these suggestions of differences must
be discounted because the ratio of
the number of such intrusion errors to
total errors was approximately equal
in each comparison of Cond. II and
III.


Discussion 


These data provide evidence that the 
conditions designed to produce mediated 
associations result in differential effects 
upon the amount of transfer in an A-B, 
A-C transfer paradigm. Performance on 
the pairs assumed to be facilitated by 
the mediation of correct responses was 
superior to performance on pairs in 
which interference was assumed due 
to the mediation of incorrect responses. 
This result had been found in a preliminary
study using 22 Grade 6 Ss in which
[...] designed to produce facilitation through
mediation (Cond. I) [...]
designed to produce interference through
mediation (Cond. II) but without the
control pairs (Cond. III). The mean
number of correct responses for Cond. I
and II in that study was 25.00 (SD = 4.00)
and 23.32 (SD = 3.76), respectively,
yielding a t which was significant at
the .025 level.

While the relationship between the 
conditions assumed to produce facilita- 
tion and interference due to mediation 
seems to be a stable one under these 
conditions, some caution should be exercised
in generalizing these results. Attempts 
to replicate the findings of this study 
with college students and Grade 10 
children using the same design failed to 
yield significant differences among the 
three conditions. In the study with 
college students, low-association non- 
sense syllables were used as stimuli and 
responses, and in the Grade 10 study 
high-frequency adjectives were used. 

The suggestion of an interaction be- 
tween grade level and the effects of 
Cond. I and II in this study along with 
the failure to replicate the findings with 
college students and tenth graders might 
indicate that some factor or factors 
related to age may be important. 

In the present study, performance on 
the control pairs was superior to per- 
formance on the pairs assumed to be 
facilitated by mediation as well as on the 
pairs assumed to be impaired by media- 
tion. These results are in contrast with 
those of Norcross and Spiker (1958). 
Performance on their control pairs was 
superior to performance on pairs in 
which interference was expected and 
inferior to performance on pairs in which 
facilitation was expected in a three-list 
paradigm. Apparently, the strong asso- 
ciative interference effects of the A-B, 
A-C paradigm in the present study out- 
weigh any facilitation introduced by the 
mediated associations. 

Ignoring the mediation effects, this 
study replicates the findings of the Spiker 
and Holton (1958) study in which it was 
demonstrated, using a motor paired-associate
task, that learning of an A-B,
A-C series results in associative inter- 
ference relative to an A-B, D-C series. 
In the present experiment the control 
pairs are comparable to the D-C pairs 




of the Spiker and Holton study and were 
learned significantly faster than either 
the Cond. I or II pairs. 


SUMMARY 


An experiment was conducted to study 
facilitation and impairment of performance 
as a function of mediated associations in a 
modified A-B, A-C transfer paradigm. Chil- 
dren in Grades 3, 4, 5, and 6 were required to 
learn two lists of six paired associates com- 
posed of six stimuli and three responses. In 
List 2, the pairs were arranged so that learning 
could be facilitated by mediated associations 
or impaired by mediated associations. In 
addition, control pairs, in which no experi- 
mentally manipulated mediation was present, 
were included. 

The results indicated that the condition 
designed to produce facilitation through 
mediated associations led to superior per- 
formance when compared with the condition 
designed to produce impairment through 




mediated associations. Performance on the 
control pairs was superior, however, to that 
of both of the mediation conditions. The 
results were discussed in terms of age related 
variables and the associative interference 
effects of the A-B, A-C paradigm. 


REFERENCES 


Lindquist, E. F. Design and analysis of
experiments in psychology and education.
New York: Houghton Mifflin, 1953.

Norcross, K. J., & Spiker, C. C. The
effects of mediated associations on transfer
in paired-associate learning. J. exp.
Psychol., 1958, 55, 129-134.

Spiker, C. C., & Holton, R. B. Associative
transfer in motor paired-associate learning
as a function of amount of first-task practice.
J. exp. Psychol., 1958, 56, 123-132.

Twedt, H. M., & Underwood, B. J. Mixed
vs. unmixed lists in transfer studies.
J. exp. Psychol., 1959, 58, 111-116.


(Received July 17, 1961)




Journal of Experimental Psychology 
1962, Vol. 64, No. 3, 239-248 


PREDICTION OF PREFERENCE, TRANSPOSITION, AND 
TRANSPOSITION-REVERSAL FROM THE 
GENERALIZATION GRADIENT ¹


WERNER K. HONIG

Denison University


The present study concerns the
prediction of differential responding
in an operant situation from the
generalization gradient. Gradients
were obtained on the spectral continuum
with the technique of Guttman
and Kalish (1956), while preference
tests involving the generalization
stimuli were concurrently administered.
In a general sense, this is
an attempt to predict choice behavior
when the responsiveness to each single
stimulus is known. More specifically,
it involves a test of the power of the
generalization gradient in the determination
of behavior in situations
more complex than the single-stimulus
condition in which it is obtained.

The desired relationships were investigated
after simultaneous and
successive² discrimination training as
well as simple acquisition, in order
to determine whether the peak shift
obtained by Hanson (1959) for the
postdiscrimination gradient (PDG) is
accompanied by transposition and
transposition-reversal as predicted by
Spence (1937). In previous transposition
studies (e.g., Baker & Lawrence,
1951; Ehrenfreund, 1952; Kendler,
1950) the PDGs have not been obtained;
it could not be determined,
therefore, whether the consistent failure
to obtain clear-cut transposition-reversal
in such studies was due to
the form of the PDGs or to a lack of
correspondence between them and
stimulus preferences on the transposition
tests. In the present study,
PDGs and transposition tests were
obtained concurrently.


1 This paper is adapted from a doctoral
dissertation submitted to the Department
of Psychology, Graduate School of Arts and
Sciences, Duke University, in 1958. The
research was supported by Grant MH-1002
from the National Institute of Mental Health
to Norman Guttman. The author is
indebted to Norman Guttman for [...]
and guidance throughout the research. The
preparation of the publication version was
supported by Grant M-2414 to the author.

2 A successive discrimination refers in this
paper to the case where only the positive or
the negative stimulus value is presented at
one time to S, or the "go, no-go" discrimination.
The term has been used in this sense
by Grice (1949) and Baker and Lawrence
(1951). Other authors (Bitterman, Spence)
have more recently used successive discrimination
to refer to a conditional left-right
discrimination, but this usage is not intended
here.


EXPERIMENTS 1 AND 2 


Method 


Apparatus. —The automatic key pecking 
apparatus used in the present investigation 
was similar to that employed in other spectral 
generalization studies (e.g, Guttman 
Kalish, 1956) except that it included two 
separately illuminated keys 74 in. from the 
floor with 2 in. between centers. Bausch 
and Lomb monochromatic interference filters, 
with band widths at half height of 7-9 mp, 
provided the different spectral values. In 
the text, stimuli will be referred to by the 
nominal value of the filters, but the spacing 
of points on the abscissae of the figures cor- 
responds to the actual transmission peaks 
which differ slightly in some cases. A K-2 
yellow filter was inserted when necessary 
to prevent the transmission of the visible 
second-order spectrum produced by the 
filters. The various spectral values were 
equated for apparent brightness by a human 
observer with the aid of a Macbeth illuminom- 
eter. This appears justified by the work of 




TABLE 1

COMPOSITION OF STIMULUS PAIRS ON THE
GENERALIZATION TESTS IN EXP. 1 AND 2

Experiment 1 | | Experiment 2 |
Absolute Value (mμ) | Difference from CS (mμ) | Absolute Value (mμ) | Difference from CS (mμ)
490, 510 | −60, −40 | 490, 590 | −60, +40
510, 520 | −40, −30 | 590, 520 | +40, −30
520, 530 | −30, −20 | 520, 570 | −30, +20
530, 540 | −20, −10 | 570, 540 | +20, −10
540, 550 | −10, 0 | 540, 550 | −10, 0
550, 560 | 0, +10 | 550, 560 | 0, +10
560, 570 | +10, +20 | 560, 530 | +10, −20
570, 580 | +20, +30 | 530, 580 | −20, +30
580, 590 | +30, +40 | 580, 510 | +30, −40
590, 610 | +40, +60 | 510, 610 | −40, +60


Blough (1957), which indicates that the 
spectral luminosity function of the pigeon 
is similar to that of man. No illumination 
was provided in the box other than the light 
falling on the keys, except during the presen- 
tation of grain, when a magazine light went 
on. 
Subjects.—The Ss were 38 white Carneau
pigeons reduced to 75% of their free-feeding
weight. Twenty-two experimentally naive
animals were used in Exp. 1. The 6 Ss used
in Exp. 2 had served in a previous experiment
in which 10 sessions of pecking at a 550-mμ
stimulus on a variable interval schedule were
followed by a generalization test in which
the values used in the present study were
presented. The previous experiment was
run in a different apparatus which had one
key.

Procedure.—Over several daily sessions, S
was magazine trained, conditioned to peck
at a 550-mμ key, and given 60 continuous
reinforcements. Following this, S received
10 daily sessions of variable interval (VI)
training with a mean interreinforcement interval
of about 1 min. Each session consisted
of 30 1-min. periods separated by 10 sec. of
blackout. During half the periods, one key
was illuminated by 550 mμ. During the
remaining periods, both keys were illuminated
by 550 mμ. The single stimulus appeared
for half the time on the right and for half the
time on the left key. The appearance of the
single stimulus, and the availability of reinforcement
in the two-stimulus case, were
balanced between left and right keys. The
order of conditions was randomized. Five
of the 22 Ss in Exp. 1 received training beyond 


WERNER K. HONIG 


10 sessions until they reached a criterion of 
900 responses in a single session. 

This training procedure was abbreviated 
for Exp. 2, as magazine training and condi- 
tioning of key pecking could be omitted. 
After 10 continuous reinforcements, VI training
began immediately in the manner described
above. All Ss received 7 rather than
10 sessions of VI training, and easily reached 
900 responses per session by the seventh 
session. 

On the 2 days following the last session 
of VI training, S received a generalization test 
under extinction, consisting of 189 30-sec. 
periods of stimulus presentation alternating 
with 10 sec. of blackout. Six blocks of 21 
periods each were given on the first day of 
testing and three on the second. A single 
stimulus value was presented on 11 of the 21 
periods within each block and a pair of stimu- 
lus values on the remaining 10 periods. The 
single stimulus values ranged from 490 to 610 
mμ in 10-mμ steps, with the omission of
500 and 600 mμ. The stimulus pairs were
composed of these values as summarized 
in Table 1. In Exp. 1, the pairs were com- 
posed of adjacent stimuli in the series. In 
Exp. 2, they were so chosen that the differences
in mμ between the test values and the
CS in each pair correspond to those for Exp. 1 
in size, but have opposite signs. In other 
words, in Exp. 1 the stimuli in each pair lie 
on the “same side” of the CS, while in Exp. 2 
they lie on “opposite sides.” 
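
The pairing rule just described can be checked mechanically. The sketch below transcribes the pairs of Table 1 and tests the "same side" and "opposite sides" properties; the function and variable names are illustrative only and do not come from the original report. A pair that contains the CS itself is treated here as lying on the same side, which is an assumption of this sketch.

```python
CS = 550  # training stimulus (CS), in millimicrons

# Stimulus pairs transcribed from Table 1 (wave lengths in millimicrons).
EXP1_PAIRS = [(490, 510), (510, 520), (520, 530), (530, 540), (540, 550),
              (550, 560), (560, 570), (570, 580), (580, 590), (590, 610)]
EXP2_PAIRS = [(490, 590), (590, 520), (520, 570), (570, 540), (540, 550),
              (550, 560), (560, 530), (530, 580), (580, 510), (510, 610)]


def differences(pair, cs=CS):
    """Signed difference of each member of a pair from the CS."""
    return tuple(value - cs for value in pair)


def same_side(pair, cs=CS):
    """True if the members do not straddle the CS (a member equal to the CS counts)."""
    d1, d2 = differences(pair, cs)
    return d1 * d2 >= 0


def same_sizes(pair_a, pair_b, cs=CS):
    """True if two pairs have the same absolute differences from the CS, in order."""
    sizes_a = [abs(d) for d in differences(pair_a, cs)]
    sizes_b = [abs(d) for d in differences(pair_b, cs)]
    return sizes_a == sizes_b
```

Under these definitions every Exp. 1 pair lies on one side of the CS, every Exp. 2 pair that does not contain the CS straddles it, and corresponding rows of the two lists agree in the size of their differences.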

The 21 stimulus conditions were random- 
ized within each block of presentations. 
Within a given block, 490, 520, 540, 560, 580, 
and 610 mμ were presented on one key
and 510, 530, 550, 570, and 590 mμ were
presented on the other key. This arrangement
alternated on successive blocks.

By means of this testing procedure, it was 
possible to present a generalization test 
consisting of singly presented stimulus values 
in the same session with a test consisting 
of different pairs of values. These will be 
distinguished as single-stimulus (SS) and 
double-stimulus (DS) generalization tests, 
even though they occurred in the same session. 
The testing procedure provided control for 
differences between animals and for the 
effects of extinction in the course of testing. 


Results 


Experiment 1.—The data obtained 
from the generalization tests are pre- 
sented in Fig. 1. The mean total 
responses to SS values are shown by 




the single-stimulus gradient (SSG); 
the mean total responses to the mem- 
bers of each stimulus pair are shown 
by the adjacent bars. The filled bar 
in each pair represents the stimulus 
nearer the CS. When the responses 
to each stimulus value on the DS test 
are summed across the two pairs in 
which that value appears, the double- 
stimulus gradient (DSG) is obtained. 

The SSG peaks at 550 mμ and decreases
to both sides of the CS. There
are inversions between 490 and 510
mμ and between 560 and 570 mμ.
The former inversion is small and may 
be due to random error, as the gra- 
dient is almost flat in that region. 
The latter inversion is sometimes 
found when stimuli in this region 


are equated for brightness (Honig, 
Thomas, & Guttman, 1959). 

FIG. 1. Single-stimulus and double-stimulus gradients and responses
to wave-length pairs, Exp. 1. (Abscissa: wave length in mμ; ordinate:
responses. Legend: single-stimulus gradient; double-stimulus gradient;
predicted DSG values; responses to wave-length pair $\lambda_1$, $\lambda_2$.)



The direction of preference within 
each stimulus pair agrees with the 
difference between the number of re- 
sponses given to the members of that 
pair in the SS test. Accordingly, the 
stimulus nearer 550 mμ received more
responses in all pairs except 490, 510
mμ and 560, 570 mμ. It also appears
that the degree of preference is sys- 
tematically related to the difference 
in response level indicated for the 
corresponding stimuli on the SSG. 

The DSG is very similar to the SSG 
except that the central values are 
somewhat higher. The “predicted 
DSG values” are derived from a 
method of predicting response totals 
in the DS situation which will be pre- 
sented below. 

Experiment 2.—The SSG and DSG
are so similar to those obtained in
Exp. 1, except for a somewhat lower
response level, that these are not
separately presented on a graph.

FIG. 2. Responses to wave-length pairs with corresponding single-stimulus
values, Exp. 2. (Abscissa: wave length in mμ; ordinate: responses.
Legend: values for $\lambda_1$ and $\lambda_2$ from single-stimulus gradient;
responses to stimulus pair $\lambda_1$, $\lambda_2$.)

A comparison between the responses 
to the members of each pair and the 
responses to the corresponding stimuli 
on the SS test is presented in Fig. 2, 
where the latter values are plotted 
over the appropriate bars. The direc- 
tion of preference within each stimulus 
pair is again in good agreement with 
SS values. In only two pairs is there 


a reversal: 530, 580 mμ and 510,
610 mμ.


EXPERIMENTS 3 AND 4 
Method 


Experiments 3 and 4 provide data on the 
effects of successive and simultaneous dis- 
crimination training on the SSG and DSG 
and on the distribution of responses between 


pairs of stimulus values. Aside from the
discrimination procedure, the method in
these experiments was similar to that for
Exp. 1. The same apparatus and 14 of the
same Ss were used.

Procedure.—On the day following the
completion of the generalization test in Exp.
1, S received 10 continuous reinforcements
with one key illuminated in order to reinstate
the conditioned operant. Discrimination
training then began immediately. Each
session consisted of 30 1-min. periods of
stimulus presentation alternating with 10 sec.
of blackout. Responding to 550 mμ (S+)
was reinforced on the VI schedule used previously.
Responding to 560 mμ (S−) was
never reinforced, and the reinforcement
programmer was interrupted when it was presented
in Exp. 3.

In the successive discrimination training
of Exp. 3, both keys were illuminated either
by S+ or S− during each period. S+ and
S− were each presented for 15 periods in a
session. The order was randomized but
excluded the presentation of the same condition
for more than three consecutive periods.


PREFERENCE, TRANSPOSITION, AND TRANSPOSITION-REVERSAL


Reinforcement was available at the left key 
for half of the S+ periods and at the right 
key for the other half. 

The criterion for discrimination was 
reached when a block of 10 consecutive 1-min. 
periods was completed under the following 
conditions: (a) The block contained five 
positive and five negative periods. (b) At 
least two of the negative periods were consecutive.
(c) S gave no responses to S−.
(d) S gave 10 or more responses during each
presentation of S+. All except 1 S reached
this criterion within 10 sessions. A second
generalization test (see below) was administered
to this S under the considerations that
on Training Sessions 7 through 10, less than 
2% of its total responses were to S—, with 
no responses to S— on as many as four 
successive S— periods. 
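
The four-part criterion for the successive discrimination can be written as a short predicate. This is a minimal sketch under assumed conventions: a block is represented as a list of (stimulus, responses) tuples, and the labels "S+" and "S-" and the function name are illustrative, not taken from the original report.

```python
def meets_successive_criterion(block):
    """Check a block of 10 consecutive 1-min. periods against conditions (a)-(d).

    block: list of (stimulus, responses) tuples, stimulus being "S+" or "S-".
    """
    if len(block) != 10:
        return False
    stimuli = [stimulus for stimulus, _ in block]
    # (a) five positive and five negative periods
    if stimuli.count("S+") != 5 or stimuli.count("S-") != 5:
        return False
    # (b) at least two of the negative periods are consecutive
    if not any(a == "S-" and b == "S-" for a, b in zip(stimuli, stimuli[1:])):
        return False
    for stimulus, responses in block:
        # (c) no responses to S-
        if stimulus == "S-" and responses != 0:
            return False
        # (d) 10 or more responses during each presentation of S+
        if stimulus == "S+" and responses < 10:
            return False
    return True
```

In use, the predicate would be applied to each successive window of 10 periods within a session until one window satisfies it.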

In the simultaneous discrimination training
of Exp. 4, one key was illuminated by S+
and the other by S− during each period.
The side was randomized, with the exclusion
of more than three successive periods
with either arrangement. The criterion for
discrimination was reached when a block of
five consecutive 1-min. periods was completed
under the following conditions: (a)
S− appeared on one key for two periods and
on the other key for three periods. (b) S
gave no responses to S−. (c) S gave at least
10 responses each period to S+. Of the 8
Ss in Exp. 4, 6 reached criterion within five
sessions. The other 2 Ss stabilized
at about 20% responding to S+. They
apparently developed a chained response
of pecking at S− and then S+ in rapid
succession. This could be reinforced, since
no delay of reinforcement was contingent
on responding to S−. These 2 Ss were therefore
discarded.

Generalization testing.—When S reached
the criterion for discrimination, the training
session was discontinued. On the 2 following
days, S was given a generalization test
identical to the one it received after VI
training in Exp. 1.


Results 


A full description of the course of
discrimination training in these experiments
has been presented elsewhere
(Honig, 1958). Suffice it to
say that it differed for the two studies
in a number of respects: (a) The rate
of responding to S+ was much higher
in the successive discrimination.
(b) The median number of responses
to extinction for S− was seven times
as great in the successive discrimination.
(c) The proportion of responses
to S− was also consistently higher in
that condition. The time to reach
criterion was twice as great in Exp. 3,
but since S+ and S− were each available
only half the time in the successive
procedure, the training time in
terms of total minutes of presentation
of each stimulus was about the same.

FIG. 3. Single-stimulus and double-stimulus gradients and responses to
wave-length pairs, Exp. 3. (Abscissa: wave length in mμ; ordinate:
responses. Legend: single-stimulus gradient; double-stimulus gradient;
predicted DSG values; responses to wave-length pair $\lambda_1$, $\lambda_2$.)

FIG. 4. Single-stimulus and double-stimulus gradients and responses to
wave-length pairs, Exp. 4. (Abscissa: wave length in mμ; ordinate:
responses. Legend: single-stimulus gradient; double-stimulus gradient;
predicted DSG values; responses to wave-length pair $\lambda_1$, $\lambda_2$.)
Single-stimulus gradients.—The differences
between the SSGs following
the two training procedures may be
seen by comparing Fig. 3 and 4. The
gradient obtained after successive
discrimination entirely confirms that
obtained by Hanson (1959) in that
(a) the response level at wave lengths
above 550 mμ is virtually zero; (b) the
mode of the gradient is not at 550 mμ,
but at 540 mμ; (c) the slope of the
gradient is steeper on both sides of the
mode than after simple acquisition.
The SSG obtained in Exp. 4 does
not differ radically in form from the
postacquisition gradient of Exp. 1.
The mode remains at 550 mμ, and
there is considerable responding to
values between 560 and 610 mμ, with
a small inversion between 560 and 570
mμ. It does appear, however, that
the level of responding between 560
and 590 mμ is reduced. The number
of responses obtained on the SS test
is 32% less than that obtained for the
same 6 Ss in Exp. 1 for the values
below 550 mμ, and 57% less for the
values above 550 mμ. The largest
reduction (a mean of 75 responses, or
54%) was at 560 mμ, and the reductions
(in terms of absolute amount)
decreased with only one inversion between
560 and 610 mμ. All Ss showed
more reduction at the values above
550 mμ than below.

Responding within wave-length pairs:
Exp. 3.—The direction of preference
for all the stimulus values conforms
to the SSG. This is especially significant
for the pair 540, 550 mμ, as the
direction of preference is toward the
postdiscrimination peak of 540 mμ.
The preference of 540 mμ over S+ is
an instance of transposition of discrimination.
For the pairs comprised of
the values between 490 and 540 mμ,
the direction of preference is clearly
toward S+. These are instances of
transposition-reversal.

Experiment 4.—The direction of
preference conforms to the SSG in
all but two pairs: 510, 520 mμ and
590, 610 mμ. Both of these pairs are
comprised of values near the ends of
the gradient, where the slope is essentially
zero. There was no transposition
between 540 and 550 mμ,
quite in accordance with the absence
of a peak shift. Responding to the
negative stimulus presented alone on
the SS test was not extinguished.
The simultaneous discrimination did
not transfer completely to the successive
situation, while the successive
discrimination transferred almost perfectly
to the simultaneous situation,
as shown by the bars for the 550,
560 mμ pair in Fig. 3.

Double-stimulus gradients.—The 
DSGs are in both experiments quite 
similar to the SSGs though on the 
average somewhat higher, following 
the pattern set in Exp. 1 and 2. At 
the modes of the gradients the DSG 
points are considerably higher, and 
there are also large differences for 
Exp. 4 at 560 and 570 mμ. These
differences can be understood on the 
basis of the analysis of DS rates to be 
presented in the next section. 


Prediction of Double-Stimulus Values 


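Given the total responses $R_1$ to $\lambda_1$
and $R_2$ to $\lambda_2$ on a SS test, the total
responses $r_1$ and $r_2$ for the same values
in the pair $\lambda_1$, $\lambda_2$ on a DS test can quite
adequately be predicted by the formulae:

\[ r_1 = R_1 \left( \frac{R_1}{R_1 + R_2} \right) \]

and

\[ r_2 = R_2 \left( \frac{R_2}{R_1 + R_2} \right) \]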

Predictions from these functions have
been carried out for each S for each DS
value in each experiment to obtain
predicted values for $r_1$, $r_2$, and $r_1 + r_2$.
Mean predicted values are plotted against mean
obtained values for each stimulus pair
from Exp. 1 and 2 in Fig. 5. The three
groups of predicted values, $r_1$, $r_2$, and
$r_1 + r_2$, are separated along the vertical
axis for clarity. The diagonal lines
represent perfect prediction (x = y);
deviations from this are indicated by
the vertical distance between each point
and the line. Identical analyses carried
out for Exp. 3 and 4 are not presented
here as the general outcome was very
similar to that presented for Exp. 1 and 2.

FIG. 5. Predicted and obtained double-stimulus values, Exp. 1 and 2.
(Abscissa: responses predicted. Legend: $r_1 + r_2$ values for pair
$\lambda_1$, $\lambda_2$; values for stronger stimulus $\lambda_1$; values for
weaker stimulus $\lambda_2$; open symbols, Exp. 1; closed symbols, Exp. 2.)

From Fig. 5 it is evident that this 
method leads to satisfactory results. 
A goodness of fit test suggested by 
Lindquist (1953, pp. 344 ff.) was used 
to test the hypothesis that the obtained 
do not differ significantly from the pre- 
dicted values. The error was divided 
into two components, “vertical place- 
ment,” or differences between predicted 
and obtained means, and “departure 
from pattern,” or the error remaining 
after these means have been equated. 
Each component provides a mean square 
which can be divided by the error vari- 
ance of the obtained data to obtain an 
F ratio when the data have been cor- 
rected for systematic effects, in this 
case wave length and Ss. This analysis 
was carried out for the sets of $r_1$ and $r_2$
values for the four experiments. The
only errors reaching significance are
for $r_2$ in Exp. 1, where both vertical
placement and departure from pattern
are significant at the .001 level. This
one obtained significant departure from
prediction indicates that by the present
method, $r_2$ is systematically underestimated
following simple acquisition.

Prediction of DSG values.—Consider
the three stimulus values $\lambda_1$, $\lambda_2$, and $\lambda_3$,
with corresponding SS response totals
$R_1$, $R_2$, and $R_3$. A predicted DSG
value for $\lambda_2$ ($r_{2DSG}$) is obtained when the
predicted values to $\lambda_2$ are summed across
the pairs $\lambda_1$, $\lambda_2$ and $\lambda_2$, $\lambda_3$. The two DS
response values to $\lambda_2$ ($r_{2a}$ and $r_{2b}$) are
predicted as follows:

\[ r_{2a} = R_2 \left( \frac{R_2}{R_1 + R_2} \right) \quad \text{and} \quad r_{2b} = R_2 \left( \frac{R_2}{R_2 + R_3} \right) \]

Adding these and collecting terms:

\[ r_{2DSG} = R_2 \left( \frac{R_2}{R_1 + R_2} + \frac{R_2}{R_2 + R_3} \right) \]

Each of the fractions can run from 0 to 1.
Therefore, the coefficient for $R_2$ can run
from 0 to 2, and thus $0 < r_{2DSG} < 2R_2$.
The relative size of $r_{2DSG}$ and $R_2$ (the
corresponding obtained SSG value) depends
on the sum of the two fractions,
and this in turn depends on the shape
of the gradient at $\lambda_1$, $\lambda_2$, and $\lambda_3$. In the
case, for example, where $\lambda_2$ is a peak,
each of the fractions must be greater than
.5 since $R_2$ is greater than $R_1$ and $R_3$;
the predicted DSG value must therefore
be higher than the SSG value. In
general, the two fractions will add to 1
(thus providing equal SSG and predicted
DSG values) only if the points on the
gradient form a geometric series.³ If
the gradient is rising in linear fashion
(arithmetic series) the predicted DSG
points will lie above the SSG values.
The predicted DSG values are indicated
as crosses in Fig. 1, 3, and 4 for Exp. 1,
3, and 4. A comparison with the
obtained values supports the present
analysis; particularly where differences
between the DSGs and SSGs are predicted,
they are also obtained. The
DSG peaks, both predicted and obtained,
are considerably higher than the
SSG peaks, and for the central values,
where the gradients tend to be linear
rather than geometric, the DSG lies
above the SSG.
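
The claim about geometric and arithmetic series can be made explicit with a short derivation. The steps below are a reconstruction offered as a sketch and are not the author's own proof; they assume only that the SS totals $R_1$, $R_2$, and $R_3$ are positive.

```latex
% Sum of the two fractions in the coefficient of R_2, minus 1.
\begin{align*}
\frac{R_2}{R_1+R_2} + \frac{R_2}{R_2+R_3} - 1
  &= \frac{R_2(R_2+R_3) + R_2(R_1+R_2) - (R_1+R_2)(R_2+R_3)}{(R_1+R_2)(R_2+R_3)} \\
  &= \frac{R_2^{2} - R_1 R_3}{(R_1+R_2)(R_2+R_3)}
\end{align*}
% Geometric series: R_2^2 = R_1 R_3, so the numerator is zero and the fractions add to 1.
% Arithmetic series: R_2 = (R_1+R_3)/2, so
\begin{align*}
R_2^{2} - R_1 R_3 = \frac{(R_1+R_3)^{2}}{4} - R_1 R_3 = \frac{(R_1-R_3)^{2}}{4} > 0
  \quad \text{when } R_1 \neq R_3
\end{align*}
```

Because the denominator is positive, the sign of $R_2^2 - R_1 R_3$ decides whether the predicted DSG value lies above, on, or below the SSG value: it is zero for a geometric series and positive for a linear rise, in agreement with the statements in the text.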


DISCUSSION 


The foregoing analysis indicates that 
the direction and degree of preference 
between stimuli lying on the generaliza- 
tion gradient can be predicted from the 
independent response strengths of the 
stimuli. The occurrence of transposition 
in conjunction with a shift in the mode 
of the gradient follows from the general 
predictive principles outlined above. 
In this respect, Spence’s (1937) analysis 
of transposition and his prediction of 
transposition-reversal have been supported.
On the other hand, the difference
in results between the successive
discrimination, which produced a peak
shift and transposition, and the simultaneous,
which did not, is not anticipated
by Spence. While equivalent
criteria were demanded at the end of
discrimination training under both procedures,
in that S did not respond to S−
for five successive periods, it appears
from Fig. 4 that simultaneous discrimination
training did not result in extinction
to S− presented alone. In this
discrimination, it was only necessary
for S− to become a cue for switching to
S+ for perfect discrimination to be
manifested. In the successive discrimination,
of course, S had no alternative
response to S− available during
the negative periods to obtain reinforcements,
which resulted in extinction under
that stimulus condition.

³ This can be demonstrated algebraically
by a proof not given here.

It appears that a shift in the peak of
the PDG and the concomitant transposition
depended in the present study
on the development of genuine extinction
to S− alone, rather than a preference
for S+. But the critical factor responsible
for this extinction need not have
been in the manner of presentation of
the discriminanda; there is nothing
about the simultaneous presentation
per se that necessarily
prevented transposition. To evaluate
the factor of the manner of stimulus
[...] alternatives available for responding to
S−, and so forth. Thompson (1955)
and Riley, Ring, and Thomas (1960)
have in effect done this by making both
stimuli available on each learning trial,
but preventing the animal from comparing
the appropriate discriminanda
in the “successive” case. They obtained
more transposition when a comparison
was possible. Whether the same would
hold true for the pigeon working with
the spectral dimension is an open
question.

While the arithmetic model presented
above for the prediction of DS from SS
values is satisfactory for that purpose,
it is not derived from assumptions about,
or observations on, specific behavior
patterns in the choice situations. It is
possible, however, to identify the terms
entering the predictive formulae with
basic response dimensions in the choice
situation, and thus to suggest a pattern
of behavior that can be verified. Assume
that in the choice situation the pigeon
responds to each stimulus value at the
same rate as when that stimulus is presented
alone. This would result in a DS
response total of $R_1$ to $\lambda_1$ if there were
no interference from the presence of
the other choice stimulus. Assume
further that the other stimulus does
interfere in that its presence reduces
the total duration of responding to $\lambda_1$,
and that the total duration of responding
to both values is divided in a proportion
reflecting the rates to each value. The
proportion for $\lambda_1$ would be $R_1/(R_1 + R_2)$,
which, when multiplied by $R_1$, provides
the formula used above for the prediction
of response totals to $\lambda_1$. The corresponding
expression for $\lambda_2$ can be
similarly derived. This analysis rests,
of course, on a clear distinction between
operant rate and the duration of the
application of a given rate; while this
distinction has been supported empirically
by Gilbert (1958), it awaits direct
confirmation in the present circumstances.
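
The verbal argument can be written out in symbols. In this sketch, $T$, $\rho_1$, $\rho_2$, and $t_1$ are notation introduced here for illustration and do not appear in the original; $T$ is assumed to be the common total presentation time of a single stimulus on the SS test and of a pair on the DS test.

```latex
% rho_i: response rate to lambda_i when presented alone; t_1: time spent responding to lambda_1 in the pair.
\begin{align*}
\rho_1 &= \frac{R_1}{T}, \qquad \rho_2 = \frac{R_2}{T} \\
t_1 &= T \, \frac{\rho_1}{\rho_1 + \rho_2} = T \, \frac{R_1}{R_1 + R_2} \\
r_1 &= \rho_1 \, t_1 = R_1 \left( \frac{R_1}{R_1 + R_2} \right)
\end{align*}
```

The presentation time cancels, which is why the predictive formula can be stated in response totals alone; exchanging the subscripts gives the corresponding expression for $r_2$.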


SUMMARY 


The relationship between stimulus preference
and the response strength of singly
presented stimuli was investigated with the
use of the generalization gradient to provide
stimuli of different strengths. After being
trained to peck at a 550-mμ stimulus on a
VI schedule, pigeons were given two concurrent
generalization tests: one consisting
of single stimulus values ranging from 490
to 610 mμ, and one consisting of pairs of such
values. The direction of preference within
each pair was found to be in direct accordance
with the number of responses obtained on the
single values during the generalization test.
Two groups of birds then received discrimination
training between 550 mμ as S+ and 560
mμ as S−, one with successive stimulus
presentations and one with simultaneous.
After this, the generalization and preference 
tests were administered a second time. The 
successive discrimination produced a shift 
in the mode of the gradient away from S— 
and a concurrent transposition of the dis- 
crimination between S+ and the new mode, 
with transposition-reversal beyond that mode. 
The simultaneous discrimination produced 
little change in the gradient and no concurrent 
transposition. 




An arithmetic model is proposed for the
prediction of responses to members of stimulus
pairs from single-stimulus values. The
determination of stimulus preference by the 
generalization gradient follows directly, and 
transposition is seen to be no more than a 
special case. The differential results obtained 
from successive and simultaneous discrimina- 
tions are discussed with reference to the 
actual extinction to S— obtained from each 
procedure. The operant situation is analyzed 
in terms of some simple response dimensions 
to provide a basis for the arithmetic model 
from which double-stimulus values are pre- 
dicted. 


REFERENCES 


BAKER, R. A., & LAWRENCE, D. H. The
differential effects of simultaneous and
successive stimulus presentation on transposition.
J. comp. physiol. Psychol., 1951,
44, 378-382.

BLOUGH, D. S. Spectral sensitivity in the
pigeon. J. Opt. Soc. Amer., 1957, 47, 827-

EHRENFREUND, D. A study of the transposition
gradient. J. exp. Psychol., 1952,
43, 81-87.

GILBERT, T. F. Fundamental dimensional
properties of the operant. Psychol. Rev.,
1958, 65, 272-282.

GRICE, G. R. Visual discrimination learning
with simultaneous and successive presentation
of stimuli. J. comp. physiol. Psychol.,
1949, 42, 365-375.

GUTTMAN, N., & KALISH, H. I. Discriminability
and stimulus generalization. J.
exp. Psychol., 1956, 51, 79-88.

HANSON, H. M. Effects of discrimination
training on stimulus generalization. J.
exp. Psychol., 1959, 58, 321-334.

HONIG, W. K. Prediction of preference,
transposition, and transposition-reversal
from the generalization gradient. Unpublished
doctoral dissertation, Duke University,
1958.

HONIG, W. K., THOMAS, D. R., & GUTTMAN,
N. Differential effects of continuous
extinction and discrimination training on
the generalization gradient. J. exp. Psychol.,
1959, 58, 145-152.

KENDLER, T. S. An experimental investigation
of transposition as a function of the
difference between training and test stimuli.
J. exp. Psychol., 1950, 40, 552-562.

LINDQUIST, E. F. Design and analysis of
experiments in psychology and education.
Boston: Houghton Mifflin, 1953.

RILEY, D. A., RING, K., & THOMAS, J. The
effect of stimulus comparison on discrimination
learning and transposition. J. comp.
physiol. Psychol., 1960, 54, 415-421.

SPENCE, K. W. The differential response
in animals to stimuli varying within a
single dimension. Psychol. Rev., 1937, 44,
430-444.

THOMPSON, R. Transposition in the white
rat as a function of stimulus comparison.
J. exp. Psychol., 1955, 50, 185-190.


(Received July 23, 1961)


Journal of Experimental Psychology
1962, Vol. 64, No. 5, 249-


RESISTANCE TO EXTINCTION AS A FUNCTION OF AGE
AND SCHEDULES OF REINFORCEMENT ¹


NORMAN KASS ²


University of Minnesota


The purpose of this study is to 
investigate developmental changes in 
resistance to extinction as a function
of the percentage of reinforcement 
provided during acquisition. Rela- 
tively little information is available 
concerning the effects of partial rein- 
forcement on extinction with children, 
and no studies investigating age 
changes have been reported. 

There is reason to assume that the 
development of cognitive processes 
and the growth of experience with 
increasing age would result in in- 
creasingly rapid recognition of the 
change in reinforcement schedule be- 
tween an acquisition and extinction 
period in a simple operant task, with 
a resulting decrease in the resistance 
to extinction following partial rein- 
forcement. Subjects at four age 
levels were tested, and in order to 
increase the generality of the findings, 
four partial reinforcement schedules 
were employed. Additional groups 
received 0% and 100% reinforcement. 


METHOD 


Subjects.—The Ss were 216 preschool and
elementary school boys and girls, approximately
evenly divided by sex. The Ss were
obtained from the Institute of Child Development
Nursery School, the Village Nursery
[...] elementary school in Minneapolis. All
schools enrolled Ss of approximately the same
intellectual and socioeconomic levels.

¹ [...] author was a National Institute of Mental
Health Postdoctoral Fellow at the Institute
of Child Development, University of Minnesota
[...] and at the University of Minnesota laboratory
schools.

² Now at San Diego State College.


Apparatus.—The apparatus was a simulated
slot machine, consisting of a blue wooden
cabinet with dimensions 12 × 14 × 12 in.
Centered on the front of the cabinet and 3} in.
from the top was a circular opening 1 in. in
[...] The box containing the pennies
was illuminated with a 7-w. lamp. Pennies
were dispensed through a rectangular opening
[...] }-in. wooden shaft, to which a 3 × } in.
aluminum handle was attached, projected
from the front of the cabinet 3 in. from the
[...] E to operate the apparatus. A 12 × 14 in.
board was attached to each side of the machine
to prohibit S from observing E load the
machine.

Attached to the inside base of the cabinet
was a dispensing device, which consisted of
a wheel 11 in. in diameter, with 24 slots
circling the outside edge. The wheel was 
attached to the base by a pivot with a 
spring-loaded brake which allowed the wheel 
to turn without spinning freely. This wheel 
was rotated one slot at a time by means 
of a pawl. The pawl rode in a track and was 
driven by a lever attached to the wooden 
shaft described above. When S pulled the 
handle, it rotated the wheel one slot forward 
and also activated a counter to record the 
response. On the base of the cabinet was an 
opening over which the wheel rotated with 
dimensions corresponding to those of the 
bottom of each slot. This opening was con- 
nected by means of a chute to the opening in 
the front of the machine. The pennies placed 
in the various slots could be dispensed when 
a particular slot was in line with the opening 
in the base. With this arrangement any 
variable-ratio schedule could be presented. 
Each machine had to be reloaded after it was 
played 24 times. 




The apparatus was extremely simple to 
operate, and involved pulling the handle 
down and releasing it. When the handle 
was pulled down, the light which illuminated 
the pennies went out and relit when the 
handle was returned to its original position. 

At the completion of the experiment Ss 
traded their pennies for prizes, such as toy 
wrist watches, badges, umbrellas, flutes, and 
kaleidoscopes. 

Design—The variables employed were 
percentage of reinforcement and chronological 
age of S. The six reinforcement schedules 
provided reinforcement on 0%, 16⅔%, 33⅓%,
60%, 80%, and 100% of the acquisition 
trials. The Ss were selected from four age 
levels: 4, 6, 8, and 11 yr. 

Each S was given an acquisition and an 
extinction series. The acquisition series con- 
sisted of 30 trials. During this period S 
received, according to his reinforcement 
schedule, either 0, 5, 10, 18, 24, or 30 rein- 
forcements. The reinforcements were pre- 
sented randomly with the restriction that all 
Ss except those in the 0% condition were 
reinforced on the last trial of the acquisition 
series. 
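
The restricted randomization described above can be sketched as follows. This is only an illustration: the function name `make_schedule` and the use of Python's `random` module are assumptions, and the report does not state how the random orders were actually produced.

```python
import random


def make_schedule(n_reinforced, n_trials=30, rng=None):
    """Return a list of booleans, one per acquisition trial (True = reinforced).

    Reinforced trials are placed at random, except that the last trial is
    always reinforced whenever at least one reinforcement is scheduled.
    The 0% condition (n_reinforced == 0) has no reinforced trials at all.
    """
    rng = rng or random.Random()
    schedule = [False] * n_trials
    if n_reinforced == 0:
        return schedule
    schedule[-1] = True  # restriction: final acquisition trial is reinforced
    # Scatter the remaining reinforcements over the first n_trials - 1 trials.
    for i in rng.sample(range(n_trials - 1), n_reinforced - 1):
        schedule[i] = True
    return schedule


# One schedule per condition: 0, 5, 10, 18, 24, or 30 reinforcements in 30 trials.
for n in (0, 5, 10, 18, 24, 30):
    print(n, make_schedule(n, rng=random.Random(n)))
```

Passing a seeded `random.Random` makes a given schedule reproducible, which is a convenience of the sketch rather than a feature of the original procedure.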

The study thus employed a 4 × 6 design 
with 9 Ss in a cell. The Ss at each age level 
were assigned to the reinforcement conditions 
at random. 

Procedure.—The Ss were obtained by E 
from the classroom and taken to an experi- 
mental room located in a quiet section of the 
building. The S was told that he was going 
to play the “penny machine game” and was 
seated before the apparatus. The pennies 
in the illuminated window were pointed out 
by E. 


Fig. 1. Mean log responses to extinction as 
a function of CA of Ss. (Ordinate: mean log 
responses to extinction; abscissa: chronological 
age, 4, 6, 8, and 11 yr.) 


NORMAN KASS 


The E said: 

You see this machine is full of pennies. 
Do you see all these pennies? Now in this 
game you try to get as many pennies out 
of the machine as you can, because the 
more pennies you get out of the machine, 
the better the prize you can buy with the 
pennies. 


The E showed S the prizes he could buy. 
The E continued: 


See all the prizes we have that you can 
buy with the pennies you get out of the ma- 
chine. Now I'll show you how to play the 
game. As long as the light is on, you can 
pull the handle and the pennies come out of 
here. You can play the game as long as you 
want. You don't have to hurry. Just 
tell me when you want to stop. Remember, 
the more pennies you get, the better the 
prize you can buy with your pennies. 


The last statement was repeated on Trials 10 
and 25 of the acquisition series. There was 
no indication from E when the acquisition 
series ended and the extinction series began. 

Comments and questions from S were 
ignored by E, and if S persisted, E replied, 
“You can play the game as long as you like. 
Tell me when you want to stop playing.” 

Each S played until he indicated he wanted 
to stop or after 370 extinction trials, when S 
was stopped by E. Only 8 Ss failed to com- 
plete the 30 acquisition trials and were re- 
placed. All of these Ss were at the 4-yr. age 
level and were in either the 0% or 100% 
group. 

At the end of the session S was allowed 
to choose a prize to buy with his pennies. 
The Ss in the 0% group were told that even 
though they had not won any pennies, they 
could have a prize for playing so well. The 
Ss were asked not to tell other children about 
the game. 


RESULTS AND DISCUSSION 


The primary score used in the 
analysis of the results was the number 
of responses made during the extinc- 
tion period. Because of the hetero- 
geneity of variance among the cells, 
a log transformation of the scores 
was performed. 

The results related to age differences 
are shown in Fig. 1, which 
presents the mean log of the number 
of responses made during the extinction 
period at each CA level for all 
percentages of reinforcement combined. 

The results related to percentage 
of reinforcement are shown in Fig. 2, 
which presents the mean log responses 
to extinction for each percentage of 
reinforcement for all age levels com- 
bined. The results are in line with 
those obtained with college Ss by 
Lewis and Duncan (1956, 1957). 
There was a consistent decrease in 
number of responses to extinction 
with increasing percentage of reinforcement 
from 16⅔% to 100%. The 
Ss in the 0% group made somewhat 
fewer responses during extinction than 
Ss in the 16⅔% group, but made a 
greater number of responses than Ss 
in any other group. 

In an analysis of variance of the transformed 
scores, differences associated 
with CA were highly significant 
(F = 9.54, df = 3/192, P < .001), as 
were the differences associated with 
percentage of reinforcement (F = 15.65, 
df = 5/192, P < .001). The interaction 
term was not significant (F = .45). 
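
The degrees of freedom reported above follow from the 4 × 6 layout with 9 Ss per cell. The sketch below runs a balanced two-way analysis of variance on log-transformed scores; the data are synthetic stand-ins rather than the study's scores, and the function name `two_way_anova` and the choice of base-10 logarithms are assumptions made for illustration.

```python
import math
import random


def two_way_anova(cells):
    """Balanced two-way between-subjects analysis of variance.

    cells[i][j] is the list of scores for level i of factor A and level j
    of factor B; every cell must hold the same number of scores.
    Returns (df, F): df is keyed by "A", "B", "AxB", "error"; F by the
    three effects, each tested against the within-cell error mean square.
    """
    a, b, n = len(cells), len(cells[0]), len(cells[0][0])
    grand = sum(x for row in cells for cell in row for x in cell) / (a * b * n)
    a_means = [sum(x for cell in row for x in cell) / (b * n) for row in cells]
    b_means = [sum(x for row in cells for x in row[j]) / (a * n) for j in range(b)]
    cell_means = [[sum(cell) / n for cell in row] for row in cells]

    ss = {
        "A": b * n * sum((m - grand) ** 2 for m in a_means),
        "B": a * n * sum((m - grand) ** 2 for m in b_means),
        "AxB": n * sum(
            (cell_means[i][j] - a_means[i] - b_means[j] + grand) ** 2
            for i in range(a) for j in range(b)
        ),
        "error": sum(
            (x - cell_means[i][j]) ** 2
            for i in range(a) for j in range(b) for x in cells[i][j]
        ),
    }
    df = {"A": a - 1, "B": b - 1, "AxB": (a - 1) * (b - 1), "error": a * b * (n - 1)}
    ms_error = ss["error"] / df["error"]
    F = {k: (ss[k] / df[k]) / ms_error for k in ("A", "B", "AxB")}
    return df, F


# Synthetic stand-in data: 4 ages x 6 schedules x 9 Ss, log10 of response counts.
rng = random.Random(0)
cells = [
    [[math.log10(rng.randint(1, 370)) for _ in range(9)] for _ in range(6)]
    for _ in range(4)
]
df, F = two_way_anova(cells)
print(df)  # degrees of freedom for age, schedule, interaction, and error
```

With 216 Ss in 24 cells the error term has 24 × 8 = 192 degrees of freedom, which is the denominator given for both main effects.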

The Duncan multiple range test 
(Duncan, 1955) was used to deter- 
mine which of the age levels and per- 
centages of reinforcement differed 
significantly. The scores of Ss at CA 
4 differ significantly (P < .01) from 
those of Ss at the other CA levels; the 
scores of the remaining groups do not 
differ significantly from each other. 
The following differences according to 
percentage of reinforcement are significant 
at the .01 level: 16⅔% vs. each 
of the larger percentages; 0% vs. 
60%, 80%, 100%; 33⅓% vs. 80% 
and 100%. 


Fig. 2. Mean log responses to extinction 
as a function of percentage of reinforcement 
during the acquisition series. (Abscissa: percent 
reinforcement during acquisition.) 

The results may be interpreted by the 
hypothesis that the rate of extinction 
is decreased as the discrimination between 
the acquisition and extinction 
series becomes more difficult. The maximum 
similarity between acquisition and 
extinction trials occurs in the 0% group, 
where there is no way for Ss to discrim- 
inate the end of the first series and the 
beginning of the second. The maximum 
dissimilarity occurs in the 100% group. 
The slight but nonsignificant increase 
from the 0% to the 16⅔% group provides 
the only deviation from a consecutively 
decreasing number of trials to extinction 
with increasing percentage of reinforce- 
ment. The lack of a significant inter- 
action between percentage of reinforce- 
ment and CA indicates that, in general, 
the trend of the results is similar at each 
age level. There was a curvilinear rela- 
tionship found between CA and number 
of responses to extinction. These dif- 
ferences do not agree with those which 
would be expected from the hypothesis 
that increasing age results in an increas- 
ing ability to discriminate between 
changes in patterns of reinforcement. 
According to such an hypothesis the 
older Ss would be expected to show the 
least resistance to extinction. Since the 
only significant differences were those 
found between Ss at CA 4 and Ss at 
higher CA levels, the main problem is to 
determine why the youngest Ss should 
extinguish more readily. This finding 
might be explained in terms of the 
developmental changes in length of 
attention span. Preschool children may 
not persist in a task as long as older 
children. Further evidence for this 
effect is offered by the fact that all Ss 
who failed to complete the 30 acquisition 
trials were at CA 4. Thus even during 
acquisition the younger Ss stopped 
responding sooner. This indicates that 
the extinction differences found may only 
reflect some other more basic difference 
which would be manifest over a wider 
range of conditions than is represented 
in the present experiment. 


SUMMARY 


The purpose of this study was to deter- 
mine the effects of six different percentages of 
reinforcement upon extinction of a lever- 
pulling response. Four age levels were em- 
ployed to determine developmental changes 
in resistance to extinction. The apparatus 
employed was a simulated slot machine 
designed for use with young children. 

A total of 216 children at CA 4, 6, 8, and 
11 received reinforcement on either 0, 1/6, 1/3, 
3/5, 4/5, or all of the trials of a 30-trial acquisition 
period. The extinction trials were continued 
until S wished to stop or until 370 extinction 
responses had been made. The response 
measure was the total number of responses 
to extinction. 

A decrease in number of responses during 
extinction was found with increasing per- 
centages of reinforcement. The least number 
of trials to extinction was shown by Ss at CA 
4 and the greatest by Ss at CA 8. There was 
no significant interaction between CA and per- 
centage of reinforcement. 


REFERENCES 


Duncan, D. B. Multiple range test and 
multiple F tests. Biometrics, 1955, 11, 
1-42. 

Lewis, D. J., & Duncan, C. P. Effect of 
different percentages of money reward 
on extinction of a lever-pulling response. 
J. exp. Psychol., 1956, 52, 23-27. 

Lewis, D. J., & Duncan, C. P. Expectation 
and resistance to extinction of a lever- 
pulling response as functions of percentage 
of reinforcement and amount of reward. 
J. exp. Psychol., 1957, 54, 115-120. 


(Received July 24, 1961) 


Journal of Experimental Psychology 
1962, Vol. 64, No. 3, 253-257 


SUBJECTIVE SCALE OF FORCE FOR A LARGE 
MUSCLE GROUP¹ 


HANNES EISLER² 


Harvard University 


Although subjective force is one of 
the classical continua studied in psy- 
chophysics, most studies of force have 
been limited to lifted weights. Re- 
cently, however, Stevens and Mack 
(1959) scaled subjective force, as 
exerted in the squeezing of a handle. 
Their experiments confirmed the psychophysical 
power law; the exponent 
obtained was 1.7. Borg and Dahlström 
(1960) investigated muscular 
work carried out on a bicycle ergometer. 
Though the variable they 
studied was power rather than force, 
their experiments can also be included 
in investigations of muscular effort. 
They found an exponent of 1.6. 

The investigation reported here 
consists of a series of experiments 
carried out to scale the subjective 
force exerted by a comparatively large 
muscle group. The muscle group 
chosen contains those muscles that 
are used in pushing or pressing a pedal 
with the foot in a horizontal, forward 
direction. In addition to being in- 
trinsically interesting, these experi- 
ments make it possible to compare the 
subjective scale of force exerted by the 
large leg muscles with the scale ob- 
tained for the smaller muscles of the 
forearm used in squeezing a hand 
dynamometer. To summarize the results, 
a power function was obtained 
for foot pressure also, and its exponent 
did not differ appreciably from the 
one found earlier for handgrip. 

1 This research was supported by a grant 
from the National Science Foundation (Psy- 
cho-Acoustic Laboratory Report No. 
PNR-260). 

2 Now at Psychology Department, University 
of Stockholm, Sweden. 


GENERAL METHOD 


Scaling.—Five methods were used: mag- 
nitude estimation, magnitude production, 
matching of force of handgrip to force of foot 
pressure and the reverse, and cross-modality 
matching of both handgrip and foot pressure 
to white noise. The magnitude estimation 
was repeated once, and the matching of foot 
pressure to handgrip twice with slight vari- 
ations, so that, all in all, eight experiments 
enter into the present study. Except in the 
first magnitude estimation experiment in 
which O gave four judgments for every 
stimulus, two values for every stimulus were 
obtained from each O in all experiments. 

Subjects.—Twelve Os were used in each 
experiment. The groups largely overlapped 
and were identical for the first magnitude 
estimations and the magnitude productions, 
as well as for all the experiments in which foot 
pressure and handgrip were matched. Alto- 
gether, 16 men and 5 women took part in the 
investigation, mostly graduate students of 
psychology and staff members. 

Apparatus.—In all the experiments, O sat 
in a rigid chair and with his right foot pressed 
a pedal in a forward, horizontal direction. 
The distance through which the pedal moved 
was very small. 

In the experiments with magnitude estima- 
tion and the matching of handgrip to foot 
pressure, the pedal was connected through a 
lever system to the platform of a large beam 
scale. The measuring beam of the scale was 
loaded with a given stimulus weight, and O, 
by pressing the pedal, brought the scale into 
equilibrium. Equilibrium was indicated to 
him by a pointer on a display. 

In the experiment in which force of foot 
pressure was matched to force of handgrip, a 
handle was connected with the platform of 
the scale. Forces exerted as responses (foot 
pressure in magnitude production, foot pres- 
sure when matched to handgrip, etc.) were 
measured with tensile gauges connected with 
the pedal, the handle, or both, depending on 
the particular experiment. 

When force of foot pressure and force of 
handgrip were matched to white noise, Os 
listened to the band of noise (75 to 2400 cps) 
through a pair of earphones. 




Sources of error.—Three sources of error 
were evident in the series of experiments re- 
ported, though not every error was to be 
found in every experiment: 

1. The pointer in the display could not be 
kept completely stationary. There seemed 
to be some correlation between the speed of 
its movement and the force exerted. This 
movement may or may not have influenced 
O's judgment. Since this source of error did 
not exist in all the experiments and since the 
agreement among all the experiments is quite 
good, it is probably of only minor importance. 

2. People differ in strength. The problem 
caused by this state of affairs could be at- 
tacked by (a) limiting the range investigated 
to the maximum force of the weakest O, but 
restriction of range is not an appealing solu- 
tion; (b) picking a particular sample of strong 
Os, but restriction of sample is not an attrac- 
tive solution either; (c) or dropping Os as the 
forces required were increased. This solution 
was chosen. It is a legitimate solution if the 
exponent of the power function is independent 
of the strength of O. In that case the only 
effect would be a decrease in the reliability for 
strong stimuli, since the points obtained are 
based on fewer Os. 

3. While he pushed the pedal, O had to 
hold his leg up by his own force. For small 
stimuli the force required was apparently 
greater than the force of pushing. For large 
stimuli the friction between the sole of the 
shoe and the pedal appeared to be sufficient to 
eliminate this extra force. The Os seemed to 
be trying to neglect the holding force and only 
to take the pushing force into account. 


Fig. 1. Subjective force of foot pressure 
in logarithmic units as a function of physical 
force in log pounds. (The upper scale on the 
abscissa refers to the magnitude estimation 
experiment in which O's leg was supported.) 




MAGNITUDE ESTIMATION AND 
MAGNITUDE PRODUCTION 


Method 


Whether the function obtained by magni- 
tude estimation was independent of the 
strength of Os was tested crudely in two ways. 
(a) The Os were divided into a group of 7 
who were able to exert the greatest force used, 
400 lb., and a second group of 5 who were not. 
Medians of the magnitude estimates of the 
six lowest stimuli common to the two groups 
were computed for each group separately. 
A plot of the medians of the two groups 
against each other was essentially linear. 
(b) Medians were computed, for each group 
separately, of the magnitude estimates for the 
highest stimulus force each O could exert, the 
second highest stimulus presented, and so on. 
This procedure is meaningful when the stimuli 
are spaced logarithmically, on the assumption 
that the power law holds for this case. Again, 
when the values for one group are plotted 
against those for the other, the result is es- 
sentially linear. Thus the function relating 
subjective force to physical force appears to be 
largely independent of the strength of the 
particular O. 

The fact that a plot of the logarithms of 
subjective force (geometric means) against 
the logarithms of force in pounds was some- 
what curved indicates that the “threshold” 
cannot be neglected. Consequently, the 
values were corrected for “effective threshold” 
or “subjective zero,” by a method developed 
by Ekman (1961). 
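
The idea of a threshold correction can be illustrated with a short sketch. It is not Ekman's (1961) procedure, which is not reproduced here; a plain grid search is substituted, choosing the threshold that makes the log-log plot most nearly linear. The function names, the candidate grid, the constant of 2.0, and the synthetic data are all assumptions for illustration.

```python
import math


def fit_line(xs, ys):
    """Ordinary least-squares line; returns (slope, intercept, residual SS)."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = my - slope * mx
    sse = sum((y - (intercept + slope * x)) ** 2 for x, y in zip(xs, ys))
    return slope, intercept, sse


def fit_power_with_threshold(phi, psi, candidates):
    """Fit psi = k * (phi - phi0) ** n, trying each candidate threshold phi0.

    For every phi0 the relation is linearized as
    log psi = log k + n * log(phi - phi0), and the phi0 giving the smallest
    residual sum of squares is kept. Returns (phi0, n, k).
    """
    best = None
    ys = [math.log10(s) for s in psi]
    for phi0 in candidates:
        if min(phi) - phi0 <= 0:
            continue  # the logarithm needs a positive corrected stimulus
        xs = [math.log10(p - phi0) for p in phi]
        slope, intercept, sse = fit_line(xs, ys)
        if best is None or sse < best[3]:
            best = (phi0, slope, 10 ** intercept, sse)
    return best[:3]


# Synthetic, noise-free data with a negative threshold (phi0 = -8.9 lb.).
phi = [10, 20, 40, 80, 160, 320]
psi = [2.0 * (p + 8.9) ** 1.5 for p in phi]
candidates = [c / 10 for c in range(-200, 100)]  # -20.0 to 9.9 lb. in 0.1 steps
phi0, n, k = fit_power_with_threshold(phi, psi, candidates)
print(phi0, n, k)
```

A negative threshold `phi0` corresponds to a positive correction added to each physical value, which matches the sign convention used in the text for the correction.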


Results 


The data of the magnitude estima- 
tion experiment indicated that sub- 
jective force is a power function by 
Ekman’s criterion. In contrast to 
most other scaling experiments, how- 
ever, this one gave a negative 
threshold. The threshold correction 
amounted to +8.9 lb. The logarithms 
of the magnitude estimates are 
plotted in Fig. 1 against the loga- 
rithms of the corrected (circles) and 
uncorrected (crosses) physical values. 
A least squares fit yielded a slope, 
that is to say, an exponent of 1.51. 

The magnitude production data 
were treated in the same way as the 
magnitude estimation data. For magnitude 
production, the threshold was 
positive and the correction small, 
−1.8 lb. The data are also shown in 
Fig. 1. The exponent was 1.70. 

It is common to find this discrep- 
ancy between the exponents for mag- 
nitude estimation and magnitude pro- 
duction; S. S. Stevens has called it the 
“regression effect” (see e.g., Reynolds 
& Stevens, 1960). The O decreases 
the range of the variable over which 
he has control—numbers in magni- 
tude estimation and forces in magni- 
tude production. 

The consensus of these two experi- 
ments is that, after application of 
threshold corrections, subjective force 
of foot pressure grows as a power 
function of physical force with an 
exponent of approximately 1.6. The 
result agrees with the findings of Borg 
and Dahlström (1960), derived from 
fractionation experiments. They too 
obtained the exponent 1.6 (and a 
negative threshold). 


In order to find out whether the negative 
threshold found in the magnitude estimation 
experiment was related to the third source 
of error mentioned above—the force required 
to hold up the leg—the magnitude estimation 
experiment was repeated with the variation 
that O's heel was supported by a leather belt. 
The results are given in Fig. 1. The negative 
threshold was confirmed, although the threshold 
correction was smaller in this experiment, 
+3.8 lb. The exponent, however, decreased 
to 1.31. Since Os were divided about whether 
the supporting belt was an improvement, and 
some of them found it more uncomfortable 
than the first experiment, the results of this 
experiment will not be taken into account in 
the conclusion of the whole series of ex- 
periments. 


MATCHING EXPERIMENTS 


Method 


The procedure used in the experiment in 
which handgrip was matched to foot pressure 
was similar to that of the experiment with 
magnitude estimation, except that, instead of 
responding with numbers, O now equated 
squeezes on a handle with the force exerted by 
pressing the pedal. 

Matching foot pressure to handgrip is the 
inverse of the procedure described above. 
Three variations were carried out. Two of 
them differed from each other only in the 
stimuli chosen and the leverage applied be- 
tween handle and scale. The leverage was 
changed in such a way as to increase the pre- 
cision of the forces presented by E. In the 
third variation the Os were asked to press the 
pedal, not simultaneously with the hand 
squeezes, but after them. 

In order that the results for simultaneous 
and successive matching could be compared, 
the medians of the two experiments were 
plotted against each other. Since a straight 
line was obtained, indicating that successive 
matching made no difference, the results from 
the experiment with successive matching were 
not treated further. 

In the final experiment, Os were asked to 
match in random succession both force of foot 
pressure and force of handgrip to a band of 
white noise, whose intensity varied between 
30 and 100 db. re 0.0002 dyne/cm². The 
point of this experiment was to match force of 
foot pressure with force of handgrip indirectly. 
A noise of a certain intensity was presented 
and O was informed whether he was to press 
the pedal or to squeeze the handle. This was 
the only experiment in which the weakest Os 
did not drop out as the forces were increased. 


Results 


All the matching experiments gave 
curvilinear functions when force of 
handgrip was plotted against force of 
foot pressure in log-log coordinates 
(geometric means). (For the experi- 
ment with white noise, force of hand- 
grip matched to a certain noise in- 
tensity was plotted against the force 
of foot pressure matched to the same 
noise intensity.) Plotted linearly, 
however, straight lines (that did not 
pass through the origin) were ob- 
tained. This outcome indicated (a) 
a nonzero threshold for at least one of 
the continua (with no way to tease out 
the separate threshold corrections), 
and (b) a power relation between 
subjective force of foot pressure and 
subjective force of handgrip, with an 
exponent of 1. Thus, if either of the 
two continua grows as a power function 
of physical magnitude, the other 
grows with the same exponent. 
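
The step from a straight line in linear coordinates to equal exponents can be spelled out as follows. The threshold symbols $\phi_{0h}$ and $\phi_{0f}$ and the constant $c$ are notation assumed here for illustration; they do not appear in the original text.

```latex
% Assumed notation: \phi_{0h} and \phi_{0f} are the (unknown) thresholds,
% and c > 0 is a constant that depends on k_h, k_f and n_h.
\begin{align*}
\psi_h &= k_h(\phi_h - \phi_{0h})^{n_h}, \qquad
\psi_f = k_f(\phi_f - \phi_{0f})^{n_f}, \\
\psi_h = \psi_f \;&\Longrightarrow\;
\phi_h - \phi_{0h} = c\,(\phi_f - \phi_{0f})^{n_f/n_h}.
\end{align*}
```

The right-hand side is linear in $\phi_f$ only when $n_f/n_h = 1$. In that case the line has intercept $\phi_{0h} - c\,\phi_{0f}$, so a line that misses the origin signals a nonzero threshold on at least one continuum, but the single intercept cannot separate the two corrections.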

The straight lines were computed 
according to a variation of the method 
of least squares. The customary 
method weights the high values more 
than the low values. Because of the 
comparatively great ranges covered, 
the application of that method would 
have introduced a heavy bias. Therefore 
the sum of squares of the relative 
deviations, 

$$\sum \left[ \frac{y - y_1}{y} \right]^2,$$ 

where $y$ equals experimental values 
and $y_1$ computed ones, has been 
minimized rather than the sum of 
squares of the absolute deviations, 

$$\sum (y - y_1)^2.$$ 

The lines obtained were normalized so 
that they passed through the origin 
with a slope of 1, and they are summarized 
in Fig. 2. Figure 3 shows 
the outcome of the individual matching 
experiments in log-log plots after 
the threshold correction has been 
arbitrarily carried out on the foot 
pressure continuum. The lines 
drawn in Fig. 3 have a slope of 1. 

Fig. 2. Force of handgrip as a function of 
force of foot pressure from four matching experiments. 
(The values are normalized such 
that the four different straight lines obtained 
by a variation of the method of least squares 
are made to coincide as one line, which passes 
through the origin and has a slope of 1. Because 
the range covered is great, it is divided 
into two sections which are superimposed. 
Symbols distinguish handgrip matched to foot 
pressure, foot pressure matched to handgrip, and 
foot pressure and handgrip matched to white noise.) 

Fig. 3. Force of handgrip (in log pounds) 
as a function of force of foot pressure (in log 
pounds) from four matching experiments. 
(The values of foot pressure are corrected for 
“threshold.”) 
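
The relative least-squares criterion described above can be sketched as a weighted fit. The sketch takes the relative deviation to be (y − y1)/y, that is, relative to the experimental value, which is an assumption about the exact criterion; the function name and the sample values are likewise hypothetical.

```python
def fit_relative_least_squares(xs, ys):
    """Fit y1 = a + b * x by minimizing sum(((y - y1) / y) ** 2).

    Dividing each residual by the observed y is equivalent to weighted
    least squares with weights 1 / y**2, so the usual closed-form
    solution for a weighted straight-line fit applies.
    All y values must be nonzero.
    """
    ws = [1.0 / (y * y) for y in ys]
    sw = sum(ws)
    swx = sum(w * x for w, x in zip(ws, xs))
    swy = sum(w * y for w, y in zip(ws, ys))
    swxx = sum(w * x * x for w, x in zip(ws, xs))
    swxy = sum(w * x * y for w, x, y in zip(ws, xs, ys))
    b = (sw * swxy - swx * swy) / (sw * swxx - swx ** 2)
    a = (swy - b * swx) / sw
    return a, b


# Hypothetical matching data lying exactly on the line y = 3 + 0.5 x.
xs = [5.0, 10.0, 20.0, 40.0, 80.0, 160.0]
ys = [3.0 + 0.5 * x for x in xs]
a, b = fit_relative_least_squares(xs, ys)
print(a, b)
```

Because each point is weighted by 1/y², a given percentage error counts the same at the low and high ends of the range, which is the property the text asks for when the forces span a wide range.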


It is of interest to notice that the 
parameter k in Stevens’ power law 

$$\psi = k\phi^{n},$$ 

where $\psi$ refers to a subjective magnitude 
and $\phi$ to a physical magnitude, seems to 
depend on the particular experimental 
procedure. In matching, two sets of 
subjective magnitudes are equated (Stevens, 
1959): 

$$\psi_h = k_h \phi_h^{n_h}$$ 
$$\psi_f = k_f \phi_f^{n_f}.$$ 

In the present case, since $n_h = n_f$, this 
yields 

$$\phi_h = (k_f/k_h)\,\phi_f,$$ 

where the subscripts h and f refer to 
handgrip and foot pressure. It is the 
factor $k_f/k_h$ whose logarithm constitutes 
the intercepts in Fig. 3. Note that the 
data of both experiments in which foot 
pressure was matched to handgrip scatter 
around the same straight line, whereas 
the other two matching experiments 
yield different intercepts. Some kind of 
rule of least effort seems to hold, more 
conspicuously for foot pressure: the 
forces over which O has control are small 
relative to those presented by E, thus 
yielding a large coefficient k; when both 
forces are matched to white noise, the 
ratio $k_f/k_h$ becomes almost 1. 


SUMMARY 


The subjective force of pushing a pedal 
with the leg has been scaled as an instance of 
the subjective force exerted by a large muscle 
group. The following methods were em- 
ployed: magnitude estimation, magnitude 
production, matching the force of handgrip to 
the force of foot pressure and vice versa, and 
matching both foot pressure and handgrip to 
the intensity of white noise. The experi- 
ments involving numbers yielded a power 
function with an exponent of 1.6 relating 
subjective force to physical force. All the 
matching experiments showed that the exponent 
for force of foot pressure and force of 
handgrip is the same. The exponent for 
handgrip has previously been determined as 
1.7. Thus the subjective force of foot pres- 
sure, as measured in this study, approximates 
a power function of physical force not ap- 
preciably different from the one found earlier 
for force of handgrip. The exponent for foot 
pressure approximates 1.65. 


REFERENCES 


Borg, G., & Dahlström, U. The perception 
of muscular work. Publication No. 5, 1960, 
Umeå Research Library, Sweden. 

Ekman, G. A simple method for fitting 
psychophysical power functions. J. Psychol., 
1961, 51, 343-350. 

Reynolds, G. S., & Stevens, S. S. The 
binaural summation of loudness. J. 
Acoust. Soc. Amer., 1960, 32, 1337-1344. 

Stevens, J. C., & Mack, J. D. Scales of apparent 
force. J. exp. Psychol., 1959, 58, 
405-413. 

Stevens, S. S. Cross-modality validation of 
subjective scales for loudness, vibration, 
and electric shock. J. exp. Psychol., 1959, 
57, 201-209. 


(Received July 27, 1961) 


Journal of Experimental Psychology 
1962, Vol. 64, No. 3, 258-760 


RE-EXAMINATION OF THE SERIAL POSITION EFFECT¹ 


MURRAY GLANZER 
Department of Psychiatry, University of Maryland 
AND STANLEY C. PETERS² 
Walter Reed Army Institute of Research 


Existing generalizations about the 
serial position effect in rote learning 
have recently been challenged as 
based on artifacts. McCrary and 
Hunter (1953) presented evidence 
that the different curves produced by 
experimental variables, such as presentation 
rate, distribution, and list 
difficulty, are a single curve multiplied 
by different numbers of errors. Their 
contention was borne out by the 
findings of Braun and Heymann 
(1958). A re-examination of the 
serial position effect seems necessary. 
Since any statement about serial 
position effects must refer to the 
beginning and end of a series of items, 
the first step in the re-examination 
is to analyze the meaning of the 
terms beginning and end. The analy- 
sis consists of three steps: to deter- 
mine what characteristics E points 
to as defining the beginning and the 
end of a list, to analyze these charac- 
teristics, and to determine their 
effect on S’s performance. 

In a rote learning experiment, E 
presents to S a continuous and repetitive 
cycle of events: the series of 
syllables, a gap; the series of syllables, 
a gap. The terms beginning and end 
are used to refer to the characteristics 
associated with this gap. They are 
the following: (a) Primacy-recency: 
The first item S sees, and the last 
item he sees before the cycle repeats 
itself, appear on either side of the gap, 
because Es usually present the list 
starting from the gap. (b) Spacing: 
The appearance of the gap coincides 
with a period in which S is not required 
to anticipate syllables. (c) 
Association break: Every item in the 
series is both a stimulus and a response 
item, except the two on either 
side of the gap. The S is not required 
to form an association across the gap. 
The first item in a list is usually a pure 
stimulus item. The last item is a pure 
response item, in that it does not 
function as stimulus for the first item. 


1 This work was carried out under Contract 
DA-49-007-MD-1004 between the Office of 
The Surgeon General and the University of 
Maryland. 

2 Now at the University of Illinois. 

The purpose of the following experi- 
ments was to separate the factors 
that make up the complex called 
the beginning and the end of the list, 
and to determine the part these 
factors play in the serial position 
effect. 


EXPERIMENT I: PRIMACY-RECENCY 


Mitchell's (1934) evidence that 
the usual bowed curve appears in 
lists that have neither spacing nor 
association breaks would indicate a 
considerable role for primacy-recency 
effects. In Exp. I, the role of primacy-recency 
was evaluated by varying Ss’ 
starting positions: starting some Ss 
at the normal position in the list, 
and starting other Ss at what would 
ordinarily be called the middle of the 
list. The normal position in the list 
is that part which has the gap followed 
by the asterisk, or other starting 
cue; this may be called the 
structural beginning. The temporal 
beginning, a term which may be used 
to refer to the first syllable exposed 
to S, may or may not coincide with 
the structural beginning. The two 
experimental conditions here were 
one in which the structural and temporal 
beginning coincided, and one in 
which they did not coincide. If the 
temporal starting point (the primacy-recency 
factor) plays an important 
role in determining the serial position 
effect, then groups starting at the 
structural middle should show a 
flattened or indented serial position 
curve. 


Method 


Subjects.—The Ss were 32 Army medical 
service enlisted men. They were average 
or above average in their scores on Army 
intelligence tests (General Technical score 
of the Army Classification Battery). 

Materials.—Two six-syllable lists (Glaze 
20% association value) were used: List I: 
ZID, NUK, WEF, QAM, TUH, BEJ; List II: 
QAM, TUH, BEJ, ZID, NUK, WEF. The two lists 
consist of the same series of syllables, with 
the structural beginning placed at different 
positions. 
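
The relation between the two lists can be checked directly: each is a cyclic rotation of the other, so only the position of the structural beginning differs. The helper names `rotate` and `is_rotation` below are hypothetical and serve only as illustration.

```python
LIST_I = ["ZID", "NUK", "WEF", "QAM", "TUH", "BEJ"]
LIST_II = ["QAM", "TUH", "BEJ", "ZID", "NUK", "WEF"]


def rotate(items, k):
    """Return a non-empty list cyclically shifted so that items[k] comes first."""
    k %= len(items)
    return items[k:] + items[:k]


def is_rotation(a, b):
    """True if b is the same cyclic series as a, started at some other item."""
    return len(a) == len(b) and any(rotate(a, k) == b for k in range(len(a)))


# Moving the structural beginning three syllables along List I gives List II.
print(rotate(LIST_I, 3) == LIST_II)
print(is_rotation(LIST_I, LIST_II))
```

Using both lists means every syllable appears at two different structural positions across Ss, which separates the effect of position from the effect of the particular syllable.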

Procedure.—The Ss were first given two 
three-syllable practice lists (Glaze 100% 
association value). One practice list was 
started from the structural beginning, i.e., 
the asterisk. The other was started from 
the structural middle. Half the Ss were 
started from the structural beginning in their 
first practice list; the other half were started 
from the structural middle in their first 
practice list. After learning both practice 
lists to a criterion of one perfect trial, S 
learned one of the six-syllable lists. 

The presentation rate was 3 sec. per syllable, 
with a 6-sec. interval (two blank spaces 
on the drum) between successive cycles of the 
list. All groups received the same instructions: 
to try to anticipate each syllable before 
it appeared. No reference was made to either 
the beginning or the end of the list. The list 
was learned to a criterion of three consecutive 
errorless trials. Half the Ss learned List I; 
the other half, List II. Half the Ss started 
at the structural beginning with the asterisk 
exposed. The other half started at the structural 
middle of the list, with either WEF or 
BEJ exposed. Trials were counted as starting 
at the structural middle of the list for the 
Ss who started at that point. 


Results 


Since the work of McCrary and 
Hunter (1953) and Braun and Hey- 
mann (1958) showed that the total 
number of errors has a multiplicative 
effect on the shape of the serial posi- 
tion curve, here, and in the subsequent 
experiments, error scores were trans- 
lated into logarithms. This transla- 
tion converts the multiplicative effect 
to an additive factor and permits 
direct comparison of the shapes of 
the curves. This procedure was 
adopted instead of dividing through 
by the total number of errors, as in 
the work mentioned above. 
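
The effect of the logarithmic translation can be shown with a small numerical sketch. The error counts below are hypothetical, not data from the experiment, and base-10 logarithms are an assumption; any base gives the same result up to a constant factor.

```python
import math

# Hypothetical errors per serial position for an "easy" condition.
base_curve = [2.0, 5.0, 8.0, 9.0, 7.0, 4.0]

# A "hard" condition with the same curve shape but three times as many errors.
easy = [1.0 * e for e in base_curve]
hard = [3.0 * e for e in base_curve]


def log_curve(errors):
    """Translate error scores into logarithms (all scores must be positive)."""
    return [math.log10(e) for e in errors]


# After the translation the two curves differ by the same amount everywhere,
# so they are parallel: same shape, different height.
gaps = [h - e for h, e in zip(log_curve(hard), log_curve(easy))]
print(gaps)
```

Every gap equals the logarithm of the multiplier, so a difference in curve shape, if one existed, would show up as a departure from parallelism, that is, as a Position × Condition interaction.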

The groups that started in the 
structural middle had a slightly 
higher serial position curve (see Fig. 
1) than those starting at the structural 


beginning. The overall differences 
between the two experimental groups 
130, 
120 
110 AERA 


H 
S 
xc 
4 
W 
8 
a 
z 
Ss —— I STRUC. START 
= 50 menem IE STRUC MIDDLE 
40 
30 
.20 
(Den Meee Pe ee o 
| 2 3 4 5 6 
POSITION 


Fic. 1. Serial position curves with pri- 
macy-recency varied: Exp. I. (Group Il 
started in the middle of the list.) 


260 


TABLE 1

ANALYSIS OF VARIANCE OF LOG ERROR
SCORES: EXP. I

Source               df      MS        F
Between Ss
  Condition (C)       1    .5645     1.69
  Lists (L)           1    .3934
  C X L               1    .1068
  Error (between)    28    .3331
Within Ss
  Position (P)        5    .9834    42.76***
  P X C               5    .0171
  P X L               5    .0193
  P X C X L           5    .0384     1.67
  Error (within)    140    .0230

*** P < .001.


were not, however, statistically sig- 
nificant (see Table 1). For the group 
starting at the asterisk, the overall 
mean of the converted error scores 
was 5.5 (SD = 1.4); the mean num- 
ber of trials to criterion was 19.7 
(SD = 9.2). For the group starting 
at the structural middle, the mean 
of the converted errors was 6.1 
(SD = 1.4); the mean number of 
trials to criterion was 22.9 (SD = 9.4).
The experimental operation had no 
discernible effect on the shape of the 
serial position curves, and of course 
the Position X Experimental Condi- 
tion interaction was not significant. 
List structure was the sole effective 
factor, significant at the .001 level. 


Greenhouse and Geisser (1959) have 
pointed out that the significance levels 
used in analyses of variance in repeated 
measurements designs are incorrect, if 
the assumption of equal covariances is 
not met. In that case, the usual test, 
presented in Table 1, overestimates the 
significance level. A lower bound test 
that underestimates the significance level 
can be obtained by appropriate reduction 
of the degrees of freedom. The effect 
of list structure remains significant at 
the .001 level (df = 1/28). 
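
The reduction of the degrees of freedom can be sketched as follows, assuming the lower bound test divides both the effect and the error degrees of freedom by the number of repeated measurements minus one. The function name is hypothetical; the numerical inputs are the Position df from Table 1.

```python
def lower_bound_df(df_effect, df_error, n_levels):
    """Divide both df by (n_levels - 1), the most conservative reduction.

    n_levels is the number of repeated measurements (here, list positions).
    """
    divisor = n_levels - 1
    return df_effect / divisor, df_error / divisor


# Position effect in Table 1: six positions, usual test on 5 and 140 df.
exp1_position = lower_bound_df(5, 140, 6)
```

With six positions the usual 5 and 140 df reduce to 1 and 28, the values quoted in the text.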

The lists used in this experiment were 
shorter than the 10- to 12-syllable lists 
usually used in serial position studies.
However, the curves (Fig. 1) show the 
same skewed bow shape as the curves 
obtained with longer lists (Hovland, 
1938a). They are also similar to the 
family of curves obtained by Robinson 
and Brown (1926) for lists that range 
from 5 to 17 syllables. There is no 
basis, therefore, for assuming that dif- 
ferent factors are at work in 6-syllable 
lists than those in longer lists. 

The type of scoring used above (based 
on the total number of errors before the 
criterion was met) may conceal an early 
effect of primacy-recency that is swamped 
by the repeated effect of list structure. 
To check on this possibility, the number 
of the first trial on which a correct 
anticipation occurred was scored for 
each position in the list. With the start- 
ing position at the asterisk, the means 
for the six positions were 2.38, 5.00, 8.06, 
8.50, 9.75, and 6.62. With the starting 
position in the middle of the list, the 
means were 3.81, 6.38, 6.31, 7.81, 11.31, 
and 8.50. There is an indication of 
flattening of the curve for the lists started 
at the middle. The Position X Condi- 
tion interaction, however, was not sig- 
nificant (df = 5/140, P > .10). 
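
The alternative scoring can be sketched as below. The record is hypothetical and the function name is an illustration, not part of the original procedure; the sketch only shows how the number of the first correct trial would be read off for each position.

```python
def first_correct_trial(correct_by_trial):
    """Return the 1-based number of the first trial with a correct
    anticipation, or None if the position was never anticipated correctly."""
    for trial_number, correct in enumerate(correct_by_trial, start=1):
        if correct:
            return trial_number
    return None


# Hypothetical record for one S: rows are trials, columns are the six positions.
record = [
    [False, False, False, False, False, False],
    [True,  False, False, False, False, False],
    [True,  False, False, False, False, True],
    [True,  True,  False, False, False, True],
]

# One score per list position.
scores = [
    first_correct_trial([trial[pos] for trial in record]) for pos in range(6)
]
```

Averaging such scores over Ss, position by position, gives means of the kind reported above.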

It is clear that the structure of the 
list, rather than a primacy-recency 
effect, is the major determinant of the 
serial position effect; the same serial 
position effect appears with or without 
primacy-recency. The results do not 
contradict Mitchell’s (1934) results, since 
in her lists the structural characteristics 
of spacing and association break were 
absent. As will be pointed out below, 
when the structural characteristics are 
removed, primacy-recency effects can be 
seen. For the usual type of list presenta- 
tion, however, the important factor 
seems to be the structure of the list. 
The next step was to examine the two 
factors associated with the structural 
beginning and end of the list: spacing 
and the association break. 


EXPERIMENT II: SPACING 


If spacing is a factor determining the 
appearance of serial position effect, 
then changing the amount of spacing
should affect the shape of the serial 
position curve. More specifically, 
if the confounding multiplicative ef- 
fect of total number of errors is 
eliminated, then increasing the spac- 
ing should give more pronounced 
serial position effect. McCrary and 
Hunter (1953) and Braun and Hey- 
mann (1958) have presented evi- 
dence that contradicts this hypothe- 
sis, but they compared only 6-sec. 
and 126-sec. intertrial intervals. In- 
tervals between 0 and 16 sec. were 
included in the experiment below. 
Perhaps more important than the 
range of spacing values is the fact 
that in the Hovland (1938b, 1940) 
studies re-analyzed by McCrary and 
Hunter, and also in the Braun and 
Heymann study, the long and short 
spacing intervals were not completely 
comparable. In those studies, an 
interpolated task was given during 
the 126-sec. interval, but not during 
the 6-sec. interval. 


Method 


Subjects.—The Ss were 90 Army medical
service enlisted men. The group was above 
average on Army intelligence tests (General 
Technical score on the Army Classification 
Battery). 

Materials.—Two 10-syllable (Glaze 20%
association value) lists were used: List I:
*, RUW, GIY, POH, WEF, QAM, ZIX, NUK, BEJ,
XOC, KAZ; List II: *, ZIX, NUK, BEJ, XOC, KAZ,
RUW, GIY, POH, WEF, QAM.

Procedure.—The Ss were assigned in equal
numbers to each of the two list conditions 
and the five spacing conditions. They first 
learned two 3-syllable practice lists (Glaze 
100% value) to a criterion of one perfect trial, 
under the same spacing condition as their 
main list. They then learned one of the
10-syllable lists to a criterion of two errorless
trials. The criterion was set as high as pos- 
sible within the time available, so that the 
serial position curves would represent final, 
stabilized performance. 

The lists were presented at a 2-sec. rate,
with zero, one, two, four, or eight blank
spaces (0, 2, 4, 8, or 16 sec.) between the end
of the list and the reappearance of the beginning.
There were 9 Ss who were unable to
reach the criterion within the 75 min. available:
2 each in the zero- and one-space condition,
1 each in the two- and eight-space condition,
and 3 in the four-space condition.
These Ss were replaced by others.


Results 


The mean log error curves show an
orderly change from a relatively flat 
curve for the zero-spacing condition, 
to a sharply peaked curve for the 
maximum spacing condition (see Fig. 
2). The statistical evaluation (see 
Table 2) of the effect with the usual 
tests, however, does not find the 
Spacing X Position interaction signif- 
icant (.10 > P > .05). This analysis 
does not, of course, reflect the orderly 
nature of the changes in the curves. 
For example, there is a progressive 
increase in the slope of the curve 
between Positions 1 and 6 as the 
spacing increases. The rank order
correlation for the difference between
Positions 1 and 6 with the amount of 
spacing is 1.00 for both the List I 
and List II groups. In both groups, 
there is a corresponding, but less 
marked, decrease in slope between 


Fig. 2. Serial position curves with spacing
varied: Exp. II. Ordinate: mean log errors;
abscissa: position.




TABLE 2

ANALYSIS OF VARIANCE OF LOG ERROR
SCORES: EXP. II

Source               df      MS         F
Between Ss
  Spacing (S)         4    3.5962     7.00***
  Lists (L)           1    0.0059
  S X L               4    0.3396
  Error (between)    80    0.5138
Within Ss
  Position (P)        9    2.0380    90.18***
  P X S              36    0.0315     1.39
  P X L               9    0.0437     1.93*
  P X S X L          36    0.0168
  Error (within)    720    0.0226

* P < .05.
*** P < .001.


Positions 6 and 10 as spacing in- 
creases. The rank order correlation 
for the difference between Positions 
6 and 10 with the amount of spacing, 
is .70 in both groups. 
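
A rank order correlation over five spacing conditions can be computed as in the following sketch. The slope values are hypothetical, chosen only to show how coefficients of 1.00 and .70 can arise with five conditions; they are not the observed differences, and the formula shown assumes untied ranks.

```python
def ranks(values):
    """Rank values from 1 (smallest) upward; assumes no ties."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0] * len(values)
    for rank, index in enumerate(order, start=1):
        result[index] = rank
    return result


def spearman_rho(x, y):
    """Spearman rank order correlation for untied data."""
    n = len(x)
    d_squared = sum((a - b) ** 2 for a, b in zip(ranks(x), ranks(y)))
    return 1 - 6 * d_squared / (n * (n ** 2 - 1))


spacing_sec = [0, 2, 4, 8, 16]
# Hypothetical slope measures, one per spacing condition.
monotone = [0.10, 0.15, 0.22, 0.30, 0.41]
partly_ordered = [0.10, 0.15, 0.30, 0.41, 0.22]

rho_perfect = spearman_rho(spacing_sec, monotone)
rho_partial = spearman_rho(spacing_sec, partly_ordered)
```

With only five conditions the coefficient moves in coarse steps, so a value of 1.00 requires a perfectly ordered progression across all five spacing values.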

The analysis demonstrates the expected
overall differences between the
spacing conditions, the means declin- 
ing (P < .001) with increased spacing, 
except for a reversal between the one- 
and two-space conditions. The over- 
all means of the converted errors for 
the zero- to eight-space conditions, 
in order, were as follows: 15.0 
(SD = 2.1), 14.3 (SD = 2.1), 14.8
(SD = 1.9), 13.4 (SD = 2.7), 11.6
(SD = 2.2). The means of the
ber of trials to reach criterion were, 
in order, as follows: 58.4 (SD = 21.8), 
54.4 (SD = 19.6), 60.3 (SD = 21.4), 
47.7 (SD = 22.1), 35.0 (SD = 18.0). 
The Position X Lists interaction re- 
flects differences in the difficulty of 
the syllables. The absence of a significant
third-order interaction indicates
that the effect of the spacing
on the serial position curve does not
differ for the two lists. The position
effect is significant (P < .001) and
remains at that level under the
Greenhouse-Geisser lower bound test.

3 The contribution of intrusion errors to
the total error score may be of interest.
The number of complete intrusions was
tallied and expressed as a percentage of total
errors for each S. The mean percentage of
intrusion errors for the zero- to eight-space
conditions, in order, was as follows: 5.3
(SD = 6.1), 4.7 (SD = 6.5), 3.0 (SD = 2.5),
4.2 (SD = 2.7), 7.7 (SD = 6.7).

The results lent some support to 
the hypothesis that the serial position 
curve changes in an orderly fashion 
as a function of spacing. The statis- 
tical support was, however, not satis- 
factory. 




EXPERIMENT III: SPACING AND
ASSOCIATION BREAK 


In this experiment, the aim was to 
retest the effect of spacing on the 
serial position effect and also to 
evaluate the effect of the association 
break. The expectation was that 
spacing would increase, and that 
associative chaining across the gap 
would decrease, the serial position 
effect.


Method 


Subjects.—The Ss were 120 college students.
They were paid for their participation.

Materials.—In order to insert associative
chaining between successive presentations of
the list, the lists were constructed in
quasipaired-associates format. Ten chained lists
were constructed by rotating the syllables
systematically through the 10 available positions.
Two of the chained lists are shown
below.

Chained 1: * ZIX NUK * NUK BEJ * BEJ
XOC * XOC KAZ * KAZ RUW * RUW GIY * GIY
POH * POH WEF * WEF QAM * QAM ZIX (Space
0, 2, or 8) * ZIX NUK * (etc.)

Chained 2: * NUK BEJ * BEJ XOC * XOC
KAZ * KAZ RUW * RUW GIY * GIY POH * POH
WEF * WEF QAM * QAM ZIX * ZIX NUK (Space
0, 2, or 8) * NUK BEJ * (etc.)

To break the chaining, the initial syllable
in each of the 10 chained lists was replaced
with a substitute syllable, i.e., LEB for ZIX in
Chained 1; CRW for NUK in Chained 2; and
TIV for BEJ, HUQ for XOC, VEP for KAZ, FOJ
for RUW, LEQ for GIY, JAT for POH, YUS for
WEF, and FUR for QAM, in the other 8 chained
lists. All syllables, including the substitute
syllables, were from the Glaze 20% association
value list.

Procedure.—The presentation rate was 1
sec. per syllable. Three spacing conditions
were used: 0, 2, and 8 spaces, or seconds,
between lists. The 10 lists, 3 spacing conditions,
and 2 chaining conditions formed a
10 X 3 X 2 factorial design. Two Ss were
assigned to each of the 60 experimental
conditions.

The drum displayed an asterisk, followed
by a stimulus syllable and then the response
syllable. A stimulus syllable, therefore,
appeared every 3 sec. After learning two
3-syllable practice lists in the same quasipaired-associates
format to a criterion of one errorless
trial, each S learned a 10-syllable list to a
criterion of two successive errorless trials.


Results 

Spacing has the expected effect 
of decreasing total errors during 
learning. The effect is significant 
at the .01 level (see Table 3). The 
overall means of the converted errors 
for the zero-, two-, and eight-space
conditions, in order, were as follows:


TABLE 3

ANALYSIS OF VARIANCE OF LOG ERROR
SCORES: EXP. III

Source               df      MS         F
Between Ss
  Spacing (S)         2    2.5434     6.09**
  Chaining (C)        1    0.0097
  Lists (L)           9    0.3966
  S X C               2    0.2680
  S X L              18    0.3114
  C X L               9    0.7316     1.75
  S X C X L          18    0.2184
  Error (between)    60    0.4175
Within Ss
  Position (P)        9    1.0312    56.35***
  P X S              18    0.0609     3.33***
  P X C               9    0.0250     1.37
  P X L              81    0.0584     3.19***
  P X S X C          18    0.0261     1.43
  P X S X L         162    0.0172
  P X C X L          81    0.0178
  P X S X C X L     162    0.0184     1.01
  Error (within)    540    0.0183

** P < .01.
*** P < .001.


Fig. 3. Serial position curves with spacing
varied: Exp. III.


13.6 (SD = 2.1), 13.0 (SD = 2.0),
12.0 (SD = 1.7). The means of the
number of trials to reach criterion
were, in order, as follows: 44.7
(SD = 18.2), 39.6 (SD = 15.0), 31.9
(SD = 11.1). Chaining does not
have any significant overall effect.
The serial position effect was signifi- 
cant (P < .001), with the Green- 
house-Geisser lower bound test leav- 
ing the significance level unchanged. 

The effect of spacing on the serial 
position curve is shown in Fig. 3. 
As spacing increases, the curve moves 
toward a markedly bowed shape of 
classic form, with the beginning lower 
than the end. The effect, as evalu- 
ated by the Position X Spacing inter- 
action, is significant at the .001 level. 
Use of the Greenhouse-Geisser lower 
bound test with 2 and 60 df leaves the 
effect significant, at worst, at the .05 
level. The effect of chaining on the 
serial position curve was not signifi- 
cant. The significant interaction of 
Lists and Serial Position (P < .001)
reflects differences in the difficulty of
individual syllables, which were in 
different positions in the 10 lists. 

The plots of the mean unconverted 
error scores display the same pro- 
gression toward a more peaked serial 
position curve, with increased spacing. 
Plots of percentages of total errors, 
of course, give essentially the same 
picture, with respect to the shapes of 
the curves, as that obtained with 
mean log of errors. The data were 
also converted into ranks. Thus, if S 
had the following raw error scores 
for the 10 positions: 6, 8, 15, 21, 18, 
22, 19, 24, 13, 17, his scores were 
converted to the following ranks: 
1.0, 2.0, 4.0, 8.0, 6.0, 9.0, 7.0, 10.0, 
3.0, 5.0. Examination of the mean 
ranks for each position showed the 
same progression found with the 
mean log scores. Using the ranks, 
coefficients of concordance were com- 
puted for each group. For both the 
chained and unchained groups, the 
coefficients increased as spacing in- 
creased. In the chained group, they 
were .18, .30, .48; in the unchained 
group, they were .11, .42, .48. All the
coefficients are significant (P < .05).
The regular progression of the coefficients
indicates that the increase in
the bowing of the serial position 
curve as a function of spacing is 
independent of the particular con- 
version used. 
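
The rank conversion and the coefficient of concordance can be sketched as follows. The raw scores are the example given in the text; the function names are hypothetical, and the formula is the standard Kendall coefficient W for untied ranks, which is assumed here to be the statistic the authors computed.

```python
def to_ranks(scores):
    """Rank scores from 1.0 (fewest errors) upward; assumes no tied scores."""
    order = sorted(range(len(scores)), key=lambda i: scores[i])
    ranked = [0.0] * len(scores)
    for rank, index in enumerate(order, start=1):
        ranked[index] = float(rank)
    return ranked


def kendall_w(rank_rows):
    """Kendall's coefficient of concordance W for m rankings of n items."""
    m = len(rank_rows)
    n = len(rank_rows[0])
    totals = [sum(row[j] for row in rank_rows) for j in range(n)]
    mean_total = m * (n + 1) / 2
    s = sum((t - mean_total) ** 2 for t in totals)
    return 12 * s / (m ** 2 * (n ** 3 - n))


# Raw error scores for the 10 positions, as in the example in the text.
example = to_ranks([6, 8, 15, 21, 18, 22, 19, 24, 13, 17])
```

W is 1 when every S orders the positions identically and 0 when the rank totals for all positions are equal, so a rising W across spacing conditions means the Ss agree more closely on which positions are hardest.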

Figure 3 shows a slight serial position
effect in the zero-space condition.
Separate analysis of the zero-space 
condition finds the effect significant 
(F = 7.39, df = 1/60, P < .01). The 
chained zero-space condition by itself 
also demonstrates a significant serial 
position effect (F = 4.94, df = 1/60, 
P < .05). The factor underlying this 
effect is the primacy-recency variable. 
Evidently, as in Mitchell’s (1934) 
study, when spacing is absent, the 
primacy-recency factor affects the 
shape of the curve. The effect is,
however, too slight to account for the 
usual serial position effect. 


DISCUSSION 


The preceding experiments indicate 
that of the three factors listed initially 
as possible determinants of the serial 
position effect (chaining, spacing, and 
primacy-recency), the most important 
factor is spacing, the appearance of a 
gap in the list. Experiment I eliminated
primacy-recency as a major factor in the 
usual serial learning situation, since a 
full, unaffected serial position effect was 
shown even when opposed by the pri- 
macy-recency factor. Experiment II 
gave some evidence that the serial posi- 
tion effect was a function of spacing. 
Experiment III completed the case in 
support of the effect of spacing and 
eliminated chaining as a major deter- 
minant of the serial position effect. 
Experiment III also indicated a slight 
effect for primacy-recency when it is 
not opposed by spacing. 

McCrary and Hunter (1953), in their
re-analysis of Hovland's (1938b, 1940)
data, and Braun and Heymann (1958)
display curves that remain unaffected
by spacing, once the multiplicative
effect of total number of errors is eliminated.
Evidence has been presented
here that spacing does have an effect.
The apparent contradiction probably
stems from differences in procedure. In
the previous studies, the intertrial intervals
were 6 and 126 sec.; in the present
experiments, they ranged between 0 and
16 sec., with large changes appearing be- 
tween the 0- and 2-sec. conditions. In 
the previous studies, an interpolated 
task was used, but only during the long 
interval. This could have counteracted 
the spacing effect by slowing the learning 
of the beginning and end syllables of the 
long interval lists. 

Inhibition (Hull, Hovland, Ross, Hall, 
Perkins, & Fitch, 1940) and interference 
(Atkinson, 1957) constructs have been 
popular in the explanation of the serial
position curve. From one point of view,
the results above fit in with inhibition or
interference explanations of the serial
position effect. Spacing could function
as a barrier to protect the items near 
the beginning and the end of the list. 
With the demonstration of a system- 
atic effect of spacing on the serial 
position curve, however, another type 
of explanation becomes possible: a 
facilitation explanation. With this type 
of explanation, interference or inhibitory 
effects between list items are considered 
homogeneous within the list. The serial 
position curve is viewed as a result of the 
facilitative effect of spacing on the learn- 
ing of the first and last items of the list. 
These, in turn, facilitate the learning of 
their neighbors. The skewing of the 
curve can be explained by an effect 
demonstrated by Ribback and Under- 
wood (1950). They showed that once an 
association is learned between a pair of 
syllables, it is easier to attach a third 
association to the second member of the 
pair than to the first. In other words, 
once the association X-Y is learned be- 
tween syllables X and Y, it is easier to 
learn the triplet of syllables X-Y-Z than 
it is to learn A-X-Y. The Ribback- 
Underwood mechanism by itself is not 
sufficient to generate the serial position 
curve, since the mechanism does not 
determine the syllables from which it 
will spread. This determination could 
be made by spacing, which would facili- 
tate the syllable pairs bordering the gap. 
These anchor syllables would in turn 
generate the serial position effect for- 
ward and backward from the gap. 


SUMMARY 


Serial position effects are usually defined
on the basis of the beginning of a list. In
rote learning, the term “beginning” can be
analyzed into three factors associated with
a repetitively-appearing gap that separates
the “end” from the “beginning” of the list:
(a) Primacy-recency. The first item the S
sees, and the last item he sees before the cycle 
repeats itself, appear on either side of the gap, 
because Es usually start the list from the gap. 
(b) Spacing. The appearance of the gap 
coincides with a period in which the S is not 
required to anticipate syllables. (c) Chaining 
versus association break. Every item in the 
series functions as both a stimulus and a
response item (chaining), except the two on 
either side of the gap. One of these is solely 
a stimulus item; the other is solely a response
item. 

Three experiments were carried out to 
determine the role of these three factors in 
generating the serial position effect. In 
Exp. I, primacy-recency was opposed to both
spacing and association break by varying S's
starting position in conventional lists. No
effect of the variation was found, indicating
that primacy-recency was not a major factor.
In Exp. II, spacing was varied, using intertrial
intervals of 0, 2, 4, 8, and 16 sec. With
the multiplicative effect of total number of
errors held constant, some evidence was obtained
indicating that the serial position curve
became more peaked as spacing increased.
In Exp. III, both spacing (intertrial intervals
of 0, 2, and 8 sec.) and chaining or association
break were varied. To vary chaining, the
lists were given in quasipaired-associates
form. Some lists required that an association
be formed between the end and the beginning
of the list; other lists did not. Experiment
III showed clearly the effect of spacing in
determining the serial position effect. There
was no evidence for the effect of association
break or chaining. Experiment III also indicated
a slight effect for primacy-recency when
it is not opposed by spacing. On the basis
of these findings, it is concluded that the major
factor determining the serial position effect 
is the amount of space between the end and 
the beginning of the list, with an increase in 
spacing producing a more marked serial 
position effect. 


REFERENCES 


Atkinson, R. C. A stochastic model for
rote serial learning. Psychometrika, 1957,
22, 87-96.

Braun, H. W., & Heymann, S. P. Meaningfulness
of material, distribution of practice,
and serial-position curves. J. exp. Psychol.,
1958, 56, 146-150.

Greenhouse, S. W., & Geisser, S. On
methods in the analysis of profile data.
Psychometrika, 1959, 24, 95-112.

Hovland, C. I. Experimental studies in
rote-learning theory: II. Reminiscence
with varying speeds of syllable presentation.
J. exp. Psychol., 1938, 22, 338-353. (a)

Hovland, C. I. Experimental studies in
rote-learning theory: III. Distribution of
practice with varying speeds of syllable
presentation. J. exp. Psychol., 1938, 23,
172-190. (b)

Hovland, C. I. Experimental studies in
rote-learning theory: VII. Distribution
of practice with varying lengths of list.
J. exp. Psychol., 1940, 27, 271-284.

Hull, C. L., Hovland, C. I., Ross, R. T.,
Hall, M., Perkins, D. T., & Fitch, F. B.
Mathematico-deductive theory of rote learning.
New Haven: Yale Univer. Press, 1940.

McCrary, J. W., & Hunter, W. S. Serial
position curves in verbal learning. Science,
1953, 117, 131-134.

Mitchell, M. B. The effect of serial position
in the continuous memorization of numbers.
Amer. J. Psychol., 1934, 46, 493-494.

Ribback, A., & Underwood, B. J. An
empirical explanation of the skewness of
the bowed serial position curve. J. exp.
Psychol., 1950, 40, 329-335.

Robinson, E. S., & Brown, M. A. Effect of
serial position upon memorization. Amer.
J. Psychol., 1926, 37, 538-552.


(Received July 27, 1961) 


Journal of Experimental Psychology 
1962, Vol. 64, No. 3, 267-271 


EFFECTS OF INSTRUCTIONS IN PROBABILITY
LEARNING¹


J. McCRACKEN, C. OSTERHOUT, AND JAMES F. VOSS


College of Wooster 


The present paper² reports the results
of two experiments designed to
investigate probability learning as a
function of instructions. The major
instructional aspects studied were
type of problem presented and information
regarding sequential characteristics
of the task. Studies of the
role of instructions per se (Anderson
& Grant, 1957) have not been extensive.
However, instructions used
in studies of two-choice, noncontingent
probability learning that allowed for
the possibility of E1-E2 patterns and
permitted S to respond on a trial-by-trial
basis have yielded A1 probabilities
approximately equal to E1 probabilities
(e.g., Engler, 1958; Grant,
Hake, & Hornseth, 1951; Neimark,
1956). On the other hand, studies
which provided additional information
regarding the nonpatterned E1-E2
trials yielded A1 frequencies which
exceeded E1 probabilities (Morse &
Runquist, 1960; Rubenstein, 1959).
Experiment 1 was designed, therefore,
to test the hypothesis that event
prediction instructions and event prediction
instructions with additional
information regarding the sequential
nature of the task yield A1 probabilities
equal to and in excess of the
E1 probability, respectively. In addition,
Exp. 1 tested the hypothesis that
instructions which present no problem,
but tell S to engage in a conversation,
yield nondifferential E1-E2 effects. A
transfer condition was employed in
order to determine the permanency of
instructional effects.

1 This research was sponsored by the National
Institute of Mental Health (M-3531).
The authors wish to thank N. H. Anderson
for his helpful comments.

2 The notation used in this paper is as
follows: A1 and A2 denote S's predictions of
events E1 and E2, respectively, where E1 is
the more frequent (.70) event and E2 the less
frequent (.30) event.

Experiment 2 was designed to test 
the hypothesis that more specific in- 
structions to avoid a trial-by-trial 
basis of responding and instructions to 
consider the task as essentially a discrimination
problem yield A1 maximization.
Discrimination instructions
were considered to be the extreme case
of the lack of a trial-by-trial basis of
responding since S was told to consider
the E1-E2 trials either in blocks of
trials or as one two-event discrimi- 
nation. 
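
The practical difference between matching and maximization can be made concrete with a small calculation. The sketch assumes that S's predictions are statistically independent of the event sequence, which holds only approximately for sequences randomized within blocks; the function name is hypothetical.

```python
def expected_correct(p_predict_e1, p_e1=0.70):
    """Expected proportion of correct predictions when S predicts E1 with
    probability p_predict_e1, independently of the event sequence."""
    return p_predict_e1 * p_e1 + (1 - p_predict_e1) * (1 - p_e1)


# Matching: the A1 probability equals the E1 probability.
matching = expected_correct(0.70)
# Maximization: S always predicts the more frequent event.
maximizing = expected_correct(1.00)
```

Under these assumptions matching gives .58 correct and maximization gives .70, which is why instructions that stress the total number of correct predictions would be expected to push A1 responding above the E1 probability.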


EXPERIMENT 1 
Method 


The E1-E2 probability of .70-.30, randomized
within 20-trial blocks, was used for
both experiments. Each of two 33% association
nonsense syllables (Hilgard, 1951), PIB
and FAJ, occurred as the E1 event for one-half
of the randomly assigned Ss. The Ss were
students at the College of Wooster.
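
One way to build such a sequence is sketched below. It assumes that "randomized within 20-trial blocks" means each block contains exactly 14 E1 and 6 E2 events in random order; the function name, the event labels, and the seed are illustrative and do not come from the original procedure.

```python
import random


def block_randomized_sequence(n_blocks, block_size=20, p_e1=0.70, seed=None):
    """Return a list of 'E1'/'E2' events with a fixed E1 count in every block."""
    rng = random.Random(seed)
    n_e1 = round(block_size * p_e1)  # 14 of 20 trials at p = .70
    sequence = []
    for _ in range(n_blocks):
        block = ["E1"] * n_e1 + ["E2"] * (block_size - n_e1)
        rng.shuffle(block)  # random order within the block only
        sequence.extend(block)
    return sequence


# Seven 20-trial blocks, i.e., 140 trials.
sequence = block_randomized_sequence(n_blocks=7, seed=1)
```

Fixing the count within each block keeps the observed E1 proportion at exactly .70 in every trial block, so block-by-block A1 percentages are compared against the same event frequency.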

Original training (OT).—Instructions for 
Group 1 essentially asked S to engage in a 
conversation: 


You have before you a list of two non- 
sense syllables. Using only the two words 
on this list, you and I are going to hold a 
conversation. To begin this conversation, 
you will say one of the words to me. I, in
turn, will say one of the words to you. 
You will look over your list and say the 
word which you think is a reasonable 
statement at that time. We will continue 
in this manner until the end of the experi- 
ment. Remember, use only the two words 
before you for all of your contributions to 
the conversation.

Instructions for Group 2 asked S to predict a
sequence of events: 


You have before you a card with two 
nonsense syllables listed on it. Using only 
these two words, you and I will hold a 
special type of conversation. You will 
begin by choosing one of these words and 
saying it aloud to me and I will then respond
with one of the words. From this point on, 
you will try to predict the word I will say. 
If you are correct in your prediction, I will 
say that same word after you. We will 
continue in this manner (your prediction, 
my response) until the end of the ex- 
periment. 

Your task is merely to try very hard to 
predict correctly each separate word I 
shall say. 

Remember, use only the two words before 
you for all of your predictions and try hard 
to get each of your predictions right. 


Instructions for Group 3 presented S 
sequential information and asked S to regard 
the situation as one problem. The first two 
paragraphs of the instructions for Group 2 
were used, followed by: 


There is no pattern to discover. Your 
task is merely to get as many predictions 
correct as possible. Consider the entire 
experiment as one general problem and 
solve it so as to have the most correct you 
possibly can. It may be necessary to have 
some wrong predictions in order to best 
solve this general problem. Remember: 
treat the entire experiment as one general 
problem and attempt to solve it in order to 
get as many correct predictions as possible 
—even if you must have some wrong pre- 
dictions to do so. 


Fig. 1. Percentage of A1 responses as a
function of Trial Blocks 1-7 for Groups 1-6 of
Exp. 1 and 2.


TABLE 1

PERCENTAGE OF A1 RESPONSES FOR THE
SEVEN 20-TRIAL BLOCKS OF THE
NINE TRANSFER CONDITIONS

Trial                      Group
Block    11   21   31   12   22   32   13   23   33
  1      44   52   50   52   60   56   48   73   69
  2      51   59   58   67   73   70   63   82   76
  3      56   56   61   65   76   71   73   83   74
  4      46   61   56   66   76   68   78   85   75
  5      50   53   59   73   75   68   80   86   78
  6      49   57   63   75   70   76   84   84   83
  7      48   55   59   70   76   72

Note.—The first digit of the group number designates
the OT condition, the second designates the T condition.


Thirty Ss served in each group for 140 OT
trials. The E responded 2 sec. after S gave
a response.

Transfer (T).—Each OT group was ran- 
domly divided into three equally sized sub- 
groups. Each subgroup received one of the 
three sets of OT instructions after the 140 OT 
trials. The T (second instruction) groups 
were run for another 140 trials, with the same 
respective E1-E2 words, probability condition,
and counterbalancing of words, but different
20-trial block randomized E1-E2 sequences.


Results³


Original training.—Figure 1 presents
the Groups 1, 2, and 3 A1 percentages
as a function of trial block
for OT trials. Group 1 responded at
approximately chance level, whereas
Groups 2 and 3 approached matching.
An analysis of variance performed on
the A1 frequency data revealed a
significant instruction group effect
(F = 32.50, df = 2/84). A Duncan
range test indicated 1-32.⁴ A Trial
Block 7 analysis of variance revealed a
significant instruction group source of
variation (F = 6.89, df = 2/84) and
the same range results.

3 A preliminary experiment was performed
involving Group 1 instructions and different
E1-E2 sequences. The results were roughly
equivalent to the present Group 1 results.
The effect of the verbal counterbalancing and
the interactions including word used are not
significant for any analysis and are not further
discussed.

4 Significant differences are indicated by
dashes between the group numbers, with an
increase in A1 frequency from left to right.

Transfer.—Table 1 presents the per- 
centage A, responses of the nine 
transfer groups for Trial Blocks 8-14. 
The data of Fig. 1 and Table 1 indicate 
a relatively rapid shift in performance 
level with the introduction of the T 
instructions. Analysis of variance 
performed on the A1 frequency data of
Trial Blocks 8-14 revealed a significant
T instruction group source of
variation (F = 25.32, df = 2/72).
Subsequent range test results indi- 
cated 1-2-3. The OT instruction 
and T Instruction X OT Instruction 
sources of variation are not significant. 

Analyses of variance were per- 
formed on the data of Trial Blocks 8 
and 14. The OT instruction group 
source of variation in Trial Block 8 is 
significant (F = 6.38, df = 3/72), 
with subsequent range test results of 
1-23. These results agreed with the 
results of Trial Block 7. The OT 
instruction group source of variation 
for Trial Block 14 is not significant. 
The Trial Block 8 and 14 findings 
likely reflect the reduced influence of 
OT instructions. 

The T instruction group source of 
variation is significant in the Trial 
Block 8 (F = 6.49, df = 2/72) and 
Trial Block 14 (F = 21.02, df = 2/72) 
analyses. Subsequent range tests revealed:
12-3 and 1-23 on Trial Block
8 and 1-23 on Trial Block 14. 


EXPERIMENT 2 


The lack of difference between 
Groups 2 and 3, except for the overall 
T analysis, implied that the sequential 
information and suggested approach 
presented to Group 3 did not differ- 
entiate the problem from the simple 
event prediction for Group 2. Ex- 
periment 2 was therefore designed to
test the hypothesis that instructions 
to avoid a trial-by-trial basis of re- 
sponding and instructions which force 
S to consider the task as essentially 
one two-choice discrimination problem 
yield maximization. 

Group 4 was presented instructions 
to consider the task as one problem 
and to avoid a trial-by-trial basis of 
responding. Group 5 was instructed 
to predict one event when told to 
“Choose” (after 20 trials) and to 
maintain that response until again 
told to “Choose” (after 20 trials). 
Group 6 was instructed to consider 
the situation as a discrimination task 
and select the response more likely to 
occur once such a discrimination was 
made. 


Method 


The first paragraph of instructions for 
Groups 4, 5, and 6 was the same as that of 
Groups 2 and 3. The Group 4 instructions 
continued: 


Please note that you should ignore a trial- 
by-trial approach as much as possible. 
Neglect of this approach is helpful because 
there is no pattern or system to the sequence 
of words I shall say. Thus, you should ap- 
proach this task as one problem and use 
any strategy you are able to devise in order 
to get as many words correct as possible. 
You will not be able to get all of the words 
correct, but you should try to get a maxi- 
mum number of words correct. Remem- 
ber, you are to approach the task as one 
problem and employ a strategy which will 
enable you to make a maximum number of 
correct predictions. 


Group 5 instructions were: 


Please note that you should ignore a 
trial-by-trial approach as much as possible. 
Neglect of this approach is helpful because 
there is no pattern or system to the words 
I shall say. In addition, you will not be
able to get all of the words correct. How- 
ever, in order to facilitate your getting a 
maximum number of correct predictions, 
the following procedure will be used. 
When we begin, you will try to predict the
word I shall say. Then, at a given point 
in the sequence, I shall say “Choose.” 


[Figure 2: percentage of A1 responses (ordinate) plotted against run length of the .70 word (abscissa).]

FIG. 2. Percentage of A1 responses as a
function of E1 run length for Groups 1, 2, and
3 of Exp. 1 and Group 4 of Exp. 2.


After I say “Choose,” you are to again 
predict, but you must predict with the 
same word until I again say “Choose.”
This procedure will continue until the end 
of the experiment. Remember, when I say 
“Choose,” you must only predict with one 
of the words until I again say “Choose.” 
Naturally, you should say the word that 
you think will occur more frequently until 
I again say “Choose” so that you will make 
a maximum number of correct predictions.


Group 6 instructions were:


Please note that you should ignore a 
trial-by-trial approach as much as possible. 
Neglect of this approach is helpful because 
there is no pattern or system to the words I 
shall say. In addition, I want to point out 
that you will not be able to get all of the 
words correct. However, in order to 
facilitate your getting a maximum number 
of correct predictions, you are to use the 
following procedure. 

Consider the experiment as essentially 
one of discrimination. Therefore, if you 
are able to discriminate or tell which word 
occurs more frequently, you are to respond 
with that word on all of the trials. Since 
there is no pattern to the words, you will 
get as many words correct as possible by 
responding in this manner. Remember, as 
soon as you are able to, you should tell 
which word occurs more frequently and re-
spond with that word on each prediction.


The E1-E2 sequences were those of Trials
1-140 of Exp. 1. The N was 10 for each 
group. Other conditions were comparable to 
those of Exp. 1. 


Results 




The A1 percentages for Groups 4, 5,
and 6 during Trial Blocks 1-7 are 


J. McCRACKEN, C. OSTERHOUT, AND J. F. VOSS 


presented in Fig. 1. A test for sig-
nificance between Groups 4, 5, and 6
was not performed because of the
limitations placed upon the A1 re-
sponses by the instructions.

The results of Group 4 indicate that
instructions to respond to the situa-
tion as one problem and to avoid a
trial-by-trial approach yielded an A1
percentage approaching 90%. The
Group 5 A1 percentage approached
approximately the same level. Group
6 tended to choose the more frequent
word during the first trial block, al-
though after an E2 run, an A2 response
occasionally occurred.

The differences among Groups 1, 2,
3, and 4 were clarified by an analysis
of negative recency effects. Figure 2
presents the percentage of A1 re-
sponses as a function of E1 repetitions.
Groups 1 and 4 did not show negative
recency effects, whereas Groups 2 and
3 yielded such tendencies. These
results imply that Groups 2 and 3
responded more to sequential char-
acteristics of the stimuli than Groups
1 and 4. The frequency distributions
of A1 responses of the Trial Block 7
data of Groups 1, 2, 3, and 4 were
essentially unimodal.
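The negative recency tabulation described above can be sketched in a few lines. This is only an illustration: the article does not say how runs were scored, so the sketch assumes that the run length for a trial is the number of consecutive E1 events immediately preceding it, that trials not preceded by E1 are ignored, and that long runs are pooled at a hypothetical cap (`max_run`). Events and responses are coded 1 (E1, A1) and 2 (E2, A2).

```python
from collections import defaultdict

def a1_percent_by_e1_run(events, responses, max_run=5):
    """Percentage of A1 predictions as a function of the preceding E1 run length.

    events[t] is the event on trial t; responses[t] is the prediction made
    on trial t before that event is announced. Runs longer than max_run are
    pooled into the max_run category (an assumption, not from the article).
    """
    counts = defaultdict(lambda: [0, 0])  # run length -> [A1 responses, trials]
    run = 0  # length of the current run of E1 events
    for event, response in zip(events, responses):
        if run > 0:
            key = min(run, max_run)
            counts[key][1] += 1
            if response == 1:
                counts[key][0] += 1
        run = run + 1 if event == 1 else 0
    return {k: 100.0 * a1 / n for k, (a1, n) in sorted(counts.items())}

# Hypothetical six-trial record for one S.
events = [1, 1, 1, 2, 1, 1]
responses = [1, 1, 2, 2, 1, 1]
print(a1_percent_by_e1_run(events, responses))
```

Under this scoring, a negative recency effect would appear as a percentage of A1 responses that falls as the E1 run grows longer.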




DISCUSSION 


In general, the findings indicated
that conversational instructions yielded
chance results; event prediction instruc-
tions, with or without additional sequen-
tial information and problem orientation,
yielded approximate matching; event
prediction instructions which included
the avoidance of a trial-by-trial orienta-
tion yielded an A1 frequency greater than
the E1 frequency; and instructions which
forced S to choose one alternative for
blocks of 20 trials or to discriminate
which event occurred more frequently
yielded asymptotic A1 levels of approxi-
mately 90% to 95%. The results of
Group 5 are in agreement with the results
of Galanter and Smith (1958).

The results are relevant to the area of


PROBABILITY LEARNING 


decision making in a probabilistic situa-
tion (Edwards, 1961) in that the A1
response level is a function of the problem
presented. Instructions which include
the possibility of E1-E2 patterns and do
not specifically state that a trial-by-trial
approach should be avoided apparently
yield results based upon sequential E1-E2
characteristics. On the other hand, the
results of Groups 5 and 6 suggest that
maximizing occurs when the task is
presented as one discrimination-type
problem and the responses indicated are
A1 and A2 for large blocks of trials rather
than trial-by-trial sequences of A1 and A2
responses.
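The practical difference between matching and maximizing in a .70-.30 noncontingent task can be made concrete with a short calculation. The sketch below is not part of the original analysis; it assumes that predictions are statistically independent of the event on the same trial, in which case the expected proportion of correct predictions is p(A1)p(E1) + p(A2)p(E2). The function name is illustrative only.

```python
def expected_accuracy(p_predict_a1, p_e1=0.70):
    """Expected proportion correct when A1 is predicted with probability
    p_predict_a1 and E1 occurs with probability p_e1, assuming predictions
    are independent of the events (noncontingent task)."""
    return p_predict_a1 * p_e1 + (1.0 - p_predict_a1) * (1.0 - p_e1)

strategies = [
    ("chance (50% A1)", 0.50),
    ("matching (70% A1)", 0.70),
    ("90% A1", 0.90),
    ("maximizing (100% A1)", 1.00),
]
for label, p in strategies:
    print(f"{label:22s} {expected_accuracy(p):.2f} correct")
```

On these assumptions matching yields about 58% correct predictions, whereas responding with A1 on every trial yields 70%, which is the ceiling for this event schedule.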

The conversation instruction results
suggest that in a probability situation,
differential E1-E2 frequency is not a
sufficient condition to yield A1-A2 dif-
ferences.


SUMMARY 


Instructions in two .70-.30 noncontingent
probability experiments were varied for six 
groups. Groups 1 to 6 were told to consider 
the task as (a) conversation, (b) event predic- 
tion, (c) event prediction—with additional 
task information, (d) event prediction— 
avoiding a trial-by-trial basis of responding, 
(e) a problem involving responding for blocks 
of 20 trials, and (f) a problem involving the 
discrimination of the two events. 

Group 1 responded at an approximate 50%
A1 level, Groups 2 and 3 approximately
matched, and Groups 4, 5, and 6 exceeded
85% A1 responding. Analysis of negative




recency effects for Groups 1-4 suggested that 
Groups 1 and 4 were not responding to the 
sequential nature of the task, whereas Groups 
2 and 3 were. 


REFERENCES 


ANDERSON, N. H., & GRANT, D. A. A test 
of statistical learning theory model for two- 
choice behavior with double stimulus 
events. J. exp. Psychol., 1957, 54, 305-317. 

EDWARDS, W. Behavioral decision theory.
Annu. Rev. Psychol., 1961, 12, 473-498.

ENGLER, J. Marginal and conditional stim- 
ulus and response probabilities. J. exp. 
Psychol., 1958, 55, 303-317. 

GALANTER, E. H., & SMITH, W. A. S. Some
experiments on a simple thought problem. 
Amer. J. Psychol., 1958, 71, 359-366. 

GRANT, D. A., HAKE, H. W., & HORNSETH,
J. P. Acquisition and extinction of a 
verbal conditioned response with differing 
percentages of reinforcement. J. exp. 
Psychol., 1951, 42, 1-5. 

HILGARD, E. R. Methods and procedures in
the study of learning. In S. S. Stevens
(Ed.), Handbook of experimental psychology. 
New York: Wiley, 1951. Pp. 517-567. 

MORSE, E. B., & RUNQUIST, W. N. Prob-
ability-matching with an unscheduled ran-
dom sequence. Amer. J. Psychol., 1960,
73, 603-607.

NEIMARK, E. D. Effects of type of nonrein-
forcement and number of alternative re-
sponses in two verbal conditioning situa-
tions. J. exp. Psychol., 1956, 52, 209-220.

RUBENSTEIN, I. Some factors in probability 
matching. J. exp. Psychol., 1959, 57, 413- 
416. 

(Received August 3, 1961) 


Journal of Experimental Psychology 
1962, Vol. 64, No. 3, 272-279 


COGNITIVE FACTORS IN HEART RATE CONDITIONING¹


BISHWA B. CHATTERJEE² AND CHARLES W. ERIKSEN


University of Illinois 


Studies on autonomic conditioning 
in human Ss typically have given little 
attention to the role of S's cognitive 
or verbalizable expectancies upon the 
development of the conditioned re- 
sponse. Where attention has been 
directed to these variables the em- 
phasis has been on demonstrating that 
humans can develop conditioned auto- 
nomic responses without verbalizable 
awareness. Experiments by Diven 
(1937) and Haggard (1943) have been 
interpreted as showing that human Ss 
will condition GSRs without verbal- 
izable awareness of the contingent 
relationship between the CS and the 
UCS and Razran (1946) has inter- 
preted his data on salivary condition- 
ing along the same lines. 

In these studies, however, only a 
crude assessment was made of S's ver- 
balizable awareness. There was no 
attempt to make a fine grained 
analysis of specific expectancies and 
their relation to autonomic response. 
Even the conclusion of conditioning 
without awareness is extremely equiv- 
ocal. Lacey and Smith (1954) have 
shown that neither the Diven nor 
Haggard studies contain the necessary 
logical or statistical comparisons of 
data to permit a conclusion of un- 
conscious conditioning. Further, 
Chatterjee and Eriksen (1960) have 
shown that Lacey and Smith’s own 
data on heart rate conditioning with- 
out awareness may well have been due 
to an artifact arising from the method 
of computing conditioning scores. 

There are but few studies that have 


1 This research was supported by Mental 
Health Grant M-1206. 


2 Now at the University of Michigan. 


attempted to manipulate experiment- 
ally S's cognitive expectancies in 
autonomic conditioning or to correlate 
closely his verbalizable expectancies 
with autonomic responses. Notter- 
man, Schoenfeld, and Bersch (1952) 
informed one group that there would 
be no further shocks and found ex- 
tinction of conditioned heart rate was 
much quicker than in an uninformed 
group. Branca (1957) undertook in- 
tensive questioning of his Ss and 
related their expectations of shock as 
well as their verbalizations as to 
whether the shock was painful or not 
to the frequency and occurrence of 
conditioned GSRs. He concluded 
(Branca, 1957) “expectation of shock 
as a painful or fearful experience was 
necessary and sufficient to produce 
responses to the experimental and 
generalization stimuli in this experi- 
ment and such expectancy was the 
result of awareness of the existing rela- 
tionships between the experimental 
stimuli and experience with the un- 
conditioned stimulus” (p. 549). 

In the present experiment we 
studied the acquisition and extinction 
of a conditioned heart rate response 
under different conditions of experi-
mentally manipulated cognitive ex-
pectancies. Since previous methods
of determining autonomic condition- 
ing have been shown to result in 
spurious evidence of precise condi- 
tioning (Chatterjee & Eriksen, 1960; 
Eriksen, 1958) an improved and more 
rigorous criterion of conditioning was 
employed. In addition the general- 
ization of conditioned heart rate re- 
sponses along a semantic and a color 
dimension was determined. 




HEART RATE CONDITIONING 


METHOD 


Design.—Stimuli were common words pre-
sented to S by means of an optical projector
for a duration of .5 sec. The S began chain
associating to the word upon exposure.
After 8.5 sec. a small light located above the screen
flashed for 1 sec. This was the signal for S
to stop associating. Following a 15-sec. rest
interval, the next stimulus word was pre- 
sented. The stimulus list consisted of 12 
nouns and adjectives all with a high frequency 
count in the Thorndike-Lorge (1944) tables. 

In the conditioning session the stimuli were 
arranged in seven blocks of trials, each block 
containing all 12 words randomized within 
the block. In the extinction phase, which 
followed the conditioning phase without 
interruption, a second list of 12 words was 
used. Only one word, BOAT, the CS, was 
common in both lists. The extinction session 
consisted of three blocks of trials with each 
block again containing randomization of the 
12 words. Each word was exposed with a 
deep colored background of either green, 
yellow, or red. There were four words in 
each color in both stimulus lists. The ex- 
tinction list contained the word BOAT, five 
words semantically related to BOAT, and six 
words semantically dissimilar. The stimulus 
words with the associated colors for the con- 
ditioning and extinction phases are shown in 
Table 1. 
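The block structure just described (every word once per block, in random order within the block) can be illustrated with a short sketch. The word-color pairs are those of the conditioning list in Table 1; the function name, the seed, and the use of a pseudorandom shuffle are assumptions made for illustration, since the article does not say how the orders were generated.

```python
import random

# Conditioning list as given in Table 1 (word -> background color).
CONDITIONING_WORDS = {
    "BOAT": "green", "CHILD": "green", "RICH": "green", "PAPER": "green",
    "BIRD": "yellow", "HOUSE": "yellow", "ALONE": "yellow", "BLUE": "yellow",
    "SOFT": "red", "TREE": "red", "SKY": "red", "PEACE": "red",
}

def make_schedule(words, n_blocks, rng):
    """Return n_blocks lists, each an independent random ordering of all words."""
    schedule = []
    for _ in range(n_blocks):
        block = list(words)
        rng.shuffle(block)
        schedule.append(block)
    return schedule

rng = random.Random(0)  # hypothetical seed, used only for reproducibility
conditioning = make_schedule(CONDITIONING_WORDS, n_blocks=7, rng=rng)

# 7 conditioning blocks plus 3 extinction blocks of 12 words each.
total_presentations = 7 * 12 + 3 * 12
print(len(conditioning), "conditioning blocks;", total_presentations, "presentations in all")
```

Seven conditioning blocks and three extinction blocks of 12 words give 120 stimulus presentations per S.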

To establish cardiac conditioning a painful 
shock was always presented at the termination 
of chain association to the word BOAT. The 
shock, lasting for 1 sec., coincided in time and 
duration with the light signal for stopping 
associations. There were no shocks in the 
extinction phase. 

The electrotachographic response of S was 
continuously recorded during the experi- 
mental period. Upon completion of the ex- 
tinction trials S was interrogated with a series 
of questions to estimate his awareness of 
shock frequency following each of the stimulus 
words. 

Procedure.—The S was seated in a com- 
fortable reclining chair facing a large wood 
screen that contained the ground glass pro- 
jection screen. All of the apparatus was 
concealed from the sight of S. The S was 
instructed as follows: 


We're going to do an experiment on 
chained association. A common word will 
be flashed for one-half second on this screen 
and you are to read that word and then go 
on speaking the words that come to your 




TABLE 1

COLOR SCHEME OF THE STIMULUS WORDS

Conditioning Stimulus List | Extinction Stimulus List
Word     Color             | Word       Color
BOAT     green             | BOAT       green
CHILD    green             | SEA        green
RICH     green             | FACE       green
PAPER    green             | INDUSTRY   green
BIRD     yellow            | SHIP       yellow
HOUSE    yellow            | SAIL       yellow
ALONE    yellow            | DRESS      yellow
BLUE     yellow            | PLOT       yellow
SOFT     red               | VESSEL     red
TREE     red               | WAVE       red
SKY      red               | NATURE     red
PEACE    red               | MONEY      red


mind. Do not try to make any meaningful 
sentences or any sense out of your free 
association. Rather, give your mind a free 
rein. It is best to be relaxed and not hold
back anything. Continue speaking one 
word after another until you notice this 
little light bulb flash for one second. This 
is the signal for you to stop. Then wait 
until the next word is flashed on the screen 
when you again begin your free associations 
as before. We will use only a limited 
number of words over and over again. 


The S was then given training with a few 
practice words and any questions he had were 
answered. Heart rate was recorded through 
Standard Lead 1 and shock was administered 
through electrodes placed on the right calf 
with 1}in. interelectrode distance. The S 
was told that electric shock was an integral 
part of the experiment and that we would 
begin by determining his threshold for pain. 
A number of shocks were then administered 
at increasing voltage levels until S reported 
the shock as quite painful. This was the 
level used for that S. 

Three basic groups of Ss were formed by 
adding to the above instructions in the follow- 
ing ways. Group I Ss were told that a shock 
would follow one particular word but that no 
other words would be followed by shock. 
They were further told when the extinction 
trials began by informing them there would be 
no further shocks. Group II Ss were told 
that following the presentation of a particular 
word in the list there would always be a shock 
and each of the remaining words in the list 




would be followed by one shock sometime 
during the presentation. They were further 
told that during the latter part of the experi- 
ment all the shocks would cease. Group III 
was told that a certain number of shocks 
would be administered at certain points of 
time during the experiment but E could not 
tell them beforehand when the shocks would 
come. These differences in instructions for 
the three groups were designed to lead to
different cognitive expectancies concerning
conditioning arrangements. In keeping with
these differences in instructions Group I Ss
were given only 7 shocks while Ss in Groups
II and III received 18 (7 to the word BOAT
and 1 each to the remaining 11 words present
during the conditioning trials).

Before beginning the actual conditioning 
phase an adaptation period was employed in 
which 12 training words were used. 

Upon termination of the extinction phase 
the electrodes were removed and all Ss were 
put through an interrogation as to their 
expectancies and beliefs concerning the nature 
of the experiment. In addition to questions 
designed to elicit S’s hypotheses as to the 
purpose of the experiment and the basis on 
which shock was administered, the list of
stimulus words was administered to S one
by one and he was asked to indicate the
number of times he thought shock had fol-
lowed each. This provided an estimate of S's
expectancy of shock for the different words.

Apparatus.—The stimulus words were
typed in small letters and mounted in 35- 
mm. slides. Colored cellophane sheets were 
wrapped around the slides to provide varia- 
tion in color. A Bell and Howell Robomatic 
slide projector was used in projecting the 
stimuli on the ground glass screen. Stimulus 
presentation, shock administration, and dura- 
tion were automatically programed and timed. 

A Grass Model 5P4 polygraph was used 
for continuous recording of Ss’ electrotacho- 
graphic responses. In addition to the pen 
recording the tachogram, three event marking 
pens were also triggered from appropriate 
leads from the timing equipment. They were 
used to indicate the onset and termination of 
the CS, of the stop signal, and the UCS. 

Shock was faradic stimulation from a
Harvard inductorium supplied with a 3-v. dc
source in the primary.

Subjects.—Seventy-six undergraduate stu-
dents (19 females and 57 males) in intro-
ductory psychology and education courses
were used. Assignment to groups was 
alternated on the basis of the order of their
appearance for the experimental session
except that the number of females in each
group was made roughly proportionate.³


RESULTS 


The response measure used in as- 
sessing conditioning, extinction, and 
generalization was a cardiac response 
difference score. The latencies be-
tween the four heart beats immedi-
ately prior to the presentation of the
stimulus were determined for each of 
the 120 stimulus presentations during 
the conditioning and extinction trials 
for each S. Also, the latencies be- 
tween the four beats immediately 
following presentation of the CS were 
determined. The cardiac response 
difference was obtained by subtracting 
the shortest post-CS latency from the 
shortest pre-CS latency on each pres- 
entation.! 
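The response measure can be sketched as follows. This is a minimal illustration under stated assumptions: beat times are taken as a list of R-wave times in milliseconds, the "latencies between the four heart beats" are read as the three intervals separating four successive beats, and the function name is hypothetical. The sign convention follows the text: the shortest post-CS latency is subtracted from the shortest pre-CS latency, so a positive score indicates cardiac acceleration.

```python
def cardiac_response_difference(beat_times, stimulus_onset, n_beats=4):
    """Shortest pre-stimulus inter-beat latency minus shortest post-stimulus one.

    beat_times: ascending beat times (msec.); stimulus_onset: same time scale.
    """
    pre = [t for t in beat_times if t < stimulus_onset][-n_beats:]
    post = [t for t in beat_times if t >= stimulus_onset][:n_beats]
    if len(pre) < n_beats or len(post) < n_beats:
        raise ValueError("not enough beats around the stimulus")
    pre_latencies = [b - a for a, b in zip(pre, pre[1:])]
    post_latencies = [b - a for a, b in zip(post, post[1:])]
    return min(pre_latencies) - min(post_latencies)

# Hypothetical record: 800-msec. latencies before the word, 700 msec. after it.
beats = [0, 800, 1600, 2400, 3100, 3800, 4500, 5200]
print(cardiac_response_difference(beats, stimulus_onset=3000))
```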




The 9 Ss in Group I were all able
to verbalize during the inquiry period
that shock had followed only the
critical word BOAT. The 11 Ss con-
stituting Group II had received a total
of 18 shocks: 7 to the word BOAT and
once each to the remaining 11 words
during the conditioning phase. Again
all these Ss reported during the in-
quiry that they had discriminated the
word BOAT from the remaining words
by the third or fourth presentation of
BOAT. On the other hand, none of the
other 56 Ss comprising Group III
were able to verbalize clearly and un-
equivocally that shock had followed
BOAT nearly all the time during the
conditioning trials and occurred only

3 After 11 Ss had been run in each group, 
the data were examined. Due to recording 
failures 2 Ss had to be discarded from Group I. 
The existence of conditioning in Groups I and 
II was at this point quite evident, but the
negative results in Group III led us to add an 
additional 45 Ss to this group in order to 
insure a more sensitive test of this condition. 

4 The cardiac UCR in the present experi-
ment was almost without exception a decrease
in latency between beats. 




once to each of the remaining words. 
However, when Ss were asked to state 
their expectancies for shock to each 
of the stimulus words, it was possible
to subdivide Group III into two
groups. Group III-A, the partially
aware group, consisted of 29 Ss who
met the criterion of reporting that
they had received three or
more shocks to the word BOAT and not
more than two shocks to any of the
remaining words. The remaining 27
Ss formed Group III-U. These Ss
reported frequency of shock to the
word BOAT less than or equal to two
times or reported more than two
shocks to BOAT but at the same time
reported an equal or greater number 
of shocks to one or more of the remain- 
ing words. This division of Group 
III was employed to permit a more 
refined analysis of the relationship 
between cognitive expectancy and 
conditioning and generalization. 
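The subdivision rule can be written out as a small predicate. The sketch assumes each S's inquiry data are available as a mapping from stimulus word to reported number of shocks; the function name is hypothetical. Group III-U is treated here simply as every S who fails the III-A criterion, which is how the text introduces it ("the remaining 27 Ss"), although the verbal description of III-U is slightly narrower.

```python
def is_partially_aware(reported_shocks, cs="BOAT"):
    """True if an S meets the Group III-A criterion: three or more shocks
    reported to the CS and not more than two to any other word."""
    cs_count = reported_shocks[cs]
    other_counts = [n for word, n in reported_shocks.items() if word != cs]
    return cs_count >= 3 and all(n <= 2 for n in other_counts)

# Hypothetical reports, shortened to three words per S.
reports = [
    {"BOAT": 4, "CHILD": 1, "RICH": 2},  # meets the III-A criterion
    {"BOAT": 2, "CHILD": 1, "RICH": 0},  # too few shocks reported to BOAT
    {"BOAT": 3, "CHILD": 3, "RICH": 1},  # as many reported to another word
]
for r in reports:
    print("III-A" if is_partially_aware(r) else "III-U")
```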

Conditioning and extinction.—Since 
conditioning is a monotonic increasing 
function of the number of reinforced
trials, a rigorous test of conditioning 
is to plot the number or percent of Ss 
in each experimental group who gave 
the maximum positive cardiac re- 
sponse difference to BOAT in each block 
of trials (chance level is 8.3%). 
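A minimal sketch of this criterion follows. It assumes that each S's scores for a trial block are held in a mapping from word to cardiac response difference, and that ties for the maximum are resolved in favor of the first word listed; neither detail is stated in the article, and the function name is illustrative. The chance level follows from there being one CS among 12 words.

```python
def percent_max_to_cs(block_scores, cs="BOAT"):
    """Percentage of Ss whose largest response difference in a block is a
    positive one to the CS.

    block_scores: one dict per S mapping each word to its response difference.
    """
    hits = 0
    for scores in block_scores:
        top_word = max(scores, key=scores.get)  # ties go to the first word listed (assumption)
        if top_word == cs and scores[cs] > 0:
            hits += 1
    return 100.0 * hits / len(block_scores)

chance_level = 100.0 / 12  # one CS among 12 words, about 8.3%

# Hypothetical scores (msec.) for three Ss, shortened to three words each.
example_block = [
    {"BOAT": 60, "CHILD": 10, "RICH": -5},
    {"BOAT": 5, "CHILD": 40, "RICH": 0},
    {"BOAT": 30, "CHILD": 20, "RICH": 25},
]
print(round(percent_max_to_cs(example_block), 1), round(chance_level, 1))
```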

The results of this measure of con- 
ditioning and extinction are plotted 
in Fig. 1. It is apparent that the con- 
ditioning varies from group to group. 
It is greatest in Group I, somewhat 
less so in Group II, and in Groups 
III-A and III-U the curves indicate 
that no conditioning occurred. 

The binomial test was used to de- 
termine whether the obtained propor- 
tions of Ss in each group giving the 
maximum response to BOAT on the 
last acquisition trial block differed 
significantly from the chance level. 
For Group I the probability is .003; 


for Group II, .06; for Group III-A,
.11; and .40 for Group III-U.
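A binomial test of this kind can be sketched as an exact upper-tail probability against the chance level of 1/12. The article does not report the observed counts or say whether an exact one-tailed computation was used, so the count in the example is hypothetical and the sketch should not be read as reproducing the probabilities given above.

```python
from math import comb

def binomial_upper_tail(k, n, p):
    """Exact P(X >= k) for X distributed as Binomial(n, p)."""
    return sum(comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k, n + 1))

p_chance = 1 / 12  # probability that BOAT gives the maximum response by chance

# Hypothetical count: 4 of 9 Ss giving their maximum response to BOAT.
print(binomial_upper_tail(4, 9, p_chance))
```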

The above method for assessing 
conditioning is quite rigorous and is 
designed to reveal the specificity of 
the stimulus-response relation. It is 
informative, however, to examine con- 
ditioning and extinction in the differ- 
ent groups in terms of an alternative 
criterion. First, the sizes of the re- 
sponse differences in each block for 
each word were ranked separately 
from small to large for each S and then 
the rank value for BOAT was averaged 
through Ss by group and trial block. 
Curves obtained in this way are shown 
in Fig. 2. 
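The rank measure can be illustrated with the sketch below. It assumes each S's response differences for a block are held in a mapping from word to score, ranks them from small (rank 1) to large, and assigns tied scores their mean rank; the treatment of ties and the function names are assumptions. With 12 words the mean rank expected by chance is (12 + 1) / 2 = 6.5.

```python
def rank_of_cs(scores, cs="BOAT"):
    """Rank (1 = smallest) of the CS score within one S's block; ties share the mean rank."""
    target = scores[cs]
    below = sum(1 for v in scores.values() if v < target)
    tied = sum(1 for v in scores.values() if v == target)  # includes the CS itself
    return below + (tied + 1) / 2

def mean_cs_rank(block_scores, cs="BOAT"):
    """Average the CS rank over all Ss in a group for one trial block."""
    return sum(rank_of_cs(s, cs) for s in block_scores) / len(block_scores)

# Hypothetical scores for two Ss, shortened to three words each.
group_block = [
    {"BOAT": 50, "CHILD": 10, "RICH": 20},
    {"BOAT": 10, "CHILD": 10, "RICH": 20},
]
print(mean_cs_rank(group_block))
```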

While this measure is more com- 
parable to the averaging techniques 
that have been used in previous 
autonomic conditioning studies it 
nonetheless reveals essentially the 
same conclusions as the previous 
analysis. Again Groups I and II 
show evidence of conditioning whereas 
Groups III-A and III-U do not. 

An analysis of variance following 


[Figure 1: percentage of subjects (ordinate) by blocks of trials (abscissa), conditioning followed by extinction. Curves: Group I (N = 9), Group II (N = 11), Group III-A (N = 29), Group III-U (N = 27), and chance expectancy.]

FIG. 1. Percentage of Ss whose maximum
cardiac response difference follows the stim-
ulus BOAT.


[Figure 2: mean rank by blocks of trials (abscissa), conditioning followed by extinction. Curves: Group I (N = 9), Group II (N = 11), Group III-A (N = 29), Group III-U (N = 27), and chance expectancy.]

FIG. 2. Mean rank of cardiac response differ-
ence which follows the stimulus BOAT.


Lindquist’s (1953) Plan I for the con- 
ditioning trials only was used to test 
the significance of the effects in Fig. 2. 
There was a significant Between-
Group effect (F = 6.35, df = 3/72,
P < .01) and a significant Group ×
Trial Blocks interaction (F = 2.27,
df = 18/432, P < .01). The signifi-
cant Between-Group effect was ana-
lyzed further by means of t tests using
the appropriate error terms from the
analysis of variance. On the seventh
trial block, the difference between
Groups I and II was not significant
(t = .14, df = 72), nor was the differ-
ence between Groups III-A and III-U
(t = .55, df = 72). However, both
Groups I and II differed significantly
from both Group III subgroupings
(t = 2.88, df = 72, and t = 2.95,
df = 72, respectively).

A separate but similar analysis of 
variance was applied to the extinction 
trials using the rank scoring. The 
only significant effect was Between
Groups (F = 3.32, df = 3/72, P < .05).
Further analysis by the t test revealed
the significant Between-Group effect 




to be due to the differences between 
Groups II and III-U. 

Cardiac responsiveness.—While the
above results are quite clear in in- 
dicating the role of cognitive ex- 
pectancies on the acquisition of con- 
ditioned heart rate responses, it is also 
informative to investigate the effects 
of these cognitive expectancies upon 
other characteristics of heart rate be- 
havior, specifically heart rate re- 
sponses to nonconditioned words. 
This was done by determining the 
cardiac response difference to all 
words except BOAT for each trial block
for each S. In Fig. 3 the average 
response difference for these 11 words 
has been plotted as a function of trial 
blocks during conditioning and ex- 
tinction for each of the four groups. 

There is a tendency for the cardiac 
response difference to words other 
than the CS to decrease throughout 
the conditioning and extinction trials. 
Also there is a marked difference be- 


[Figure 3: mean cardiac response difference in milliseconds (ordinate) by blocks of trials (abscissa), conditioning followed by extinction. Curves: Group I (N = 9), Group II (N = 11), Group III-A (N = 29), Group III-U (N = 27).]

FIG. 3. Mean cardiac response difference fol-
lowing all stimulus words excluding BOAT.




tween groups. Group I shows less 
reactivity to nonconditioned words 
throughout the conditioning and ex- 
tinction sessions with Group II falling
between Group I and Groups III-A 
and III-U. The latter two groups are 
indistinguishable in performance. 

The significance of the effects shown 
in Fig. 3 was evaluated by an analysis 
of variance using Lindquist’s Plan I.
In this analysis conditioning and ex- 
tinction trials were included in the 
same analyses. The only significant 
effects were for Between-Trial Blocks 
(F = 4.13, df = 9/648) and Between 
Groups (F = 10.03, df = 3/72). 

Since the biggest group difference 
in Fig. 3 is due to Group I, it was 
desired to determine whether the 
different cognitive expectancies be-
tween Group II and Groups III-A and
III-U had an effect upon the cardiac
response difference to the noncondi-
tioned stimuli. To determine this,
the above analysis was repeated using 
only the latter three groups. Again 
there was a significant Trial Block 
effect (F = 3.90, df = 9/576), but
the Between-Group effect (F = 1.1, 
df = 2/64) was not significant. 

The above analyses have suggested 
that heart rate conditioning and 
heart rate behavior are in part a 
function of Ss’ verbalized expectancies 
concerning shock. We can ask the 
reverse question as to whether differ- 
ences in heart rate behavior will pre- 
dict Ss’ verbalizations. For this 
question the data of Group III are 
available. The Ss in this group were 
asked to report the number of shocks 
they thought they had received to 
each of the stimulus words. Our pre- 
vious analysis would suggest that 
words having a large cardiac response 
difference should have a higher num- 
ber of reported shocks than words 
with a low cardiac response difference. 
To test this possibility the word 


giving the greatest cardiac response 
difference on the seventh conditioning 
trial was selected for each S along 
with the word producing the smallest 
cardiac response difference. The 
number of reported shocks to these 
two words was determined for each S 
in Group III. The mean number of
shocks reported to the word with the 
greatest cardiac response difference 
was 1.37 as compared with 1.03 for 
the word with the smallest difference. 
A t test for correlated scores gave a 
value of 1.71 significant at the .05 
level for a one-tailed test. 
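A t test for correlated scores can be sketched as the mean of the within-S differences divided by its standard error, with df equal to the number of Ss minus one. The data in the example are hypothetical and are not the reported shock estimates; the function name is illustrative.

```python
from math import sqrt
from statistics import mean, stdev

def paired_t(x, y):
    """t statistic and df for correlated (paired) scores x and y."""
    diffs = [a - b for a, b in zip(x, y)]
    n = len(diffs)
    standard_error = stdev(diffs) / sqrt(n)  # stdev uses the n - 1 denominator
    return mean(diffs) / standard_error, n - 1

# Hypothetical shock reports from six Ss: word with the largest vs. the
# smallest cardiac response difference.
largest = [2, 1, 3, 0, 2, 1]
smallest = [1, 1, 2, 0, 1, 2]
t_value, df = paired_t(largest, smallest)
print(round(t_value, 2), df)
```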

Tests of generalization.—In assessing
generalization only the cardiac re- 
sponses on the first trial block in the 
extinction series were examined. As 
will be recalled, the stimulus words 
for extinction contained not only the 
CS but words semantically similar 
and dissimilar and colors that were 
the same and different from the CS.

Detailed analyses of the data from 
Group III failed to yield any signifi- 
cant or suggestive evidence of either 
semantic or color generalization. 
Similarly, separate analyses of the 
data from Groups I and II, who had
shown conditioning, gave no evidence 
of generalization. Since the previous 
analyses had suggested that cognitive 
expectancy was an essential correlate 
of heart rate responses, and these Ss 
had not verbalized semantic or color 
relations, the lack of generalization 
might have been anticipated. 


DISCUSSION


Conditioning was evident in Groups I 
and II but here all Ss had clearly verbal- 
izable expectancies concerning the rela- 
tionship between stimulus and shock. 
The heart rate response of Ss in these two
groups shows a further correspondence 
with their cognitive expectancies in the 
extinction behavior. Group I Ss were 
told following the seventh conditioning 




trial that there would be no further 
shocks. The data are clear in showing 
almost complete extinction on the first 
extinction trial. Group II Ss on the 
other hand were not informed of the 
termination of the conditioning trials and 
there is little or no evidence of extinction 
of their heart rate response during the 
three extinction trial blocks. 

Group III Ss who had received a mini- 
mum amount of information concerning 
the relationship between the stimuli and 
shocks prior to the conditioning session 
show no evidence of having conditioned 
heart rate responses. Even when this
group of Ss is subdivided into subgroups 
based upon their verbal expectancies of 
shocks obtained postexperimentally there 
is no evidence that the subgroup showing 
some verbal discrimination between the 
shock and the nonshock stimuli has con- 
ditioned. It may be that it was only in 
the last of the conditioning trials that 
these Ss began to form cognitive expect- 
ancies concerning the relationship be- 
tween specific stimuli and shock. Thus 
there was insufficient time for condi- 
tioned heart rates to occur. 

But even among the Group III Ss 
there is some evidence of a relation be- 
tween cognitive expectation and heart 
rate behavior. When the stimulus word 
with the greatest heart rate response is 
compared with the smallest heart rate 
response on the last conditioning trial, 
it is found that these words are discrimi- 
nated in the terms of S's verbalized shock 
expectancies. 

The effect of cognitive expectancies is
also apparent on other aspects of heart 
rate behavior in the experimental situa- 
tion. Group I Ss who knew that only 
one word would be shocked during the 
experimental session showed an appreci- 
ably smaller heart rate response to the 
nonconditioned stimulus words through- 
out the conditioning and extinction 
session. Similarly Group II Ss who 
knew that words other than the CS would 
be shocked only once during the session 
showed less heart rate response to these 
nonconditioned stimuli than did the 
Group III Ss who had no definite expectation
of relationships between stimuli
and shock.

The absence of any stimulus general- 
ization along either the semantic or the 
color dimension is also consistent with 
the findings concerning the importance 
of cognitive expectancies on the occur- 
rence of heart rate response changes to 
the stimuli. Since conditioning occurred 
only in the Group I and II Ss general- 
ization can only be expected to occur in 
these groups. However, Ss in these 
groups had clearly verbalizable expect- 
ancies concerning the relationship be- 
tween the conditioned stimulus and 
shock. Since their verbalizations during 
the inquiry period did not express rela- 
tionships between colors or semantic class 
and the occurrence of shock, the hypoth- 
esis that cognitive expectancy determines 
heart rate behavior would predict a 
failure of generalization to occur among 
these Ss. 

While the data considered so far are 
quite unequivocal in demonstrating a 
relationship between heart rate behavior 
and cognitive expectancy the question 
can be raised as to whether the cognitive 
expectancy is necessarily prior to or a 
determiner of the heart rate phenomena. 
There are several factors in the present 
experiment that would indicate that cog- 
nitive expectancy is a determiner of the 
heart rate response rather than some- 
thing that develops concurrently with 
the conditioned heart rate. In previous 
studies the typical procedure has been 
to allow awareness or expectancies to 
develop along with the CR. In this type 
of design it is impossible to determine 
causal relationships. However, by ma- 
nipulating the expectancies of our Ss 
prior to the beginning of the conditioning 
sessions we have largely controlled the 
expectancies that existed in our Ss before 
conditioning of the heart rate occurred. 

There is further evidence of the pri- 
macy of cognitive expectancies in the 
extinction behavior of the Group I Ss. 
The knowledge that no further shocks 
would occur was sufficient to produce 
almost complete extinction without experiencing
the CS in the absence of the
UCS. This result is consistent with that
of Notterman et al. (1952). 

The results we have obtained are also 
quite consistent with those of Branca 
(1957), who reports marked correspond- 
ence between conditioned autonomic be- 
havior and Ss’ verbalizable expectancies 
in the conditioning situation. 

One further comment is in order con- 
cerning the low relationship between 
cognitive expectancy and heart rate 
behavior. While the above evidence 
shows a definite relationship between 
cognitive expectancies and heart rate be- 
havior the relationship is certainly not a 
very high one. Eriksen (1958) has 
pointed out elsewhere the existence of 
large noncorrelated errors between dif- 
ferent response systems such as the 
verbal and the autonomic and the present 
low relationships are probably a reflec- 
tion of the amount of error in cognitive 
expectancies as well as heart rate be- 
havior. In view of other evidence it is 
most likely that the largest source of 
error is in the heart rate response rather 
than in S's verbalization of his expectancies.


SUMMARY 


The conditioning and semantic and color 
generalization of the heart rate was studied as 
a function of different cognitive expectan- 
cies of the Ss. Cognitive expectancies were 
manipulated prior to conditioning by means 
of different instructions to the three experi- 
mental groups. 

A high correspondence was found between 
heart rate and verbalizable expectancies. 
Clear evidence of heart rate conditioning was 
obtained only in those cases where S could 
verbalize the relationship between the CS and 
UCS. Those Ss who were informed that 
there would be no more shocks at the be- 
ginning of the extinction trials showed almost 
a complete loss of the CR without experienc- 
ing nonreinforced presentations of the CS. 
Heart rate responses to nonconditioned stimuli
were also found to vary as a function of
cognitive expectancies. There was some indication
that observed differences in heart
rate could be used to predict differences in 
verbalized expectancies. 

There was no evidence of either semantic or 
color generalization of the conditioned heart 
rate. This finding was considered consistent 
with the above since Ss did not include such 
generalized expectancies in their verbal- 
izations. 


REFERENCES 


Branca, A. A. Semantic generalization at
the level of the conditioning experiment.
Amer. J. Psychol., 1957, 70, 541-549.

Chatterjee, B. B., & Eriksen, C. W. Conditioning
and generalization as a function
of awareness. J. abnorm. soc. Psychol.,
1960, 60, 396-403.

Diven, K. Certain determinants of conditioning
of anxiety reactions. J. Psychol.,
1937, 3, 291-308.

Eriksen, C. W. Unconscious processes.
In M. R. Jones (Ed.), Nebraska symposium
on motivation: 1958. Lincoln: Univer.
Nebraska Press, 1958. Pp. 169-227.

Haggard, E. A. Experimental studies in
affective processes: I. Some effects of
cognitive structure and active participation
on certain autonomous reactions during and
following stress. J. exp. Psychol., 1943,
33, 257-284.

Lacey, J. I., & Smith, R. L. Conditioning
and generalization of unconscious anxiety.
Science, 1954, 120, 1045-1052.

Lindquist, E. F. Design and analysis of
experiments in psychology and education.
Boston: Houghton Mifflin, 1953.

Notterman, J. M., Schoenfeld, W. N., &
Bersh, P. J. Partial reinforcement and
conditioned heart rate response in human
subjects. Science, 1952, 115, 77-79.

Razran, G. Stimulus generalization of conditioned
responses. Psychol. Bull., 1946,
46, 337-365.

Thorndike, E. L., & Lorge, I. Teacher's
word book of 30,000 words. New York:
Teachers College, Columbia University,
Bureau of Publications, 1944.


(Received August 5, 1961) 


Journal of Experimental Psychology 
1962, Vol. 64, No. 3, 280-287 


TEST OF THE HYPOTHESIS OF PSYCHOLOGICAL 
REFRACTORY PERIOD 1


JACK A. ADAMS 


University of Illinois 


Telford (1931) found that the 
second of two reaction time measures 
was lengthened when the spacing of 
two successive stimuli was reduced to 
500 msec., and his hypothesis of psy- 
chological refractory period has mo- 
tivated a number of confirming studies 
and analyses (Adams, 1961; Craik, 
1948; Davis, 1956, 1957, 1959; Marill,
1957; Vince, 1948; Welford, 1952, 
1959). A prominent interpretation of 
these findings is that an incoming 
stimulus is subjected to a central 
decision process before discharge oc- 
curs down the motor nerves, and a 
second stimulus impinging during this 
decision time is either disregarded, de- 
graded, or delayed in immediate mem- 
ory until the decision mechanism has 
been cleared. A general conclusion is 
that S is a one-channel data processing
system. 

Hick (1948), Poulton (1950), and 
Elithorn and Lawrence (1955) have 
suggested a counterexplanation in 
terms of expectancy. In this usage, 
“expectancy” refers to S having 
learned certain properties of the sta- 
tistically defined time relationships 
between the first and the second stim- 
ulus presented over a relatively long 
series, and S is thought to be most 
alert for responding when the inter- 
stimulus interval is somewhere in the 
vicinity of the mean delay. When a 
very short interval occurs, S is not
ready, and his response to the second
stimulus is lengthened. An implication
of the expectancy hypothesis is
that S can be a multichannel system
when conditions allow the acquisition
of appropriate expectancies.

1 This research was supported by the
United States Air Force under Contract No.
AF 49(638)-371, monitored by the Air Force
Office of Scientific Research of the Air Research
and Development Command. Acknowledgment
is due the assistance of William
F. McDonald and Luther J. Tromater.

All findings can be explained about 
equally well by both hypotheses, but 
a discriminating test seems possible by 
manipulating the statistical structure 
of interstimulus time intervals. So 
far, studies of this topic have used 
only highly uncertain, random inter- 
val distributions (low redundancy). 
The expectancy hypothesis would 
predict that refractoriness is a func- 
tion of the statistical structure of time 
intervals by making S more expectant 
for certain classes of intervals, while 
the one-channel hypothesis would regard
refractoriness as a function of intervals
but not of statistical structure.
The investigation reported here mani- 
pulated the redundancy of inter- 
stimulus intervals in a two-dimen- 
sional, bisensory discrete tracking 
task where an audio signal occurred 
with or lagged a visual signal by a 
defined time interval. 


METHOD 


Apparatus.—The discrete tracking apparatus
was used (Adams & Chambers, 1962).
This device can be used as a two-dimensional 
bisensory discrete tracking task where the 
visual and the audio inputs each have three 
discrete states, or as a one-dimensional track- 
ing task with either audio or visual stimuli. 
In the visual dimension, S had three hori- 
zontally arranged jeweled stimulus lights 
(red, white, and green) in front of him at eye 
level, and these lights came on in a repetitive 
sequence at defined time intervals (see below). 
Beneath each stimulus light was a small neon 


280 


PSYCHOLOGICAL REFRACTORY PERIOD 


response feedback light that informed S of 
the position of his control, and this direct 
display of response feedback cues, as well as 
stimulus lights, made the task one of pursuit 
tracking. The S had to keep the feedback 
light aligned with the frequently changing 
stimulus light as much as possible. The 
audio dimension had a 600-, 800-, and 1000- 
cycle pure tone as stimuli, and S heard them 
over a headset and responded with the same 
type of control as used for the visual dimen- 
sion. The auditory error coding was in 
terms of a complex tone. When S had the 
control in the correct position, he heard a pure 
tone but, when he was wrong, he heard a 
complex tone made up of two of the three 
fundamental frequencies. The correct stim- 
ulus was presented as a pure tone in the com- 
plex, and, superimposed on it was a second 
tone, rapidly interrupted, whose frequency 
was determined by the position of the control. 
Thus, when S was in error he had feedback on 
the correct stimulus and the wrong control 
position, and this was formally equivalent to 
the pursuit tracking format of the visual task. 
Moreover, being pursuit tracking, S had direct
feedback of his response time to stimulus 
change in each dimension. He could see how 
long it took for the visual feedback light to 
become aligned in the correct position, and 
similarly he could hear the duration of the 
interrupted tone. The instructions were to 
always respond and nullify error as quickly as 
possible and, in the bisensory task, to give 
equal attention to both dimensions. 

Response was with a 2-in. control stick 
mounted on the wide armrest of S's chair, 
and he could have a control for one or both 
hands, depending on whether tracking was 
unisensory or bisensory. The control moved 
freely through its arc, although electrically it
could only assume three states. Audio or 
visual stimuli could be switched to either con- 
trol. The direction of control movement was 
always horizontal. An Esterline-Angus oper- 
ations recorder allowed E to completely record 
all stimulus and response events and their 
time relationships on a trial. 

Procedure.—There were three groups of Ss, 
distinguished by the statistical distribution of 
time delay intervals governing the amount 
that an audio stimulus followed a visual one in 
two-dimensional bisensory tracking. Each 
group was given three practice sessions on 
different days, usually within the same week. 
A session was 12 2-min. trials, with 5 min. 
rest between Trials 4 and 5, and 8 and 9, and 
50 sec. intertrial rest for the remainder of the 
trials. The basic phenomenon of psychological
refractory period should be manifest
as delay in response to the audio stimulus
when it follows the visual stimulus too closely, 
and it is necessary to have a unisensory audio 
control condition to demonstrate this delay. 
In addition, it was considered advisable to 
have a unisensory visual control condition to 
see if time uncertainty influenced response to 
the visual stimulus as well as audio. Each S
provided his own unisensory control measures. 
For the 12 trials of a session, 4 were uni- 
sensory visual, 4 unisensory audio, and 4 
bisensory. The order of these three task 
conditions was counterbalanced among Ss of 
each group, and the particular order assigned 
to an S was the same on each session. Also, 
within each group, the assignment of visual 
and audio signals to left and right hands was 
counterbalanced. The operations recorder 
was used on Trials 2 and 3 of each block of 4 
trials in Session 3 to provide a detailed 
analysis of individual responses. 

Stimulus series.—The type, duration, and
interstimulus intervals of audio and visual 
events were programed on punched tape and 
automatically read by a motorized tape 
reader. A single stimulus input tape was 
constructed for each group, and it had 60 
audio-visual bisensory stimulus pairs on each 
trial. A given group used the same tape on 
each session. Whenever a unisensory trial 
was required, E disconnected the unwanted
stimulus series in the other sensory dimension 
from the presentation. The 60 visual events 
on a trial had durations of 1.5, 2.0, and 2.5 
sec., and there were 20 of each duration. The 
order of the 60 visual events was separately 
randomized for each trial. All groups had 
the same order of visual events on a given 
trial. 
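
The composition of the visual series on a trial can be illustrated with a minimal sketch. The function name and the seed argument are hypothetical; the original series were punched on tape, and nothing is known here about how the random orders were actually drawn. The sketch only shows that 20 events at each of the three durations fill one 2-min. trial.

```python
import random


def make_visual_durations(seed=None):
    """Return 60 visual event durations (sec.) for one trial.

    Twenty events at each of 1.5, 2.0, and 2.5 sec., in a random order.
    The seeding scheme is an assumption made for reproducibility only.
    """
    rng = random.Random(seed)
    durations = [1.5] * 20 + [2.0] * 20 + [2.5] * 20
    rng.shuffle(durations)
    return durations


durations = make_visual_durations(seed=1)
trial_length_sec = sum(durations)  # 20 * (1.5 + 2.0 + 2.5) sec.
```

The total of the 60 durations is 120 sec., which agrees with the 2-min. trial length given in the procedure.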

The audio delay intervals were in the range 
where the phenomenon of psychological re- 
fractory period was expected to be maximal, 
as well as somewhat beyond this range, and 
were 0, 100, 200, 400, and 800 msec. The 
approach was mainly to manipulate the fre- 
quency of audio delay intervals of 100 msec., 
where the phenomenon of psychological refractory
period is known to be high. If expectancy
is a significant explanatory mechanism
for behavior, the amount of degradation
in audio RT should be influenced by the 
frequency of events at the very small delay 
intervals. The S should be more expectant 
with a greater frequency of small intervals and 
consequently should have less decrement in 
audio RT. On the other hand, if expectancy 
is not a relevant explanatory framework, no 
difference should be expected. The statistical 
distribution of audio time delay intervals on a 
trial for each group, mean delay, and percentage
redundancy (Attneave, 1959) are shown
in Table 1. The groups were designated low
uncertainty (LU), medium uncertainty (MU),
and high uncertainty (HU). The order of
the delay intervals was separately randomized
for each trial.

TABLE 1
NUMBER AND PROBABILITY OF DELAY INTERVALS BETWEEN VISUAL AND
AUDIO STIMULUS ON A BISENSORY TRIAL FOR EACH GROUP

Audio Delay                      Group
Interval              LU          MU          HU
(in Msec.)          p    N      p    N      p    N
0                 .05    3    .10    6    .20   12
100               .80   48    .60   36    .20   12
200               .05    3    .10    6    .20   12
400               .05    3    .10    6    .20   12
800               .05    3    .10    6    .20   12
Mean (msec.)        150         200         300
% Redundancy         52          24           0
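
The summary rows of Table 1 can be checked with a short sketch. It assumes that percentage redundancy is relative redundancy, 100(1 - H/H max), with H the Shannon uncertainty of the delay distribution in bits and H max the uncertainty of five equally likely intervals. That formula is an assumption here, though it does reproduce the tabled values. The names in the code are illustrative.

```python
import math

DELAYS_MSEC = [0, 100, 200, 400, 800]

# Number of stimulus pairs at each delay on a trial (N columns of Table 1).
COUNTS = {
    "LU": [3, 48, 3, 3, 3],
    "MU": [6, 36, 6, 6, 6],
    "HU": [12, 12, 12, 12, 12],
}


def mean_delay(counts):
    """Mean audio delay (msec.) weighted by the frequency of each interval."""
    n = sum(counts)
    return sum(d * c for d, c in zip(DELAYS_MSEC, counts)) / n


def percent_redundancy(counts):
    """Relative redundancy, 100 * (1 - H / H_max), with H in bits (assumed formula)."""
    n = sum(counts)
    probs = [c / n for c in counts if c > 0]
    h = -sum(p * math.log2(p) for p in probs)
    h_max = math.log2(len(counts))
    return 100 * (1 - h / h_max)


summary = {
    group: (mean_delay(c), round(percent_redundancy(c)))
    for group, c in COUNTS.items()
}
```

Under these assumptions the computed means are 150, 200, and 300 msec., and the rounded redundancies are 52, 24, and 0%, for Groups LU, MU, and HU respectively.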

It should be emphasized that the experi- 
ment deals only with time uncertainty in 
tracking, not event uncertainty as investi- 
gated by Adams and Chambers (1962) with 
this task. Each sensory dimension was a 
repetitive series of the three stimulus events 
which simply required S to move the control 
back and forth. And, on the bisensory trials, 
the same audio and visual events were always 
paired together. Thus, the stimulus series 
always had event certainty. However, because 
the time patterning of events was statistically 
determined, the stimuli always had time un- 
certainty, which was the basic experimental 
variable. 

Subjects.—There were 18 Ss in a group.
The 54 Ss were university male under- 
graduates who were paid for their participa- 
tion. They were randomly assigned to 
groups. 

Performance measurement.—Overall pro- 
ficiency in discrete tracking, such as mea- 
sured by time on target, is a function of (a) 
off-target time between the onset of the 
stimulus and the onset of the response, 
whether the response is correct or not, (b) 
number of errors, i.e., movements of the 
control to the wrong position, and (c) duration 
of each error before it is corrected. Because 
the motor movements required were simple
and repetitive, errors were negligible and so
proficiency primarily was determined by the
off-target time of a correct response to change 
of a stimulus light or tone. These time values 
for individual responses are called response 
times (RT), and are distinguished from 
classical reaction time where special steps are 
taken to see that a response is nonanticipatory 
and always follows the stimulus (Woodworth, 
1938). In our discrete tracking task a re- 
sponse could follow a stimulus as in classical 
reaction time studies, or it could be anticipa- 
tory as might be expected from a time- 
sensitive S who had acquired expectancy 
states. A better understanding of expectancy 
was hoped for by freely allowing, measuring, 
and analyzing anticipatory responses. The 
RTs were measured as the difference, in milliseconds,
between the onset of a stimulus and
the occurrence of a correct response to it. 
Consistent with Poulton (1952), Adams and 
Xhignesse (1960), and Adams and Chambers 
(1962), a positive sign was assigned when the 
response preceded the stimulus, and a negative 
sign when it followed. 

The basic analysis was conducted on an 
RT score for an S, which was the algebraic 
mean of all his response times to individual 
stimuli of a particular set of stimulus events. 
Each S had a Unisensory Visual RT, a
Unisensory Audio RT, and, for each audio 
delay interval, a Bisensory Visual RT and a 
Bisensory Audio RT score. 
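
The sign convention and the RT score can be made concrete with a minimal sketch. The function names and the sample values are hypothetical. The only points taken from the text are that a response preceding its stimulus receives a positive sign and that the score is an algebraic mean.

```python
from statistics import mean


def signed_rt(stimulus_onset_msec, response_msec):
    """Signed response time in msec.

    Positive when the response precedes the stimulus (anticipatory),
    negative when it follows, per the convention described in the text.
    """
    return stimulus_onset_msec - response_msec


def rt_score(signed_rts):
    """Algebraic mean of signed RTs; anticipations offset lagged responses."""
    return mean(signed_rts)


# Hypothetical clock times (msec.) for three stimulus events and responses.
events = [(1000, 1250), (3000, 3180), (5000, 4960)]
rts = [signed_rt(stim, resp) for stim, resp in events]  # [-250, -180, 40]
score = rt_score(rts)
```

Because the mean is algebraic, a single anticipatory response of +40 msec. pulls the score toward zero; this is why anticipatory and nonanticipatory responses are later scored separately.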


RESULTS 


FIG. 1. Mean response times to unisensory
and bisensory visual and audio stimuli.
(Abscissa: audio delay interval, in milliseconds.)

Unisensory-bisensory audio comparisons.—Figure 1 is a plot of group
means for Unisensory and Bisensory
RT scores for visual and audio. The 
lower part of Fig. 1 presents the data 
that are most critical for assessing the 
one-channel hypothesis. The hy- 
pothesis predicts that Bisensory Audio 
RT will be lengthened when the inter- 
stimulus interval is very brief because 
S must have a finite period of time to 
process the visual stimulus and its 
response. Figure 1 shows the ex- 
pected effect, and it is evident for all 
groups, particularly at the zero audio 
delay interval. Using the t test for
related measures, a comparison was
made for each group between Unisensory
Audio RT scores and Bisensory
Audio RT scores at delay intervals
of 0, 100, and 200 msec., where
the effects of psychological refractory 
period should be most evident. For 
all three groups, the Bisensory Audio 
RT was significantly poorer (P < .01) 
than the Unisensory Audio RT at the 
zero interval, but only Group HU had 
significant retardation at the 100- 
(P < .01) and 200- (.01 < P < .05) 
msec. audio delays. Beyond the three 
initial points, the Bisensory Audio 
RTs had near zero or positive values 
and indicate the presence of anticipa- 
tory responding. 

The greater persistence of signifi- 
cant decrement in the Bisensory Audio 
RT scores for Group HU is in line 
with the expectancy hypothesis. 
However, an analysis of variance of 
Unisensory Audio RT scores gave a 
significant F ratio (.01 < P < .05),
with Group HU having the lowest 
score, which could accentuate the 
differences. Examination of the uni- 
sensory audio data suggested that this 
was due to the differential presence of 
anticipatory responding among the 
groups. Because virtually all studies 
of psychological refractory period 
have used discrete reaction time tasks 
where special steps are taken to avoid
the influences of anticipation in performance
measures, it is of special
interest to see if the same support can 
be found for the expectancy hypoth- 
esis when only nonanticipatory meas- 
ures are used. Using the same ap- 
proach as related experiments that 
dealt with anticipation (Adams & 
Chambers, 1962; Adams & Xhignesse, 
1960), three classifications of in- 
dividual RT measures were made as a 
means of examining anticipatory be- 
havior: Beneficially Anticipatory RT 
values in the range of ±133 msec.
that had less off-target time than ideal 
RT values (Klemmer, 1956, 1957) 
and, by giving little or no off-target 
time, allow the reasonable inference 
that positive anticipatory mechanisms 
were operating; Nonanticipatory RT 
values which were less than —133 
msec., and in the range for classical 
reaction time where S’s response 
occurs substantially after the stim- 
ulus; and Detrimentally Anticipatory 
RT values which were greater than 
+133 msec. where S responded well 
ahead of the stimulus and could net 
as much, and often more, off-target 
time than if he had waited for the 
stimulus to occur and responded as in 
classical reaction time. An S's RT 
score in each of these classifications 
was the algebraic mean of all his 
individual RTs in a classification for a
particular experimental condition. 
Groups LU, MU, and HU had 3, 4, 
and 11% of their individual uni- 
sensory audio RTs, respectively, that 
could be classified as either beneficially 
or detrimentally anticipatory. Using 
only Nonanticipatory Audio RTs, a 
Nonanticipatory Audio RT score was 
computed for unisensory audio and 
for bisensory audio under delay conditions
of 0, 100, and 200 msec. A t
test for related measures was made
between unisensory and bisensory
audio under each of the three delay
conditions. None of the tests
achieved the .05 level of significance 
for Group LU, and Group MU had a 
t ratio at the .05 level for the zero 
interval only. Group HU had a t 
ratio significant at better than the .01 
level for the zero delay, and t's sig- 
nificant at the .05 level for both the 
100- and 200-msec. intervals. The 
same trend in support of the ex- 
pectancy hypothesis is evident as 
before. 
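
The three-way classification of individual RTs can be sketched as follows. The sketch reads the beneficially anticipatory range as within 133 msec. of the stimulus on either side, and it treats the boundary values as beneficial; the text does not say how a value of exactly 133 msec. was scored, so that choice is an assumption. The names and sample values are hypothetical.

```python
def classify_rt(rt_msec, bound=133):
    """Classify one signed RT (positive = response preceded the stimulus)."""
    if rt_msec < -bound:
        return "nonanticipatory"
    if rt_msec > bound:
        return "detrimentally anticipatory"
    # Boundary values (exactly +/- bound) are counted here by assumption.
    return "beneficially anticipatory"


def class_scores(signed_rts, bound=133):
    """Algebraic mean RT within each classification that has any entries."""
    groups = {}
    for rt in signed_rts:
        groups.setdefault(classify_rt(rt, bound), []).append(rt)
    return {name: sum(vals) / len(vals) for name, vals in groups.items()}


# Hypothetical signed RTs (msec.) for one S under one condition.
sample = [-260, -240, -90, 50, 210]
scores = class_scores(sample)
```

With these sample values the Nonanticipatory score is -250 msec., the Beneficially Anticipatory score is -20 msec., and the Detrimentally Anticipatory score is +210 msec.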

Unisensory-bisensory visual compari- 
sons.—The upper part of Fig. 1 shows 
group mean RTs for unisensory and 
bisensory visual performance. An 
analysis of variance test revealed uni- 
sensory visual performance to be sig- 
nificantly poorer than unisensory au- 
dio (P < .01). While this is the tra- 
ditional finding of visual RT being 
slower than audio RT, the two uni- 
sensory series are not directly com- 
parable because the unisensory visual 
series had less time uncertainty than 
unisensory audio. The dominant
trend in bisensory is for Ss to temporarily
withhold their visual response
as a function of the audio delay
interval, and this trend is about the
same for all groups. As with audio
performance, t tests for related measures
were run between unisensory and
bisensory visual RT scores. None
were significant at the zero delay
interval but all were significant at 100
msec. and beyond (P < .05).

FIG. 2. Mean Difference RT values showing
time lag between the visual and the audio
response in bisensory tracking as a function
of the audio delay interval. (Separate curves
for the high, medium, and low uncertainty groups;
abscissa: audio delay interval, in milliseconds.)

Characteristics of bisensory performance.—To
see if further insight could
be obtained into reasons for the patterns
of response decrement, a new
measure was devised for the bisensory
data, called the Difference RT, which
gave the time between the two
bisensory responses. This measure
was used to check whether the amount
and trend of decrement could be
related to the way in which Ss lagged
the audio response behind the visual
response as a function of uncertainty
of the series. The expectancy hypothesis
suggests that Group HU
would lag the most because they
would have developed higher expectancies
for longer delay intervals.
The Difference RT is defined by the
formula (Bisensory Visual RT) - (Bisensory
Audio RT) + (Audio Delay
Interval), where the value for the
delay interval is always positive, and
the algebraic convention for RT is
retained. The Difference RT is positive
when the audio response follows
the visual response, and negative
when it precedes. Figure 2 shows the
plot of mean Difference RTs. All
individual RT values, plus and minus,
entered the group means in Fig. 2.
The three groups all have the same
general trend, with Group HU at a
higher level throughout (longer lags).
Figure 2 shows that when the audio
delay interval was zero or quite small,
Ss lagged the audio stimulus by a
small amount. But as the audio
delay interval increased, Ss increased
the lag of audio responses correspondingly.
It would appear that the
simultaneous occurrence of the visual
and the audio stimulus is a cue for a
rapid visual response and a small lag
of the audio response. But if there
is audio delay, and the visual stimulus
is on momentarily by itself, then these
delay states of affairs become a cue
for delays in the visual response and
longer lags of the audio response,
although Fig. 1 shows that when the
delay interval is 800 msec. the lag is
poorly timed because Audio RTs are
detrimentally anticipatory by 200-
300 msec. Thus, S learns to interpret
the immediate temporal properties of
stimuli, and responds differentially to
them, and Fig. 2 shows that the over-all
probability structure of delay
intervals tends to differentially influence
group mean performance in each
case. The small intervals of 0, 100,
and 200 msec. are most critical for
issues in question, and an analysis of
variance (Lindquist, 1953, Type I)
was performed on the Difference RTs,
with uncertainty a between-Ss variable
and audio delay a within-Ss
variable. Both main effects were
significant at the .02 level. The same
analysis was performed on Difference
RTs computed from nonanticipatory
responses, and again both main effects
were significant (P < .01).

TABLE 2
PERCENTAGES OF THE THREE POSSIBLE ORDERS OF BISENSORY RESPONSE PAIRS

                          Group and Audio Delay Interval (in Msec.)
             Low Uncertainty (LU)      Medium Uncertainty (MU)   High Uncertainty (HU)
Pair            0 100 200 400 800 All     0 100 200 400 800 All     0 100 200 400 800 All
Visual first   36  30  37  40  44  32    32  34  40  55  41  37    56  53  60  57  61  57
Audio first    25  15  14  20  16  16    28  18  24  16  20  20    11  13  12  12  10  12
Simultaneous   39  55  49  40  40  52    40  48  36  29  39  43    33  34  28  31  29  31

Note.—Percentages are based on the total number of correct response pairs made by a group under the conditions
specified by a column heading. "All" denotes all intervals combined.
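
The Difference RT formula can be checked against a direct timeline computation. In the sketch below the visual stimulus is placed at time zero and the audio stimulus at the delay interval; because a positive RT means the response preceded its stimulus, each response time on the clock is the stimulus time minus the signed RT. The function names and the sample values are hypothetical.

```python
def difference_rt(bisensory_visual_rt, bisensory_audio_rt, audio_delay_msec):
    """Difference RT (msec.) as defined in the text.

    Positive when the audio response follows the visual response.
    RTs are signed: positive = response preceded its stimulus.
    """
    return bisensory_visual_rt - bisensory_audio_rt + audio_delay_msec


def lag_from_timeline(bisensory_visual_rt, bisensory_audio_rt, audio_delay_msec):
    """Same quantity computed from clock times, as a consistency check."""
    visual_response_time = 0 - bisensory_visual_rt
    audio_response_time = audio_delay_msec - bisensory_audio_rt
    return audio_response_time - visual_response_time


# Hypothetical pair: visual response 300 msec. after the light,
# audio response 350 msec. after a tone delayed by 100 msec.
lag = difference_rt(-300, -350, 100)
```

For this pair the visual response occurs at 300 msec. and the audio response at 450 msec., so both functions give a lag of 150 msec.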

The Difference RTs show that 
Group HU lags the audio response 
more, which means longer RTs when 
audio delay intervals were small, and 
thus the decremental effect which is 
the evidence for the one-channel hypothesis.
A related implication is
that Group HU should have more 
bisensory response pairs occur in a 
visual-audio sequence rather than in 
a joint, simultaneous fashion. To 
evaluate this, the three possible 
sequences of response pairs were tabu- 
lated for each group, and the results 
are shown in Table 2. The order of 
responses can be either visual first, 
audio first, or simultaneous respond- 
ing. The criterion for simultaneity 
was the two responses occurring within ±33
msec. (Adams & Chambers, 1962). 
It is noteworthy that each audio delay 
interval has a percentage of response 
pairs in each of three orders, and 
there is no marked tendency towards 
“grouping,” or simultaneous respond- 
ing, to be particularly concentrated 
at very small intervals as Vince (1948) 
and Welford (1959) hypothesize would 
occur because of a stimulus pair being 
perceived as an entity under these 
conditions. Group differences are 
evident, particularly when the per- 
centage is taken over all intervals, and 
the trends are ordered in accord with 
the expectancy view. Group HU had 
a dominant tendency to make the 
visual response first and lag the audio 
response. Group LU had stimulus 
pairs that almost always occurred 
with a very small time separation, and
they tended to respond with more
simultaneity. Group MU had an
intermediate position. To test these 
differences, each S was given a score 
in each of the three categories of re- 
sponse order that was the total num- 
ber of responses made over all delay 
intervals. These were called the 
Visual First score, the Audio First 
score, and the Simultaneous score. A 
simple analysis of variance was per- 
formed for each of the three sets of 
scores, and the Visual First scores 
and the Simultaneous scores had be- 
tween-groups differences that were 
significant at the .01 level. Audio 
First scores were significant between 
the .01 and .05 levels. The importance
of Audio First pairings is not
readily interpretable, but it could 
represent a tendency towards error 
in closely timed simultaneous re- 
sponding. 
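
The tabulation of response order can be sketched from the Difference RT. The sketch assumes that a pair is simultaneous when the two responses fall within 33 msec. of each other in either direction, and that a difference of exactly 33 msec. counts as simultaneous; the boundary rule is an assumption. The names and sample values are hypothetical.

```python
from collections import Counter

ORDERS = ("visual first", "audio first", "simultaneous")


def response_order(difference_rt_msec, window=33):
    """Order of a bisensory response pair from its Difference RT (msec.)."""
    if abs(difference_rt_msec) <= window:
        return "simultaneous"
    # A positive Difference RT means the audio response followed the visual.
    return "visual first" if difference_rt_msec > 0 else "audio first"


def order_percentages(difference_rts, window=33):
    """Percentage of correct response pairs falling in each order."""
    counts = Counter(response_order(d, window) for d in difference_rts)
    n = len(difference_rts)
    return {order: 100 * counts[order] / n for order in ORDERS}


# Hypothetical Difference RTs (msec.) for four correct response pairs.
percentages = order_percentages([150, 20, -10, -80])
```

For these four pairs the percentages are 25 visual first, 25 audio first, and 50 simultaneous; the three always sum to 100, as do the columns of Table 2.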


DISCUSSION


The results are consistent with the 
expectancy hypothesis. Decrement in 
response to the second of two closely 
spaced stimuli was greatest when the 
stimulus series had high time uncer- 
tainty, and was reliably less when time 
uncertainty was moderate or low. The 
reason was that Ss in the high uncer- 
tainty group had a greater likelihood of 
receiving a visual-audio stimulus se- 
quence with a relatively long audio delay, 
and they learned to respond more fre- 
quently with a visual-audio order and to 
lag the audio response longer. The Ss 
had an expectation, or set, for the visual 
stimulus to come on first and for the
audio stimulus ordinarily to be delayed
and require a lagged response. Even
when the two stimuli were presented 
simultaneously and Ss had direct and 
immediate information that a lag was 
not required, the audio response was still 
delayed. In fact, the Bisensory Audio 
RTs at the zero interval were the longest 
of any obtained. Thus, in this study, 
the decremental effect that has come to 
exemplify psychological refractory period
emerges as a learned tendency to respond
with a visual-audio sequence and to lag
the audio response when extensive practice
has been given under conditions of
temporal uncertainty.

The findings do not allow unequivocal 
rejection of the one-channel hypothesis, 
because even Group LU with low tem 
poral uncertainty in their stimulus series 
had a significant amount of decrement 
in Bisensory Audio RT scores for stim- 
ulus pairs that occurred simultaneously. 
Remembering the wealth of temporal 
anticipation effects present in the data, 
this finding could be interpreted to mean 
that there is a one-channel decision mech- 
anism that momentarily delays each S-R 
sequence, but that it cannot find a mean- 
ingful place in any scientific account of 
human behavior that does not give 
central focus to temporal expectancy 
states. Nevertheless, when these data
are weighed with those of the Adams and
Chambers (1962) study, there are de-
fensible grounds for questioning the one-
channel view. Using the same task,
Adams and Chambers presented findings 
that dovetail with those here because one
of their experimental conditions was com-
plete temporal and event certainty. Not
only did they find impairment absent,
but they found that time and event cer-
tainty were the conditions for bisensory
performance actually being superior to
the aggregate unisensory control per-
formances because of the influences of
temporal anticipation. The conclusion 
from these two investigations is that 
given sufficient temporal certainty of
events, S can process at least two simul-
taneous stimulus series with the same
proficiency as a single stimulus series,
providing there is event certainty. It
remains to be fully determined, however,
whether a necessary central delay time
exists for the resolution of event uncer-
tainty. Adams and Chambers found first
evidence for impairment under condi-
tions of event uncertainty and temporal
certainty.


SUMMARY 


An experiment was performed to test the
hypothesis of psychological refractory period
that is offered to account for the established
finding that response to the second of two 
closely spaced stimuli shows decrement. One 
line of explanation argues for a central deci- 
sion time, where time must be allowed for 
processing the first stimulus and response 
before the second sequence can be undertaken. 
A competing explanation is the expectancy 
hypothesis which ascribes decrement to S's 
past experience with the random array of 
interstimulus intervals that is usually used in 
experiments on this topic. Through practice, 
S comes to expect a longer delay and the 
decrement is because he is not optimally 
ready to respond. 

The experiment involved a two-dimen- 
sional, bisensory discrete tracking task. The 
statistical structure of interstimulus time 
intervals was the experimental variable aimed 
towards discriminating between the two 
hypotheses by asking if decrement could be a 
function of the temporal organization of 
stimuli. The results supported the expect- 
ancy hypothesis. Reliably less decrement 
was found for Ss who trained on a stimulus 
series with a predominance of small time in- 
tervals and could learn behavior appropriate 
to them. 


REFERENCES 


Adams, J. A. Human tracking behavior.
Psychol. Bull., 1961, 58, 55-79.

Adams, J. A., & Chambers, R. W. Response
to simultaneous stimulation of two sense
modalities. J. exp. Psychol., 1962, 63,
198-206.

Adams, J. A., & Xhignesse, L. V. Some
determinants of two-dimensional visual
tracking behavior. J. exp. Psychol., 1960,
60, 391-403.

Attneave, F. Applications of information
theory to psychology. New York: Holt,
1959.

Craik, K. W. J. Theory of the human
operator in control systems: II. Man as an
element in a control system. Brit. J.
Psychol., 1948, 38, 142-148.

Davis, R. The limits of the “psychological
refractory period.” Quart. J. exp. Psy-
chol., 1956, 8, 24-38.

Davis, R. The human operator as a single
channel information system. Quart. J. exp.
Psychol., 1957, 9, 119-129.

Davis, R. The role of “attention” in the
psychological refractory period. Quart. J.
exp. Psychol., 1959, 11, 211-220.

Elithorn, A., & Lawrence, C. Central
inhibition: Some refractory observations.
Quart. J. exp. Psychol., 1955, 7, 116-127.

Hick, W. E. The discontinuous functioning
of the human operator in pursuit tasks.
Quart. J. exp. Psychol., 1948, 1, 36-44.

Klemmer, E. T. Time uncertainty in simple
reaction time. J. exp. Psychol., 1956, 51,
179-184.

Klemmer, E. T. Simple reaction time as a
function of time uncertainty. J. exp. Psy-
chol., 1957, 54, 195-200.

Lindquist, E. F. Design and analysis of ex-
periments in psychology and education.
New York: Houghton Mifflin, 1953.

Marill, T. The psychological refractory
phase. Brit. J. Psychol., 1957, 48, 93-97.

Poulton, E. C. Perceptual anticipation and
reaction time. Quart. J. exp. Psychol.,
1950, 2, 99-112.

Poulton, E. C. Perceptual anticipation in
tracking with two-pointer and one-pointer
displays. Brit. J. Psychol., 1952, 43, 222-
229.

Telford, C. W. Refractory phase of volun-
tary and associative responses. J. exp.
Psychol., 1931, 14, 1-35.

Vince, M. A. The intermittency of control
movements and the psychological refrac-
tory period. Brit. J. Psychol., 1948, 38,
149-157.

Welford, A. T. The “psychological refrac-
tory period” and the timing of high-speed
performance: A review and a theory.
Brit. J. Psychol., 1952, 43, 2-19.

Welford, A. T. Evidence of a single-
channel decision mechanism limiting per-
formance in a serial reaction task. Quart.
J. exp. Psychol., 1959, 11, 193-210.

Woodworth, R. S. Experimental psychology.
New York: Holt, 1938.


(Received August 14, 1961) 


Journal of Experimental Psychology 
1962, Vol. 64, No. 3, 288-294 


THE COURSE OF EMOTIONALITY IN THE DEVELOPMENT 
OF AVOIDANCE 1 


HOWARD S. HOFFMAN AND MORTON FLESHLER


Pennsylvania State University 


Dual process theories of avoidance 
(Hull, 1943; Miller, 1951; Mowrer, 
1950; Solomon & Wynne, 1953) are
predicated on the assumption that the 
acquisition of the instrumental avoid- 
ance response involves, and moreover 
depends upon, the concurrent acquisi- 
tion of a conditioned emotional re- 
sponse (sometimes identified as fear or 
anxiety). The purpose of the present 
investigation was to assess the condi- 
tioned emotional responses (CERs) 
that occur during the acquisition of 
avoidance. 

The results of such a study provide 
a test of dual process theory because 
the failure to demonstrate a meaning- 
ful relationship between emotional 
and instrumental behavior would cast 
considerable doubt on the dual process 
position. To the extent that mean- 
ingful relationships are observed, how- 
ever, the results would provide an 
opportunity to evaluate the sequence 
of changing interactions between in- 
strumental and emotional behaviors 
which, according to dual process 
theory, occur during acquisition. 
Theoretical specification of this se- 
quence of interactions has, of neces- 
sity, been somewhat speculative. Ex- 
periments which have documented the 
acquisition of avoidance behavior 
have either inferred the state of 
concurrent emotionality from crude 
indices (bolus counts, incidence of 

1 This research was supported by National
Institute of Mental Health Grant M-2433,
and also by a grant from the Pennsylvania
State University Central Fund for Research.
The authors wish to thank Peter Day and
Edmund Sequin for their assistance in the
course of this study.


freezing, trembling, etc.) or more 
frequently from the effects of various 
experimental operations upon the 
avoidance response itself (Solomon & 
Brush, 1956). Studies which have 
focused upon the development of 
emotional responses, on the other 
hand, have seldom employed avoi- 
dance procedures. One exception, 
however, is a study by Black (1956) 
which assessed cardiac reaction during 
the acquisition of avoidance. Al- 
though Black’s results appear to offer 
some support for dual process theory, 
they are difficult to interpret because 
he also found that about two-thirds of 
the total cardiac reaction could be 
attributed to the muscular occurrence 
of the instrumental response itself. 
The approach of the present study 
was to track the course of emotionality 
by developing an avoidance response 
while Ss were engaged in positively 
reinforced ongoing behavior. In this 
paradigm, the index of emotionality 
is the decrement in rate of positively 
reinforced responding (conditioned 
suppression) which occurs during the
presentation of a warning signal which
precedes electrical shock. Condi- 
tioned suppression has been examined 
extensively and found to provide a 
sensitive and reliable index of emo- 
tional responses (Brady & Hunt, 
1955; Estes & Skinner, 1941). The 
present study differs from the usual 
conditioned suppression experiment in 
one major respect. In studies of
conditioned suppression, the noxious 
event is unavoidable. In the present 
arrangement, on the other hand, a
specific instrumental response, during
the CS, terminates that stimulus and 
prevents the noxious event. 


METHOD 


Subjects.—The Ss were 12 female rats of
Sprague-Dawley stock. They were approxi- 
mately 10 mo. old at the start of the ex- 
periment. 

Apparatus.—The experimental chamber 
was a sound insulated Skinner box fitted with 
a one-way vision observation port. In the 
middle of the front wall was a recess to which 
Noyes food pellets were delivered. On either 
side of the recess was a manipulandum. To 
the right was a bar of 0.25 × 0.75 in. alumi-
num which projected 1.5 in. into the test
chamber at a height of 1.5 in. and which 
actuated a microswitch when a downward 
force of 20 gm. was applied. At the left, 1.5 
in. above the floor was a plate (1.5 in. square) 
which protruded 0.5 in. from, and was parallel 
to, the front wall. When a horizontal force 
of at least 15 gm. was applied perpendicular 
to the plate, it actuated a second microswitch. 

The walls, manipulanda, and grid floor 
were wired to carry electrical shock, and 
during shock the polarity of the grid bars, the 
manipulanda, and the walls was continu- 
ously scrambled, so as to make unauthorized 
escape highly improbable. The shock power 
was supplied by an Applegate constant cur- 
rent stimulator set at 1.5 ma. 

Acoustic signals were delivered through a 
5-in. speaker mounted on the back wall of the 
chamber. Out of a second speaker, mounted 
at the side, white noise was continuously 
presented in order to mask sounds having ex- 
ternal origin. The warning signal was a 
pure tone at 3500 cps with an intensity of 
88 db. re .0002 dynes per cm.², when measured
in front of the speaker. The tone was 
generated by a Hewlett-Packard audio 
oscillator. 

A series of timers, steppers, and relays 
was used to establish the several stimulus- 
response contingencies which the research 
demanded. The circuitry was such that a 
response (either bar press or plate press) was 
defined as the initial closure of the correspond- 
ing microswitch. Thus, holding responses 
had no effect on the program. 

All stimuli and responses were recorded on 
an Esterline-Angus operations recorder. In 
addition, counters were used to record the 
number of responses occurring during the tone 
and during the periods which preceded and 
followed each tone. A Standard Electric
timer was used to measure the latency of the 
bar press response to the tone. 

Procedure.—All Ss were treated alike in
each of the several stages of training. The
rats were first taught to escape from shock.
After 20 min. of adaptation to the box, shock 
was turned on periodically; only a bar press 
terminated shock. After 50 presentations of 
shock, the median escape latency had stabi- 
lized at 0.75 sec. Twenty-five additional 
shocks were then delivered on each of two 
successive sessions in order to establish the 
bar press to terminate shock, as a well-learned 
habit. During these two sessions, the median 
escape latencies were 0.74 and 0.75 sec., re-
spectively. The Ss were then placed on
restricted feeding and from then on were 
maintained at 75% of their previously deter- 
mined free feeding body weights. During the 
period of weight reduction Ss were placed in 
the box with the bar removed, so that 
generalized emotional responses could ex- 
tinguish. In successive sessions, Ss were 
trained to the food magazine, shaped to press 
the plate and were run on a six-response fixed 
ratio schedule of reinforcement. (The Ss 
were exposed to the fixed ratio schedule in 
order to produce efficient response topog- 
raphies.) A variable interval schedule of 
reinforcement with a mean of 30 sec. was in 
effect for the remainder of the experiment. 
Twenty sessions of plate pressing for food on 
the VI schedule were given to permit response 
rates to stabilize. By the end of these ses- 
sions, the rates had leveled off at a median 
value of 42 responses per min. These and all 
subsequent sessions each lasted 2.5 hr. and 
were run every other day. 

During the next two sessions, while Ss 
pressed the plate for food, 20 tones, each 
lasting 60 sec., were presented without shock 
at intervals of 5 to 7 min. This was done so 
that in later phases of the study, it would be 
possible to determine whether or not observed 
decrements in plate pressing during tone were 
attributable to the pairing of tone and shock. 

The procedure was the same for the next 
three sessions except that the bar had been 
replaced. In these sessions a bar press which 
occurred during a tone terminated that tone. 
This procedural detail was initiated in order 
to determine whether or not, prior to its 
pairing with shock, the offset of tone would 
reinforce the bar press. 

In the following session, avoidance training 
was initiated. The tone was programed to 
remain on for 70 sec. with shock programed 
to occur during its final 10 sec. A bar press 
at any time during the tone, but prior to 


Fig. 1. Measures of avoidance and con-
current indices of emotionality throughout the
course of acquisition. (AD refers to the final
session of tone adaptation.) [Figure not
reproduced. Panel and axis labels: Measures
of avoidance bar press; Latency of avoidance
responses; Median latency (seconds); Percent
response; Avoidance trials; Nonavoidance
trials; During posttone period; Sessions.]


shock onset, terminated that tone and pre- 
vented the occurrence of the shock. A bar 
press during shock terminated both the tone 
and the shock. Since the program of VI 
food reinforcement for plate presses was 
independent of the sequence of tone-shock 
pairings, food reinforcement could occur at
any time during either the tone or the shock. 
Avoidance training was terminated after 12 
sessions (20 tones per session). 

Experimental measures.—For each S on
each trial, the following information was ob- 
tained: (a) whether or not shock was avoided; 
(b) whether or not a bar press occurred in the 
60-sec. interval that ended with the onset of 
tone (hereafter this interval will be identified 
as the pretone period); (c) the latency of the 
bar press during tone; (d) the rate of plate 
press during the 60-sec. pretone period, i.e., 
the number of plate presses during the pretone 
period divided by 60; (e) the rate of plate 
press during the tone, i.e., the number of plate
presses during the tone divided by the dura- 
tion of the tone (as determined by the latency 
of the bar press); and (f) the rate of plate 
press during the 60-sec. posttone period that 
began with offset of tone. 

On each trial and for each S, Measures d 
and e and Measures d and f were then com-
bined to form two suppression ratios. The
first suppression ratio was the rate of plate
press during tone divided by the rate of plate
press during the pretone period, and serves as 
an index of the relative emotionality during 
the tone. The second suppression ratio was 
the rate of plate press during the posttone 
period divided by the rate of plate press 
during the pretone period, and serves as an 
index of the relative emotionality during the 
posttone period. These ratios were the 
entries employed in all subsequent statistical 
analyses of concurrent emotionality. 
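
The two suppression ratios can be illustrated with a short computation. The following is only a sketch under stated assumptions: the function and parameter names are hypothetical, the numbers in the usage note are invented for illustration, and the handling of a zero pretone rate (returning no ratio) is an assumption, since the report does not say how such trials were treated.

```python
def rate(presses, duration_sec):
    """Plate presses per second over an interval of the given length."""
    if duration_sec <= 0:
        raise ValueError("duration must be positive")
    return presses / duration_sec


def suppression_ratios(pretone_presses, tone_presses, bar_latency_sec,
                       posttone_presses, period_sec=60.0):
    """Return (tone ratio, posttone ratio) for one trial of one S.

    Measure d: pretone rate over the 60-sec. pretone period.
    Measure e: rate during tone, using the bar-press latency as tone duration.
    Measure f: posttone rate over the 60-sec. posttone period.
    """
    pretone_rate = rate(pretone_presses, period_sec)       # Measure d
    tone_rate = rate(tone_presses, bar_latency_sec)        # Measure e
    posttone_rate = rate(posttone_presses, period_sec)     # Measure f
    if pretone_rate == 0:
        # Assumption: the ratio is undefined when no pretone presses occurred.
        return None, None
    return tone_rate / pretone_rate, posttone_rate / pretone_rate
```

For a hypothetical trial with 42 pretone presses, 2 presses during a tone ended by a bar press at 6 sec., and 21 posttone presses, the call suppression_ratios(42, 2, 6.0, 21) gives a tone ratio of about 0.48 and a posttone ratio of about 0.5. A ratio below one indicates suppression relative to the pretone period.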


RESULTS 


Figure 1 shows the session-by- 
session growth in avoidance, and also 
the changes in relative emotionality 
which develop concurrently. The top 
section of this figure shows the per- 
centage of shocks that were avoided 
on each session, the percentage of 
times one or more bar presses occurred 
in the pretone period, and the median 
latency of the bar press during tone. 

As seen in Fig. 1, during the final 
session of tone adaptation, (“AD” in 
the figure) Ss seldom pressed the bar 
during either the tone or the pretone 
period. Since three sessions of tone 
adaptation had preceded this one, it 
is clear that, prior to its pairing with 
shock, the tone exhibited little, if any, 
control over the bar press. This 
result suggests that the tone itself 
was not intrinsically aversive, and 
that tone offset was not intrinsically 
reinforcing. Avoidance conditioning 
was instituted on Session 1. From 
Session 1 on, if the bar was pressed 
during the 60-sec. warning period, it 
terminated the tone and permitted S 
to avoid shock. If, however, a bar 
press during tone had a latency of 
more than 60 sec., it also occurred 
during shock and hence represented an 
escape response. Although both the 
shock and the tone were programed to 
terminate after 10 sec., no S ever 
permitted shock to remain on for 
longer than 2 sec. During the first 
session of avoidance conditioning the
median latency of escape was 0.79 sec. 
and the latency did not systematically 
change as sessions progressed. This 
result is in no way surprising since Ss 
had previously received extended 
training on escape. 
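
The distinction between avoidance and escape responses follows directly from the bar-press latency. A minimal sketch is given below, assuming a 60-sec. warning period followed by 10 sec. of programed shock as described above; the function name, the labels, and the treatment of a latency of exactly 60 sec. as an escape are hypothetical choices made for illustration.

```python
def classify_trial(bar_latency_sec, warning_sec=60.0, shock_sec=10.0):
    """Classify one trial from the latency of the bar press during tone.

    bar_latency_sec is None when no bar press occurred during the tone.
    """
    tone_sec = warning_sec + shock_sec  # the tone was programed for 70 sec.
    if bar_latency_sec is None or bar_latency_sec >= tone_sec:
        return "no response"   # tone and shock ran their programed course
    if bar_latency_sec < warning_sec:
        return "avoidance"     # tone terminated before shock onset
    return "escape"            # bar press occurred during shock
```

Under these assumptions a latency of 6 sec. is scored as an avoidance, and a latency of 60.79 sec. (a bar press 0.79 sec. after shock onset) is scored as an escape.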

As seen in Fig. 1, the tendency to 
avoid increased with each session 
until by Session 6, 99% of the shocks 
were avoided. The tendency to bar 
press in the pretone period (interval 
responses) increased during the first 
three sessions, but with continued 
training gradually declined. The 
latency of the avoidance response 
simultaneously decreased, until by the 
end of Session 6, it had stabilized at 
about 6 sec. 

A series of t tests for related meas-
ures was conducted on the frequencies
of interval vs. avoidance responses 
during each of the first four sessions. 
The values of t (df = 11 for each) 
were 2.87, 4.01, 2.43, and 3.72 for 
Sessions 1 through 4, respectively. 
Since each of these values is significant 
at the .05 level (for a two-tailed test), 
it is clear that the three functions 
which appear in the top section of Fig. 
1 represent the development of a well- 
discriminated avoidance behavior. 

The bottom sections of Fig. 1 show 
the several indices of emotionality 
that were obtained on each session 
during the acquisition. The solid line 
in the middle section of Fig. 1 shows 
the median suppression ratio during 
tone for trials on which an avoidance 
response occurred. It can be seen 
that the tone caused essentially no 
suppression during the final adapta- 
tion session but that with the intro- 
duction of shock, it rapidly developed 
the capacity to suppress ongoing plate 
presses. 

The dashed line in the middle sec- 
tion of Fig. 1 shows the median sup- 
pression ratio, during tone, on those 
trials during which S failed to avoid.
Since, as sessions progressed, the 
number of these nonavoidance trials 
decreased rapidly, the data for Ses- 
sions 4 and 5 have been combined. 
No data are shown beyond Session 5 
because, in this period, the number of 
nonavoidance trials was too small to 
yield reliable measures of suppression. 

The suppression ratios obtained on 
avoidance trials provide an initial test 
of the dual process position, since a 
theoretical interpretation of the dis- 
criminated avoidance which occurred 
from Session 1 on (Fig. 1) must as- 
sume that at least during acquisition, 
a CER to the warning stimulus also 
occurred. 

If, on a given trial, there were no 
systematic differences between the 
rate of plate press in the pretone 
period and the rate during tone, then 
during a given session, the number of 
suppression ratios above one should 
equal the number below one, i.e., the 
median suppression ratio would be 
one. If, as predicted by theory, the 
tone consistently evoked a CER, the 
median suppression ratio should be 
less than one. Sign tests conducted 
on the suppression ratios during tone, 
on avoidance trials, provided support 
for a dual process interpretation since 
each value of z, for Sessions 1 through 
12, was greater than 4.0 (P < .01 for 
a two-tailed test, in each case). 
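
The logic of the sign test on suppression ratios can be sketched as follows. The report gives only the resulting values of z; the continuity-corrected normal approximation used here, the discarding of ratios exactly equal to one, and the function name are all assumptions made for illustration.

```python
import math


def sign_test_z(ratios, null_value=1.0):
    """Return (z, n) for a sign test of suppression ratios against null_value.

    Ratios equal to null_value are dropped as ties (an assumption). z is the
    continuity-corrected normal approximation to the binomial with p = .5.
    """
    below = sum(1 for r in ratios if r < null_value)
    above = sum(1 for r in ratios if r > null_value)
    n = below + above
    if n == 0:
        raise ValueError("no untied observations")
    # (|k - n/2| - 0.5) / (sqrt(n) / 2) simplifies to (|below - above| - 1) / sqrt(n)
    z = max(0.0, (abs(below - above) - 1) / math.sqrt(n))
    return z, n
```

With a hypothetical set of 20 untied ratios of which 18 fall below one, z is 15 divided by the square root of 20, or about 3.35. The more consistently the ratios fall below one, the larger z becomes.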

There is, however, a question of the 
degree to which the low value of these 
ratios reflects the cessation in posi- 
tively reinforced responding which 
must occur when S leaves the plate 
and executes the bar press. Two 
sources of information were used to 
assess this question. First, a series of 
sign tests were conducted on the sup- 
pression ratios, during tone, for non- 
avoidance trials. In Sessions 1, 2, 
and 3, as well as in Sessions 4 and 5 
combined, the values of z were all 
greater than 2.82 (P < .01 in each
case). Thus, even on trials which 
were unconfounded by the occurrence 
of the avoidance response, the tone 
generated a substantial degree of sup- 
pression. Secondly, Ss were observed 
throughout acquisition. In general, 
behavior in the presence of the tone 
appeared to involve considerable emo- 
tionality. Such plate presses as oc- 
curred, were performed in a tentative 
manner and the instrumental avoi- 
dance response involved an extremely 
slow sequence of movements, even on 
trials with latency as short as 6 sec. 
Both behaviors stood in sharp con-
trast to the quick energetic movements
which typified the plate press during 
the pretone period and the instrumen- 
tal bar press during shock. Finally, 
it may be noted that even if the sup- 
pression ratios on avoidance trials 
were adjusted for the maximum time 
necessary to execute the bar press 
(approximately .79 sec. as estimated 
from the latency of the bar press 
during shock), only a small increase 
would occur and the general configura- 
tion of the data would be unaltered. 

A second assertion derived from 
dual process theory, is that during 
acquisition the tendency to avoid will 
be directly related to the CER mag- 
nitude. As seen in Fig. 1, suppres- 
sion, during tone, was consistently 
greater on avoidance trials than on 
nonavoidance trials. 

A Mann-Whitney test was used to 
assess the reliability of these data. 
For Sessions 1 and 2, the differences 
seen in the middle section of Fig. 1
were highly significant: z = 3.15 in 
Session 1 and z = 4.02 in Session 2 
(P < .01 in both cases). The values 
of z in Session 3 and in Sessions 4 and 5 
combined, while in the direction pre- 
dicted by theory, only tended toward 
significance at the .05 level (z = 1.79 
in Session 3 and z = 1.92 in Sessions 
4 and 5 combined). 


Despite the failure to attain sig- 
nificance for the later sessions, the 
general pattern of these results sup- 
ports the proposition that early in 
acquisition, the probability of the 
avoidance response is directly related 
to the suppressing capacities of the 
tone. 

The bottom section of Fig. 1 shows 
the course of emotionality in the post- 
tone period on avoidance trials and on 
nonavoidance trials. A third asser- 
tion of dual process theory is that 
reinforcement for successful avoidance 
consists of a reduction in conditioned 
emotionality. It can be seen that, 
during the first few sessions, even 
when avoidance responses occurred, 
positively reinforced behavior tended 
to be suppressed during the posttone 
period. This finding raises the ques- 
tion of whether successful avoidance 
responses were actually accompanied 
by a reduction in emotionality and if 
so, at what point did this effect begin? 
Sign tests were also used to assess this 
question. However, in those tests, 
the paired items were the two sup- 
pression ratios (tone and posttone) 
from a given animal on a given trial 
where shock was avoided. The tests 
revealed that in Session 1, suppression 
during the posttone period was not 
reliably different from suppression 
during the tone; z = 1.85 (P > .05). 
From Session 2 on, however, with the 
occurrence of an avoidance response, 
the level of relative suppression under- 
went a statistically significant de- 
crease. Each of the values of z, for 
Sessions 2 through 12 was greater than 
3.74 (P < .01 for a two-tailed test, 
in each case). Thus, if reinforcement 
consists of a reduction in emotionality, 
these data suggest that the avoidance 
response was reinforced only from the 
second session on. 

Figure 2 shows the percentage of 
avoidance responses per block of five
trials and serves to illustrate the 
changes in performance that occurred 
within sessions. The within-session 
changes in emotionality are not shown, 
because when based on samples of 
only five trials, the random fluctua- 
tions of the several indices were of 
such magnitude as to obscure any 
underlying trends. 

As seen in Fig. 2, during the final 
session of tone adaptation there were 
no systematic changes in the tendency 
to press the bar during the tone. 
With the introduction of shock at the 
end of tone, however, a performance 
emerged in which, during the initial 
five sessions, the tendency to avoid 
increased markedly within each ses- 
sion and decreased (to a lesser extent) 
in the 48-hr. interval between sessions. 
After Session 5, the performance had 
reached a stage in which very few 
shocks were received and such changes 
as occurred within and between ses- 
sions were small and unreliable. 


DISCUSSION


Interpretation of these results must 
recognize that the suppression ratio is, at 
best, an index of relative emotionality. 
It reflects the magnitude of a change in 
emotionality, but it provides no informa- 
tion about the absolute level of emo- 
tionality just prior to the change (i.e., 
the emotionality during the pretone 
period). Although none of the experi- 
mental measures in the present study 
were geared to provide an accurate 
assessment of the absolute level of 
emotionality, the data on the rate of 
plate presses during the pretone period 
are relevant to this question. During the
final session of tone adaptation, the 
median pretone rate was 46 responses per 
min. During Session 1, when shocks
occurred frequently, the median rate fell 
to 25 responses per min. However, as 
sessions progressed (and shock frequency 
decreased) the median pretone rate in- 
creased with each session, until by 
Session 7, it had reached 44 responses per 


Fig. 2. Percentages of avoidance responses
per block of five trials throughout the course
of acquisition. (AD refers to the final session
of tone adaptation.)


min. Thereafter, the median pretone 
rate remained relatively constant, never 
falling below 40 responses per min., nor 
rising above 50 responses per min.
From these data, it may be hypothesized 
that with the introduction of shock, the 
absolute level of emotional reactivity 
increased greatly, but that as sessions 
progressed, it slowly declined. Ap- 
parently, the obtained suppression ratios 
(represented in Fig. 1) reflect changes in 
emotionality over and above a moving 
baseline of generalized emotional re- 
activity. In this respect, it should be 
noted that a number of investigators 
have shown that some form of generalized 
emotional reactivity is an important 
variable in conditioning (Spence, 1958; 
Spence, Farber, & Taylor, 1954). More- 
over, the results of a previous study by 
Hoffman, Fleshler, and Chorney (1961) 
cast additional light on this issue. In 
that study, it was found that even after 
extensive training, certain rats would fail 
to avoid on the early trials of each ses- 
sion, but would achieve a high level of 
performance by the end of the session. 
The results of that study indicated that 
the occurrence of shock was the critical 
factor in this warm-up-like phenomenon 
and for this reason suggested that 
warm-up in avoidance reflects a motiva- 
tional process. Apparently, as shocks 
occur, their emotional aftereffects persist 
and summate to produce a state of emo- 
tional reactivity which facilitates 
avoidance. 

In the present study, warm-up was 
exhibited during acquisition (Fig. 2) and 
in the absence of evidence to the con- 
trary, it is reasonable to assume that this
feature of the performance also reflects 
the action of the lingering emotional 
aftereffects of aversive stimulation. Al- 
though existing dual process theories 
have not formally treated this particular 
process, Spence (1956) in dealing with 
classical conditioning, suggests that “. . . 
the drive level operating at the time of 
the conditioned anticipatory response is 
a function of the residual effects of the 
internal response (rₑ) to the noxious
stimulus of the preceding trials. That is,
such emotional responses are assumed to 
have a relatively persisting effect that 
extends well beyond the range of tem- 
poral intervals usually employed in 
conditioning experiments . . .” (p. 186).
It is clear that a motivational process, 
such as Spence describes, can be readily 
incorporated within existing dual process 
theory and hence that the occurrence of 
warm-up is consistent with a dual process 
interpretation of avoidance. 


SUMMARY 


In order to examine the interplay between 
instrumental and emotional behavior during 
the acquisition of a discriminated avoidance 
response, rats were trained to press a bar to 
avoid shock while they were concurrently 
engaged in pressing a plate for food. The 
course of emotionality was tracked by assess- 
ing the several levels of suppression of ongoing 
plate presses during each of the various phases 
of the acquisition process. The results re- 
vealed a complex relationship between the 
level of performance on avoidance and the 
several concurrent indices of emotionality. 
In general, the results support the dual process 
hypothesis that conditioned emotionality con- 
trolled by the warning signal provides motiva- 
tion for the avoidance response, while a 
decline in emotionality (with the offset of the 
signal) reinforces the response. The results 
also suggest that the lingering motivational 
aftereffects of aversive stimulation play an
important role in the early phases of ac- 
quisition. 


REFERENCES 


Black, A. H. The extinction of avoidance
responses under curare. Unpublished doc-
toral dissertation, Harvard University,
1956.

Brady, J. V., & Hunt, H. An experimental
approach to the analysis of emotional be-
havior. J. Psychol., 1955, 40, 313-325.

Estes, W. K., & Skinner, B. F. Some
quantitative properties of anxiety. J. exp.
Psychol., 1941, 29, 390-400.

Hoffman, H. S., Fleshler, M., & Chorney,
H. Discriminated bar-press avoidance.
J. exp. Anal. Behav., 1961, 4, 309-316.

Hull, C. L. Principles of behavior. New
York: Appleton-Century, 1943.

Miller, N. E. Learnable drives and rewards.
In S. S. Stevens (Ed.), Handbook of experi-
mental psychology. New York: Wiley,
1951.

Mowrer, O. H. Learning theory and per-
sonality dynamics. New York: Ronald,
1950.

Solomon, R. L., & Brush, E. S. Experi-
mentally derived conceptions of anxiety
and aversion. In M. R. Jones (Ed.),
Nebraska symposium on motivation: 1956.
Lincoln: Univer. Nebraska Press, 1956.
Pp. 212-305.

Solomon, R. L., & Wynne, L. C. Traumatic
avoidance learning: Acquisition in normal
dogs. Psychol. Monogr., 1953, 67 (19,
Whole No. 354).

Spence, K. W. Behavior theory and condi-
tioning. New Haven: Yale Univer. Press,
1956.

Spence, K. W. A theory of emotionally
based drive (D) and its relation to perform-
ance in simple learning situations. Amer.
Psychologist, 1958, 13, 131-141.

Spence, K. W., Farber, I. E., & Taylor, J-
The relation of electric shock and anxiety
to level of performance in eyelid condition-
ing. J. exp. Psychol., 1954, 48, 404-408.

(Received August 14, 1961) 


Journal of Experimental Psychology 
1962, Vol. 64, No. 3, 295-299


ON THE RELATIONS AMONG SOME FACTORS THAT CON-
TRIBUTE TO ESTIMATES OF VERTICALITY


C. R. CURRAN ax H. L. LANE 
University of Michigan 


Recent research has identified some 
factors that effect nonveridical percep- 
tion of verticality (Asch & Witkin, 
1948a, 1948b; Wapner, Werner, & 
Chandler, 1951; Wapner, Werner, & 
Morant, 1951; Werner, Wapner, & 
Chandler, 1951; Witkin, 1949, 1950, 
1952; Witkin & Asch, 1948a, 1948b). 
Through the experimental manipula- 
tion of visual context, body tilt and 
body support, among other variables, 
visual, proprioceptive, labyrinthine, 
and tactile cues have all been im- 
plicated as contributors to erroneous 
estimates of the upright. Further 
progress in the analysis of perception 
of the upright would seem to require 
at least the following three steps: 


Isolation of the sensory events involved.— 
This could be achieved through experi- 
mental control, requiring manipulation 
of variables one at a time, or through 
partialing out the effects of these vari- 
ables, requiring multidimensional experi- 
mental design, or by both techniques. 
Earlier studies have often confounded 
the effects of sensory events in several 
modalities when investigating the rela- 
tion between a relatively complex experi- 
mental operation and perception of the 
upright. For example, the procedure of 
tilting S has been the most widely used 
experimental operation in research on 
perception of the upright, yet it may 
produce concurrent changes in visual, 
proprioceptive, labyrinthine, and tactile 
stimulation. Werner, Wapner, and 
Chandler (1951) have interpreted the 
effect of body tilt on estimates of the 
upright as evidence for their sensory- 
tonic field theory of perception, which 
suggests that the degree of muscular 
involvement plays an important role in 


determining these judgments. This in- 
ference must remain tentative, however, 
until the cluster of sensory changes 
effected by body tilt is experimentally 
analyzed. A similar case may be made 
for analyzing the complex effects of 
changes in the visual field. Witkin and 
Asch (1948b) have established that “the 
effect of the visual field upon the per- 
ceived upright tends to be stronger and 
more consistent the more richly articu- 
lated the field” (p. 782). However, 
richness of articulation is undoubtedly a 
multidimensional affair. 

Use of independent variables measured 
on ratio scales.—Recent studies of percep- 
tion of the upright have typically em- 
ployed dependent variables measured on 
ratio scales (e.g., the angle between a 
rod called vertical by S and the true 
vertical) and one or more experimental 
treatments defined by nominal or ordinal 
scales (e.g., “tilted standing” and “tilted 
sitting”). Clusters of variables, such as 
support, lend themselves to nominal 
scaling; fractionation of these clusters 
should point to stimulus variables defined 
in terms of physical dimensions and thus 
measured (typically) on ratio scales. 
When both the dependent and independ- 
ent variables are measured on ratio 
scales the predictive power of the findings 
is greatly enhanced, because the ratio 
scale contains the interval scale within 
itself (as well as the ordinal and nominal 
scales) (Stevens, 1960). 

Quantitative analysis of the effects of 
variables and their interactions in multi- 
dimensional experimental design.—If a 
set of variables affecting perception of 
the upright has been isolated, if each can 
be measured on a ratio or, at least, 
interval scale, and if several levels or 
values of each variable are incorporated 
in an experimental design, then it is 
possible to quantify their relative effects 
and their interaction effects in determin- 
ing perception of the upright and to 
obtain the predictive power we desire. 


Fig. 1. Arrangement of apparatus for 
measurement of nonveridical perception of the 
upright. 


The research to be reported repre- 
sents a modest attempt to assess the 
effects of incorporating these three 
methodological improvements in an 
investigation of perception of the 
upright. The study describes the 
relation between changes in visual, 
proprioceptive, and labyrinthine stim- 
ulation and the degree of tilt of a rod, 
when it is reported to be vertical by 
the observer. 


METHOD 


The major experimental variables were: 
the illumination of the visual field (10%, 107, 
or 10% ft-c), the degree of body tilt (10° or 
30°), and the counterbalancing weight (equi- 
librium [W], W plus 6 lb., and W plus 12 lb.). 
Parametric variables were: direction of tilt 
(left or right) and starting position of the rod 
(left or right). 

Twelve male, naive undergraduates were 
Ss in sessions lasting from 60 to 90 min. 
All Ss had normal, uncorrected vision. Their 
heights ranged from 5 ft. 7 in. to 6 ft. 2 in., 
their weights from 131 to 179 lb. Earlier 
research has reported wide individual differ- 
ences in judgments of the upright under 
distorting conditions (Witkin, 1949) but con- 
siderable consistency of judgment within 
individuals. A four-way experimental design 
was therefore employed, comprising the 
three major variables (illumination, tilt, and 
counterbalancing weight) and a “blocking 
variable” (Ss). Direction of tilt and starting 
position of the rod were confounded with Ss, 
so that the four levels of the blocking variable 
were: right tilt, right rod (N = 3), left tilt, 
right rod (N = 3), etc. Each S gave 3 
judgments under each of 18 combinations of 
the levels of illumination, tilt, and weight, 
presented in counterbalanced order. 
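
The size of this design can be checked with a few lines of arithmetic. The sketch below is illustrative only: the level labels are placeholders, and the assumption that a "cell" is one block-by-treatment combination is an inference, adopted because it reproduces the 576 within-cells degrees of freedom reported in Table 1.

```python
from itertools import product

# Placeholder labels for the three illuminances; only the count matters here.
ILLUMINATION_LEVELS = ("high", "medium", "low")
TILT_DEGREES = (10, 30)
COUNTERWEIGHTS = ("W", "W + 6 lb", "W + 12 lb")

N_SUBJECTS = 12      # twelve Ss
N_BLOCKS = 4         # tilt direction x rod starting position, 3 Ss per block
REPLICATIONS = 3     # judgments per S per treatment combination


def design_summary():
    """Count treatment combinations, judgments, and within-cells df.

    Assumption: a cell is one block x treatment combination, so the
    within-cells df is (total judgments) - (number of cells).
    """
    combinations = list(product(ILLUMINATION_LEVELS, TILT_DEGREES, COUNTERWEIGHTS))
    judgments = N_SUBJECTS * len(combinations) * REPLICATIONS
    cells = N_BLOCKS * len(combinations)
    return {
        "treatment_combinations": len(combinations),
        "judgments": judgments,
        "cells": cells,
        "within_cells_df": judgments - cells,
    }


if __name__ == "__main__":
    print(design_summary())
```

Under these assumptions the design yields 18 treatment combinations and 648 judgments in all.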

Figure 1 is a schematic representation of 
the apparatus. A triangle (T), with base 
angles of 60° and 80°, was constructed from 
2 × 4 in. boards and served as a tilt reference 
for E. The E aligned the median plane of S 
with the appropriate arm of the tilt reference 
by visual inspection before each trial. With 
S standing erect, the light source, 4 ft. behind 
him, and the center of the stimulus field, 4 ft. 
in front, were at eye level and in approxi- 
mately the same vertical plane. The S stood 
on a pedestal (P) that was held firmly in place 
on the floor. The surface of the pedestal 
slanted upward at an angle that was set equal 
to the angle of tilt, so that S was perpendicular 
to the surface when aligned with the tilt 
reference. When S was tilted, he grasped the 
supporting rope in the hand contralateral to 
the direction of tilt. The rope passed over a 
pulley suspended from the ceiling (18 ft. high) 
and terminated in a bucket of cement, selected 
by E. 

This cement weight was selected in the 
following way. At 10° tilt, a weight, $W_{10}$, 
was determined that was within .5 lb. of 
3% of S's weight. At 30° tilt, $W_{30}$ = 25% 
of S's weight, ±.5 lb. Under either tilt con- 
dition, the levels of the counterbalancing 
variable were then W, W + 6 lb., and W + 12 
lb. The value of W under each condition of 
tilt approximated the weight that would just 
balance S. The moments of force around the 
pedestal are approximated by: $M(h_0 \sin T) 
= W h_a$, where M = weight of S, $h_0$ = dis- 
tance from the fulcrum to S's center of 
gravity, T = angle of tilt from the vertical, 
W = counterbalancing weight, and $h_a$ = dis- 
tance from the fulcrum to S's arm. 
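
The moment equation above can be solved for the counterbalancing weight, W = M h0 sin T / ha. The following sketch does only that. The function names and any lever-arm values passed to them are hypothetical, since the paper reports no measured distances.

```python
import math


def balancing_weight(body_weight_lb, h0, ha, tilt_deg):
    """Solve M * h0 * sin(T) = W * ha for W.

    h0 and ha must be in the same unit (the unit cancels); the values a
    caller supplies are illustrative, not measurements from the study.
    """
    if ha <= 0:
        raise ValueError("ha must be positive")
    return body_weight_lb * h0 * math.sin(math.radians(tilt_deg)) / ha


def counterweight_levels(equilibrium_weight_lb):
    """Return the three levels used in the design: W, W + 6 lb, W + 12 lb."""
    return (
        equilibrium_weight_lb,
        equilibrium_weight_lb + 6.0,
        equilibrium_weight_lb + 12.0,
    )


if __name__ == "__main__":
    # Hypothetical S: 150 lb, with ha twice h0.
    w30 = balancing_weight(150.0, h0=3.0, ha=6.0, tilt_deg=30.0)
    print(counterweight_levels(w30))
```

Because only the ratio of the two distances enters the formula, a single assumed ratio is enough to explore how the equilibrium weight changes with tilt.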

Under all experimental conditions, S 
viewed the stimulus field monocularly 
through a reduction tube (3 in. long, 2 in. in 
diameter) attached to a pair of goggles. In 
this manner, the visual field of the eye contra- 
lateral to the direction of tilt was restricted 
to the stimulus field, while vision in the 
homolateral eye was blocked. 

The field had an illuminance of approxi- 
mately 107 ft-c with unfiltered illumination 
from a source whose color temperature was 
approximately 1800°K (34 v. applied to a 
500-w. lantern slide projector). Wratten 
neutral density filters were inserted to produce 
illuminances of 107% and 10~ ft-c giving the 
three levels of the illumination variable. The 
stimulus field consisted of a piece of white 
poster board, 60 X 40 in., with a luminous 
reflectance of approximately 0.83. A piece of 
linear graph paper, 8 X 8 in., with blue lines 
heavy-ruled at 1-in. intervals and light-ruled 
at .1-in. intervals, was mounted on the front 
of the poster board and aligned with the true 
vertical by means of a plumb line. (The 
luminous reflectance of the graph paper was 
approximately 0.77; at an illuminance of 10~ 
ft-c no S reported seeing the rulings while 
at $10^{-2}$ ft-c the paper and its rulings were 
visible to all Ss.) Behind the poster board, 
a small motor rotated a shaft that punctured 
the board at its center and protruded 4 in. 
A brass rod (reflectance 0.40) 12 in. long, $ in. 
diameter, was mounted at right angles to this 
shaft in front of the stimulus field for ob- 
servation by S, and a 26-in. rod was mounted 
in parallel behind the poster board for ob- 
servation by E. Twenty-four inches above 
the pivotal point of the rod at the center of the 
field, E could read the point of intersection 
of the rod with a horizontal line, ticked at 
Lin. intervals; he could therefore read the 
position of the rod to an accuracy of about 
0.5°. An ac motor with reduction gears 
swept the rod from its starting position, 
50° + 10° to the left or to the right of the 
vertical, toward the upright at a constant 
rate of 1.5° per sec. When S, who was ob- 
serving the position of the rod under a given 
condition of tilt, weight, and illumination, 
judged it to be vertical, he said “stop” aloud, 
at which time E pressed a normally closed 
switch to stop the motor. He then read the 
rotary position of the rod, using a pencil light 
to read the horizontal scale. Following a 
trial, S was instructed to close both eyes while 
E changed tilt, weight, illumination, and 
position, in accordance with the protocol. 


Fig. 2. The effect of counterbalancing 
weight, illumination, and tilt (10°) on percep- 
tion of the upright. (Panel title: 10 degrees 
tilt. Axes: error in estimating upright, in 
degrees; counterweight, in lb.) 


Fig. 3. The effect of counterbalancing 
weight, illumination, and tilt (30°) on percep- 
tion of the upright. (Panel title: 30 degrees 
tilt. Axes: error in estimating upright, in 
degrees; counterweight, in lb.) 
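
The scale reading described in the apparatus paragraph converts to an angle by simple trigonometry: E reads where the rear rod crosses a horizontal line 24 in. above the pivot. The sketch below shows one way to make that conversion, together with the sweep time implied by the 1.5° per sec. rate. The function names are hypothetical, and the sign convention (positive offsets to one side) is an assumption.

```python
import math

SCALE_HEIGHT_IN = 24.0      # horizontal scale is 24 in. above the pivot
SWEEP_RATE_DEG_PER_S = 1.5  # constant motor rate reported in the text


def rod_angle_deg(offset_in):
    """Angle of the rod from the true vertical, in degrees.

    offset_in is the horizontal distance (in.) from the point directly
    above the pivot to the point where the rod crosses the scale.
    """
    return math.degrees(math.atan2(offset_in, SCALE_HEIGHT_IN))


def sweep_time_s(start_angle_deg):
    """Seconds for the rod to travel from its start to the true vertical."""
    return abs(start_angle_deg) / SWEEP_RATE_DEG_PER_S


if __name__ == "__main__":
    print(rod_angle_deg(1.0))   # angle for a 1-in. offset
    print(sweep_time_s(50.0))   # time from a 50-degree start
```

Near the vertical the relation is almost linear, so a fixed tick spacing corresponds to a nearly constant angular step.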


RESULTS AND DISCUSSION 


Figures 2 and 3 summarize the find- 
ings of this experiment by depicting 
the effect of the weight and illumina- 
tion variables on estimates of the 
upright at each of the two levels of the 
tilt variable. Examination of these 
figures reveals the relative contribu- 
tion of these three variables as well as 
their interaction effects. Clearly, the 
major variable is the level of illumina- 
tion: the error solid grows rapidly in 
volume as the level of illumination is 
decreased at 10° tilt and even more 
rapidly at 30° tilt; this comparison 
reveals an Illumination × Tilt inter- 
action. At relatively high levels of 
illumination the counterbalancing 
weight has little or no effect at 10° 
tilt and only a slight effect at 30° tilt. 
As illumination is decreased, however, 
the counterbalancing weight plays an 
increasing role in determining non- 
veridical perception of the upright. 
Comparison of Fig. 2 and 3 reveals a 
Weight × Tilt as well as a Weight 
× Illumination interaction, and a 
Weight × Tilt × Illumination inter- 
action. The solid obtained at 30° tilt 
is appreciably larger in all cells than 
that obtained at 10° tilt, showing the 
net effect of the tilt variable. As just 
indicated, there are also obvious Tilt 
× Weight and Tilt × Illumination 
interactions. 


TABLE 1 

ANALYSIS OF VARIANCE OF ESTIMATES 
OF THE UPRIGHT 

Source                  df      F 
Illumination (I)         2      [illegible] 
Tilt (T)                 1      [illegible] 
Counterweight (C)        2      56.74** 
I × C                    4      52.34** 
C × T                    2      4.52* 
I × T                    2      3.20* 
I × C × T                4      4.33** 
Within cells (MS)      576      (0.85) 

*P < .05. 
**P < .01. 

Table 1 presents an analysis of 
variance of the estimates of verticality. 
As anticipated, the variance attribu- 
table to replications within Ss is rela- 
tively small. All three main effects 
and their first- and second-order inter- 
actions are significant at the .05 level 
or beyond. 


The present findings support Witkin’s 
(1949) analysis of the relative contribu- 
tion of visual as opposed to somesthetic 
cues in the determination of perception 
of the upright. These findings show, 
furthermore, that relatively few visual 
cues to the vertical can yield extremely 
accurate estimates of the vertical, even 
under marginal visibility, with the body 
tilted and delicately poised. It appears 
that the visual field need not be “richly 
articulated” to permit accurate estimates 
of the upright under distorting condi- 
tions. However, removal of these few 
visual cues by approximately halving the 
brightness of the field yielded almost a 
hundredfold increase in error in per- 
ceiving the upright. 

When visual cues were minimized, at 
the lowest illuminance, and propriocep- 
tive and cutaneous cues were minimized 
in the condition of equal moments around 
the pedestal, the degree of tilt was ob- 
served to have a considerable effect on 
the magnitude of the error in perceiving 
the upright. Inference from these find- 
ings suggests that, in the absence of 
visual and somesthetic cues to the up- 
right, other cues to the static position of 
the body are available; perhaps the 
utricular otoliths, which are thought to 
play a role in static positional adjust- 
ments of the body (Geldard, 1953, p. 262) 
are the source of this stimulation. 

When the condition of equilibrium is 
displaced through the addition of count- 
erbalancing weights, greater effort is 
required on the part of S to maintain his 
balance and his alignment with the tilt 
reference. As the counterbalancing 
weight is increased, the magnitude of the 
error in judging the vertical is increased. 
This finding may be related to the 
qualitative prediction of Werner, Wapner, 
and Chandler (1951): “Within the frame- 
work of the sensory-tonic field theory of 
perception, the degree of muscular in- 
volvement is expected to be an important 
variable” (p. 346). The results of the 
present study also confirm the observa- 
tion of these authors that the apparent 
vertical is shifted to the side opposite the 
direction of the body tilt. It will be 
remembered that the direction of tilt was 
confounded with the starting position of 
the rod and with Ss and incorporated 
into the experimental design as a four- 
level blocking variable. A posteriori 
comparison of the weighted means of the 
right rod, right tilt and left rod, left tilt 
Ss with the weighted means of the right 
rod, left tilt and left rod, right tilt Ss 
showed that homolateral tilt and rod 
starting positions produce a greater 
magnitude of error than contralateral 
positions of these variables (Scheffé's 
method; α = .05). 

It is noteworthy that all interaction 
effects are large and significant. Al- 
though these interaction effects have not 
been demonstrated previously in the 
analysis of perception of the upright, this 
seems a plausible way for a perceptual 
process to be controlled by the relevant 
variables. As indicated earlier, this type 
of analysis is facilitated through the use 
of multidimensional experiments with 
unidimensional variables sampled at 
several levels and measured on ratio 
scales. 


SUMMARY 


Several variables that have been shown to 
influence the perception of the upright were 
incorporated in a multidimensional design to 
permit analysis of their several effects and 
interactions. Minimal visual cues had a 
dramatic effect in reducing nonveridical 
perception of the vertical. Distortion of 
body tilt and balance produced effects of 
lesser magnitude. All the first- and second- 
order interactions of these variables had large 
and significant effects on perception of the 
upright. 


REFERENCES 


Asch, S. E., & Witkin, H. A. Studies in 
space orientation: I. Perception of the up- 
right with displaced visual field. J. exp. 
Psychol., 1948, 38, 325-337. (a) 

Asch, S. E., & Witkin, H. A. Studies in 
space orientation: II. Perception of the 
upright with displaced visual fields and 
body tilted. J. exp. Psychol., 1948, 38, 
455-477. (b) 

Geldard, F. A. The human senses. New 
York: Wiley, 1953. 

Stevens, S. S. Ratio scales, partition scales, 
and confusion scales. In H. Gulliksen and 
S. Messick (Eds.), Psychological scaling: 
Theory and application. New York: Wiley, 
1960. 

Wapner, S., Werner, H., & Chandler, K. 
Experiments on the sensory-tonic field 
theory of perception: I. Effect of extraneous 
stimulation on the visual perception of 
verticality. J. exp. Psychol., 1951, 42, 
341-345. 

Wapner, S., Werner, H., & Morant, R. B. 
Experiments on the sensory-tonic field 
theory of perception: III. Effect of body 
rotation on the visual perception of ver- 
ticality. J. exp. Psychol., 1951, 42, 351- 
357. 

Werner, H., Wapner, S., & Chandler, K. 
Experiments on the sensory-tonic field 
theory of perception: II. Effect of sup- 
ported and unsupported tilt of the body on 
visual perception of verticality. J. exp. 
Psychol., 1951, 42, 346-350. 

Witkin, H. A. Perception of body position 
and of the position of the visual field. 
Psychol. Monogr., 1949, 63(7, Whole No. 
302). 

Witkin, H. A. Perception of the upright 
when the direction of the force acting on 
the body is changed. J. exp. Psychol., 
1950, 40, 93-106. 

Witkin, H. A. Further studies of perception 
of the upright when the direction of the 
force acting on the body is changed. J. 
exp. Psychol., 1952, 43, 9-20. 

Witkin, H. A., & Asch, S. E. Studies in 
space orientation: III. Perception of the 
upright in the absence of a visual field. J. 
exp. Psychol., 1948, 38, 603-614. (a) 

Witkin, H. A., & Asch, S. E. Studies in 
space orientation: IV. Further experiments 
on the perception of the upright with dis- 
placed visual fields. J. exp. Psychol., 
1948, 38, 762-782. (b) 


(Received August 15, 1961) 


Journal of Experimental Psychology 
1962, Vol. 64, No. 3, 300-310 


THE PERSPECTIVE ILLUSION: PERCEIVED SIZE AND 
DISTANCE IN FIELDS VARYING IN SUG- 
GESTED DEPTH, IN CHILDREN 
AND ADULTS¹ 


JOACHIM F. WOHLWILL 


Clark University 


Since the Renaissance it has been 
generally known that perspective 
drawings can convey a strong impres- 
sion of depth in two-dimensional stim- 
ulus fields; curiously, however, this 
phenomenon has received little sys- 
tematic attention on the part of 
psychologists. To be sure, Gibson, 
in his treatment of space perception, 
and of pictorial perception generally, 
has repeatedly pointed to the im- 
portant role of perspective (Gibson, 
1950, 1954, 1960); yet, apart from a 
largely exploratory study by Smith, 
Smith, and Hubbard (1958), the 
actual effects of perspective on percep- 
tion have not been experimentally 
investigated. 

The present study is concerned with 
one aspect of this problem, viz. the 
extent to which stimulus fields con- 
structed according to the principles of 
perspective geometry will affect judg- 
ments of size and length within such 
a field. Thus, one might expect the 
height of an object located at the 
bottom (i.e., apparent front) of a 
perspective drawing to be perceived as 
smaller relative to a similar object 
located at the top (i.e., apparent rear) 
of the drawing. In fact, just such an 
essentially illusory effect is dramat- 
ically illustrated by Gibson (1950, 
p. 182), by means of a perspective 
drawing of a corridor in which several 
barrel-shaped objects are depicted: 
the rearmost barrel appears strikingly 
expanded in size in comparison with 
the front one. More systematic in- 
vestigation of these effects, and es- 
pecially of their variation as a function 
of the characteristics of the stimulus 
fields responsible for the suggestion of 
depth, should provide a clearer picture 
of this illusion, its magnitude and its 
determinants. Such a study should 
furthermore be of direct relevance to 
Gibson’s (1950) general theory of 
space perception, which emphasizes 
the information to depth contained in 
the gradients of texture-density, etc. 
present in any two-dimensional pro- 
jection of a three-dimensional stimulus 
field, be the projection retinal, photo- 
graphic, or in the form of a perspective 
drawing. 


¹ This investigation was supported by a 
grant from the National Science Foundation, 
G-16031. The author wishes to thank 
Lawrence Houle, Principal of the Columbus 
Park School in Worcester, Massachusetts, and 
the teachers of this school for their coopera- 
tion in supplying subjects and facilities for the 
experimental work at the grade levels. The 
helpful comments of Keith Smith and Ben- 
jamin White of the MIT Lincoln Laboratories, 
regarding the informational analysis pre- 
sented at the end of this paper, are also 
gratefully acknowledged. Finally, Peter 
Schiller provided valuable assistance in the 
design and construction of the apparatus. 

The aim of this study is thus to 
investigate experimentally the effects 
of perspective drawings on the percep- 
tion of relative linear extent in the 
plane of the drawing, with reference 
both to the perceived size of objects 
in this plane and the perceived dis- 
tance between points in the plane. 
The principal variable manipulated in 
the study is the nature of the “in- 
formation to depth” contained in the 
field: first, the greater the amount of 
this information, as expressed in the 
density of the texture of the field, the 
greater should be the distorting effect 
of perspective; second, the introduc- 
tion of redundancy into the field, 
expressed in terms of the patterning 
of the elements of texture, so as to 
enhance linear perspective, should 
likewise increase the effect. The 
effect should be maximal, finally, in a 
field portraying directly the geometri- 
cal relationships involved in a per- 
spective transformation. 

An additional variable of consider- 
able interest in this domain of percep- 
tion is that of the age of Ss. There is 
considerable evidence that spatial 
relationships generally exert relatively 
little influence on the perception of 
young children (cf. Wohlwill, 1960), 
so that one might postulate that the 
distorting effect of perspective is 
absent, or at least rather small in 
magnitude in early childhood, and will 
increase with age. Indeed, Glasser 
(1944) has claimed that young chil- 
dren rarely perceive a perspective 
illusion in a figure very similar to that 
of Gibson referred to above, and 
accordingly attributes the effect to the 
role of experience; however, no data 
are adduced in support of this state- 
ment, nor is the age of Ss to which it is 
intended to apply further specified. 


Fig. 1. Perspective drawings representing 
four of the stimulus fields utilized in the 
study, with texture-density (left vs. right 
figures) and randomness (top vs. bottom 
figures) as variables. 


METHOD 


Stimuli.—Six different stimulus fields, 
drawn on sheets of drafting paper 23 X 29 in. 
in size, were employed in this study. The 
shape of all of these fields was uniform, 
consisting of a trapezoid superimposed on a 
rectangle; the bases of the trapezoid were 
7% in. and 12 in. and its height 7 in., while the 
dimensions of the rectangle were 12 X 18 in. 
The intersection of the two sides of the 
trapezoid at a point 12 in. above its lower base 
defined the vanishing point used for con- 
structing the perspective drawings. 

These fields were filled with different per- 
spective drawings, all of which were based on 
a grid of 36 columns fanning out from the 
vanishing point and 62 rows spaced so as to 
produce the foreshortening of distance re- 
quired by the laws of perspective. The grid 
itself, with lines of uniform thickness drawn 
in India ink, made up the first panel (illus- 
trated in Fig. 2, below). Four of the remain- 
ing panels, shown in Fig. 1, were constructed 
by filling in selected cells from this grid in 
India ink, according to the following plan: 
(a) High density, nonrandom (Fig. 1, top 
left): Every third column was selected from 
the grid; within each the cells to be filled in 
were determined by a table of random num- 
bers, so as to yield an average proportion of 
3/5 of the cells filled in per column, or 1/5 of 
the total number of cells in the field. (b) Low 
density, nonrandom (Fig. 1, top right): From 
among those cells selected under a, a subset 
was chosen by means of a table of random 
numbers, consisting of 3/10 of the cells of a, or 
a density of .18 per column, i.e., .06 for the 
total field. (c) High density, random (Fig. 1, 
bottom left): From every column of the grid 
1/5 of the cells were randomly selected to be 
filled in. (d) Low density, random (Fig. 1, 
bottom right): From among the cells selected 
for c, a subset consisting of 3/10 of these cells, 
i.e., .06 of the cells of the total field, was 
chosen. 


Fig. 2. Apparatus, with grid panel 
displayed. 

It should be noted that the proportion of 
cells from the total grid that were filled in was 
the same for both high-density panels (.20), 
as well as for both low-density panels (.06). 
Further, it will be seen that the essential 
difference between the random and the non- 
random panels lies in the greater sense of 
linear perspective afforded by restricting the 
cells to a limited set of columns, in the case of 
the latter. 
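
The construction of the four filled panels can be imitated on an index grid to confirm that the stated proportions hang together. The sketch below is a rough illustration and not the author's procedure: it uses a pseudorandom generator in place of a table of random numbers, fills a fixed (rounded) number of cells per column, starts the "every third column" selection at an arbitrary column, and ignores the perspective foreshortening of the cells.

```python
import random

ROWS, COLS = 62, 36  # grid dimensions given in the text


def high_density_nonrandom(rng):
    """Fill about 3/5 of the cells in every third column."""
    per_column = round(0.6 * ROWS)
    return {(row, col)
            for col in range(0, COLS, 3)
            for row in rng.sample(range(ROWS), per_column)}


def high_density_random(rng):
    """Fill about 1/5 of the cells in every column."""
    per_column = round(0.2 * ROWS)
    return {(row, col)
            for col in range(COLS)
            for row in rng.sample(range(ROWS), per_column)}


def thin(filled, rng, fraction=0.3):
    """Keep a random subset (3/10 by default) of already-filled cells."""
    keep = round(fraction * len(filled))
    return set(rng.sample(sorted(filled), keep))


def density(filled):
    """Proportion of all grid cells that are filled."""
    return len(filled) / (ROWS * COLS)


if __name__ == "__main__":
    rng = random.Random(0)
    h_nr = high_density_nonrandom(rng)
    h_r = high_density_random(rng)
    for name, panel in (("H-NR", h_nr), ("L-NR", thin(h_nr, rng)),
                        ("H-R", h_r), ("L-R", thin(h_r, rng))):
        print(name, round(density(panel), 3))
```

Because 62 rows do not divide evenly by five, the simulated densities only approximate .20 and .06; the original panels were specified as average proportions.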

Finally, the sixth panel was a control field, 
which was entirely blank, except for the 
border of the field, drawn in in India ink as 
in the other panels. 

These six panels will henceforth be referred 
to as: G (Grid); H-NR (high-density, non- 
random); L-NR (low-density, nonrandom); 
H-R (high-density, random); L-R (low- 
density, random); and C (control). 

Apparatus.—The apparatus used for dis- 
playing the stimulus panels and for manip- 
ulating the size and distance variables is 
shown in Fig. 2. (Figure 2 also shows Panel 
G, as it was exposed in the apparatus.) The 
apparatus consisted essentially of a rectan- 
gular piece of plywood, 29.5 X 35 in., covered 
by a sheet of Plexiglas fitted into a frame 
which was attached by means of hinges to the 
side of the plywood base. The Plexiglas 
cover could thus be opened for the insertion 
and removal of the individual panels exposed 
behind it. 

For the distance judgments, two points 
were chosen in the stimulus field of the panels, 
defining the distance to be bisected by S. 
The points were located on an imaginary line 
passing through the vanishing point of the 
perspective drawings, one point towards the 
top edge, the other near the bottom. Screws 
were driven through two corresponding spots 
on the Plexiglas cover, directly superimposed 
on the two points in the stimulus field as it 
was exposed underneath the cover. 

An endless loop of fine nylon thread ran 
around a trapezoidal path marked by these 
two screws and two others near the left side 
of the Plexiglas cover and at the same heights 
as the former pair. This loop could be moved 
in either direction by pulling on a knot 
located along the vertical side, this movement 
causing a little red plasticine ball to travel up 
and down between the first two screws. A 
small indicator attached to the left vertical 
segment of the line, which moved along a 
ruler glued to the Plexiglas, enabled E to read 
off the height of the ball corresponding to the 
midpoint of the distance as perceived by S. 

For the size judgments, two rectangular 
cutouts were made in each panel, through 
which the stimulus objects—two light-blue 
rectangular sheets of metal—appeared. One 
of the cutouts was at the top of the panel, the 
other towards the bottom, their lower-left 
vertices being located along a line through 
the vanishing point, well to the right of the 
line involved in the distance judgments (cf. 
Fig. 2). 

The stimulus objects were attached to two 
wooden slides which could be moved up or 
down, along a line which was an extension of 
the diagonal of the rectangles, i.e., in an 
oblique direction relative to the field of the 
panel; movement of these slides thus caused 
the portion of the blue rectangle exposed 
through the cutout to vary in size, but with a 
constant ratio of height to width. Each slide 
was raised and lowered by turning a crank, 
to which it was connected by a set of pulleys. 

This arrangement permitted continuous 
variation of both the top and bottom stimulus 
objects, so that either could be used as a 
variable and the other as a standard. The 
size of the standard rectangle, in terms of the 
length of the diagonal, was 7.5 cm.; the 
diagonal of the variable rectangle varied from 
0 to 12 cm. In order to prevent Ss from 
utilizing the amount of white in the cutouts 
around the rectangles as a cue, the two cutouts 
were made unequal in size, the bottom one 
being 4.3 X 11.3 cm. (a size corresponding to 
the maximum size of the variable), whereas 
the top one measured 5.3 X 12.3 cm. 
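
Since the exposed rectangle keeps a constant ratio of height to width, its sides follow from its diagonal alone. The sketch below assumes that the rectangle's proportions equal those of the bottom cutout (4.3 × 11.3 cm., which the text says corresponds to the maximum size of the variable). Which side is the height is not stated in the text, so the function returns the short and long sides without naming them.

```python
import math

# Bottom cutout, taken here as the rectangle at its maximum size (cm).
MAX_SHORT_SIDE_CM = 4.3
MAX_LONG_SIDE_CM = 11.3
STANDARD_DIAGONAL_CM = 7.5


def exposed_rectangle(diagonal_cm):
    """Return (short_side, long_side) in cm for a given diagonal.

    Assumes the constant aspect ratio is that of the bottom cutout.
    """
    if diagonal_cm < 0:
        raise ValueError("diagonal must be non-negative")
    max_diagonal = math.hypot(MAX_SHORT_SIDE_CM, MAX_LONG_SIDE_CM)
    scale = diagonal_cm / max_diagonal
    return MAX_SHORT_SIDE_CM * scale, MAX_LONG_SIDE_CM * scale


if __name__ == "__main__":
    print(exposed_rectangle(STANDARD_DIAGONAL_CM))
    print(math.hypot(MAX_SHORT_SIDE_CM, MAX_LONG_SIDE_CM))  # close to 12 cm
```

The diagonal of the 4.3 × 11.3 cm. cutout works out to roughly 12.1 cm., which agrees with the stated 12-cm. upper limit of the variable.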

The whole apparatus was displayed in a 
vertical position to the S, by sliding it into a 
metal frame mounted vertically on a wooden 
base. 

Procedure.—The adult Ss were tested in a 
room illuminated only by overhead fluores- 
cent lighting. They were seated on a stool 
at a distance of 3 m. from the apparatus, so 
that their eye level was approximately even 
with the center of the stimulus panel. The S 
was told that the experiment concerned his 
ability to make judgments of size and distance 
relationships. Specifically, for the distance 
judgments he was told that E would make the 
little red dot travel upward along the line be- 
tween the two screws, and instructed to say 
“stop” when he thought the dot was exactly 
midway between the screws. This wording 
of the instructions was intended to foster a 
set for objective, rather than phenomenal 
judgments; however, E discouraged S from 
attempting to “figure out” where the mid- 
point was intellectually. The S was further 
informed that he would have an opportunity 
to correct any setting if he was dissatisfied 
with it. 

For the size judgments the instructions 
were similar, S being asked to say “stop” 
when the top (or bottom) rectangle was just 
equal in size to the bottom (or top) one. 

In his manipulation of the red dot and of 
the variable rectangle, E always faced away 
from S, so as to avoid giving him any in- 
voluntary facial cues that might influence his 
judgment. Movement of the stimuli was 
carried out at a fairly even rate, although for 
the variable size stimulus perfect smoothness 
was not realizable with the apparatus as 
constructed. 

Special procedures were used with the 
group of children from Grade 1, in order to 
ensure that they properly understood the 
instructions. These procedures consisted of 
a series of pretest judgments, involving, 
(a) marking the middle of two lines drawn on 
an 8½ × 11 in. sheet of paper, one line hori- 
zontal, the other oblique, (b) deciding when a 
bead which E moved along a string over the 
surface of a table, first parallel to the edge, 
then obliquely, was at the middle of the 
string, and (c) making a preliminary distance 
setting on the apparatus, without any panel 
underneath the Plexiglas cover. There were 
no instances in which any child gave an 
indication of failing to understand the in- 
structions, either through questions directed 
to E, or through markedly deviant judgments 
on the pretest. 

All of the children were routinely asked, 
“Is that it?” or “Is that where you want it?” 
after every judgment, and allowed to change 
it if they wished (as the adults had been also). 
This was done in an effort to minimize 
errors of anticipation. The children were 
further asked, after being presented with the 
H-R and H-NR panels, whether they could 
think of anything that might look like what 
they were seeing. 


TABLE 1 

MEAN DISTANCE SETTINGS (IN CM.) 

                              Stimulus Panels 
Groups         C       L-R     H-R     L-NR    H-NR    G       Combined Mean   σ (a) 
Grade 1        21.91   22.19   22.45   22.23   22.97   23.12   22.48           1.35 
Grade 4        22.00   22.71   22.59   22.85   22.96   23.02   22.69           1.32 
Grade 8        21.40   22.01   22.00   22.27   22.44   22.72   22.14           1.15 
Adults         21.06   21.08   21.20   21.19   21.43   21.80   21.30           0.86 
Combined M     21.59   22.00   22.06   22.13   22.45   22.67 
Combined σ (b) 1.42    1.81    1.78    1.82    1.87    [illegible] 

Note.—Values tabled represent bisections of a 41-cm. vertical distance. 
(a) SD of Ss’ mean scores, based on between-Ss error terms for each age level. 
(b) SD of scores for all Ss, based on residual error term calculated for each stimulus panel. 

Design.—For both distance and size judg- 
ments S made two judgments for each stim- 
ulus panel, one ascending, the other descend- 
ing. Size and distance judgments were al- 
ways made consecutively for any stimulus 
panel, before a new panel was exposed; half 
of the Ss always judged size first, the other 
half judged distance first. The six panels 
were presented in a Latin square design, in 
two different sequences, one being the reverse 
of the other. Regardless of the particular 
panels exposed, all Ss started with the ascend- 
ing judgment on the first, third, and fifth 
panels and with the descending judgment on 
the second, fourth, and sixth panels. Finally, 
on the size judgments half of the Ss were 
tested with the top stimulus as the standard, 
and the other half with the bottom stimulus 
as the standard.
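The counterbalancing described above can be sketched in code. This is only an illustration: the text does not say which Latin square was used, so a cyclic 6 × 6 square is assumed here, and the function names are hypothetical. The sketch builds the square, adds the reversed sequences, and attaches the fixed alternation of ascending and descending first judgments.

```python
PANELS = ["C", "L-R", "H-R", "L-NR", "H-NR", "G"]


def cyclic_latin_square(items):
    """Return a cyclic Latin square: row i is the list rotated by i places."""
    n = len(items)
    return [[items[(i + j) % n] for j in range(n)] for i in range(n)]


def presentation_orders(items):
    """Forward rows of the square plus their reversals (an assumed reading
    of 'two different sequences, one being the reverse of the other')."""
    forward = cyclic_latin_square(items)
    backward = [list(reversed(row)) for row in forward]
    return forward + backward


def first_direction(position):
    """Direction of the first judgment at a 1-based serial position:
    ascending on panels 1, 3, 5 and descending on panels 2, 4, 6."""
    return "ascending" if position % 2 == 1 else "descending"


orders = presentation_orders(PANELS)
# Schedule for one hypothetical S who receives the first order.
schedule = [(panel, first_direction(k + 1)) for k, panel in enumerate(orders[0])]
```

Because direction is tied to serial position rather than to the panel, each panel is met in both starting directions across the orders.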

Subjects.—There were four groups of 24 Ss 
each, representing samples of children from 
Grades 1, 4, and 8 and college-age adults. 
The mean ages of these four groups in years 
and months were 7:1, 9:10, 14:0, and (ap- 
proximately) 20:0, respectively. The school 
children all came from a lower-middle class 
grade school, and were thus not strictly com- 
parable in IQ and related variables to the 
adults, who were college undergraduates 
(mostly freshmen and sophomores) enrolled 
in an introductory psychology course. All 
Ss reported they had normal eyesight, either 
uncorrected or corrected. (For the youngest 
children, the pretest with the apparatus given 
before the experiment proper allowed E to 
satisfy himself of the adequacy of S's eye- 
sight.) 


RESULTS 
Distance


The distance between the two 
screws which was to be bisected was 
47.3 cm. The vertical component of 
this distance (on which the recorded 
distance data are based) was 41.0 cm. 
Thus settings of the red dot larger 
than 20.5, the objective midpoint, 
would indicate an influence of sug- 
gested depth. This influence is ap- 
parent in the judgments at all ages 
and for all stimulus conditions, including
Panel C, as the means in Table 1 show.
(Additional comparisons between the H and L panels
and between the NR and R panels are 
provided in Table 2.) 


TABLE 2
MEAN DIFFERENCES IN DISTANCE SETTINGS
(IN CM.) BETWEEN PANELS VARYING
IN DENSITY AND IN RANDOMNESS

Groups      H vs. L   NR vs. R
Grade 1       .50       .28
Grade 4       .00       .26
Grade 8       .08       .35
Adults        .18       .17
Combined      .19       .26

Note.—Positive values indicate greater magnitude of
illusion for first-listed panels.
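The entries of Table 2 can be reproduced from the panel means of Table 1. The sketch below assumes that each contrast is the difference between the averages of the two panels in each pair; the text does not spell out the computation, and the function names are illustrative only. It also expresses a setting as a deviation from the 20.5-cm. objective midpoint.

```python
# Panel means from Table 1 (cm.), keyed by group.
TABLE_1 = {
    "Grade 1":  {"C": 21.91, "L-R": 22.19, "H-R": 22.45, "L-NR": 22.23, "H-NR": 22.97, "G": 23.12},
    "Grade 4":  {"C": 22.00, "L-R": 22.71, "H-R": 22.59, "L-NR": 22.85, "H-NR": 22.96, "G": 23.02},
    "Grade 8":  {"C": 21.40, "L-R": 22.01, "H-R": 22.00, "L-NR": 22.27, "H-NR": 22.44, "G": 22.72},
    "Adults":   {"C": 21.06, "L-R": 21.08, "H-R": 21.20, "L-NR": 21.19, "H-NR": 21.43, "G": 21.80},
    "Combined": {"C": 21.59, "L-R": 22.00, "H-R": 22.06, "L-NR": 22.13, "H-NR": 22.45, "G": 22.67},
}

OBJECTIVE_MIDPOINT = 20.5  # half of the 41.0-cm. vertical distance


def density_contrast(means):
    """Mean of the two H panels minus mean of the two L panels."""
    return (means["H-R"] + means["H-NR"]) / 2 - (means["L-R"] + means["L-NR"]) / 2


def randomness_contrast(means):
    """Mean of the two NR panels minus mean of the two R panels."""
    return (means["L-NR"] + means["H-NR"]) / 2 - (means["L-R"] + means["H-R"]) / 2


def illusion_magnitude(setting):
    """Displacement of a setting beyond the objective midpoint, in cm."""
    return setting - OBJECTIVE_MIDPOINT


for group, means in TABLE_1.items():
    print(group, round(density_contrast(means), 3), round(randomness_contrast(means), 3))
```

Computed from the rounded means, the contrasts agree with the tabled values to within rounding. For all groups combined, the G panel setting of 22.67 lies about 2.17 cm. beyond the objective midpoint.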




As the analysis of variance of these
data is rather lengthy and complex,
it is presented here in abbreviated
form. It falls into four parts. The
first, involving all between-Ss effects 
(age, order, and their interaction), 
disclosed the variance due to age to be 
significant at better than the .01 level 
(F = 5.95, df = 3/88). The second, 
involving within-Ss effects summed 
over direction (stimuli, singly and in 
interaction with age and order), 
showed the variance due to the stim- 
ulus fields to be highly significant 
(F = 14.92, df = 5/440), but no sig- 
nificant interactions. The third part, 
involving within-Ss effects summed 
over stimulus panels (direction, singly
and in interaction with age and order)
showed a highly significant effect due
to ascending vs. descending direction
(F = 152.54, df = 1/88), as well as an
interaction between direction and age
significant at between the .05 and .01
levels (F = 3.05, df = 3/88). The
fourth part, finally, is comprised of 
within-Ss effects due to simple and 
higher-order interactions involving 
both the stimulus and the direction 
variable; here the simple interaction 
was significant at the .01 level 
(F = 3.09, df = 5/440). 

These results may now be sum- 
marized as follows: 

Stimulus fields.—The observed differences
between stimulus fields are in
good agreement with those postulated 
in our introduction: the effect of 
perspective was greatest for the G 
panel and least for the C panel (cf. 
Table 1); it was also greater for high- 
than for low-density fields, and greater 
for nonrandom than for random fields 
(cf. Table 2). Duncan's multiple range 
test (cf. Edwards, 1960, pp. 136ff.) 
indicates the following comparisons 
between means (for all groups combined)
to be significant at the .05 level:
G vs. all others except H-NR; H-NR
vs. H-R, L-R, and C; L-NR vs. C;
H-R vs. C; L-R vs. C. In addition,
orthogonal comparisons (Edwards,
1960, pp. 140 ff.) between the two H
vs. the two L fields, as well as between
the two NR vs. the two R fields, both 
show differences significant at better 
than the .01 level. 

Age.—The overall effect appears to 
decrease with age, except that Grade 4 
Ss showed slightly (but nonsignifi- 
cantly) higher mean values than 
Grade 1 Ss. An application of Dun- 
can’s multiple range test, however, 
shows that all three of the children’s 
groups are significantly differentiated 
from the adults, but not from one 
another. Although there was a sug- 
gestion that the first graders were 
somewhat more influenced by the 
density variable than the other groups 
(cf. Table 2), the interaction between 
age and stimulus panels was not 
significant. 

Direction.—A notable feature of the 
results was the finding of a very 
marked anticipation effect: for all age 
groups and stimuli combined, the 
mean ascending judgment was 21.44, 
while the mean descending judgment 
was 22.87. This effect itself inter- 
acted with age, being largest at the 
first- and fourth-grade levels, and 
considerably reduced at the eighth- 
grade and adult levels. There was 
also a significant interaction of this 
factor with the stimulus variable 
which was less consistent in nature. 

Order.—The order-of-judgment variable
(distance before vs. after size)
failed to account for any significant 
portion of the variance, either singly 
or in interaction. 


Size 

TABLE 3
MEAN SIZE MATCHES, IN CM. (POE = 7.5)

                       Stimulus Panels                      Combined
Groups        C      L-R    H-R    L-NR   H-NR   G       Mean    σᵃ
Grade 1       7.47   7.51   7.49   7.27   7.36   7.23    7.39    0.28
Grade 4       7.39   7.49   7.53   7.35   7.33   7.36    7.41    0.29
Grade 8       7.42   7.49   7.50   7.26   7.39   7.27    7.39    0.24
Adults        7.29   7.50   7.61   7.38   7.33   7.31    7.40    0.23
Combined      7.39   7.50   7.53   7.32   7.36   7.29
σᵇ            0.32   0.34   0.39   0.35   0.37   0.41

Note.—Values tabled represent height at top of field judged equal to a 7.5-cm. height at bottom (see text for details).
ᵃ See Footnote a, Table 1.
ᵇ See Footnote b, Table 1.

The size of the standard rectangle
(measured in terms of the length of
the diagonal) was 7.5 cm. Thus settings
of the variable smaller than 7.5,
when the standard was at the bottom
of the field, and larger than 7.5, when 
the standard was at the top of the 
field, would indicate an overestimation 
of the top stimulus, hence a perspec- 
tive effect. In order to make the 
measures for the two positions of the 
standard comparable to each other, 
the settings obtained with the stand- 
ard at the top were translated into 
scores which represented the size of 
the top rectangle perceptually equiva- 
lent to a 7.5-cm. rectangle at the 
bottom. This was accomplished by 
means of the formula s’/7.5 = 7.5/s, 
where s and s’ represent, respectively,
the match made at the bottom to the
7.5 standard and the transformed
score.
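The transformation can be written out as a short function. This is a minimal sketch of the formula s’/7.5 = 7.5/s as stated; the function and parameter names are illustrative, not the author's.

```python
STANDARD_CM = 7.5  # diagonal of the standard rectangle


def transformed_score(s, standard=STANDARD_CM):
    """Solve s'/standard = standard/s for s'.

    s is the setting of the bottom (variable) rectangle judged equal to the
    standard at the top; the result is the size of a top rectangle that would
    be perceptually equivalent to a standard-sized rectangle at the bottom.
    """
    if s <= 0:
        raise ValueError("a setting must be a positive length")
    return standard * standard / s


# A bottom setting larger than 7.5 (top stimulus overestimated) maps to a
# transformed score smaller than 7.5, the same direction as a direct match.
example = transformed_score(8.0)
```

The transformation leaves a veridical match unchanged (a setting of 7.5 maps to 7.5) and preserves the ratio between the top and bottom sizes, so that scores below 7.5 indicate the perspective effect for both positions of the standard.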


TABLE 4
MEAN DIFFERENCES IN SIZE MATCHES
(IN CM.) BETWEEN PANELS VARYING
IN DENSITY AND IN RANDOMNESS

Groups      H vs. L   NR vs. R
Grade 1       .04      -.18
Grade 4       .01      -.17
Grade 8       .07      [illegible]
Adults        .03      -.18
Combined      .04      [illegible]

Note.—Negative values indicate greater magnitude
of illusion for first-listed panels.


In order to simplify somewhat the 
analysis of variance (complicated even 
further, beyond the already rather un- 
wieldy one dealt with for the distance 
judgments, due to the addition of the 
variable of the position of standard) 
each S’s settings for the ascending and 
descending conditions for each stim- 
ulus panel were averaged. 

The means for each stimulus field 
at each age level are shown in Table 3, 
while Table 4 provides a comparison 
of the H and L panels and the NR and 
R panels. The analysis of variance of 
the data falls into two parts. The 
first, between-Ss portion (comprising 
the variables of age, order and position 
of standard, singly and in interaction) 
discloses no significant source of
variance. The second, within-Ss por- 
tion (comprising the stimulus vari- 
able, singly and in interaction with 
the others), shows a significant effect 
due to the stimulus fields (F = 7.66, 
df = 5/400, P <.01), but no sig- 
nificant interactions. 

Summarizing and at the same time 
elucidating these results, we find the 
following: 

Stimulus fields.—While this variable
had a significant effect on the size
judgments, the results were much less
consistent, and less closely in agreement
with expectations, than was the
case for the distance judgments (cf.
Table 3). Duncan's range test shows
that the means for both H-R and L-R 
conditions were significantly higher 
(at the .01 level) than those for the 
H-NR, L-NR, and G conditions. 
This was what had been anticipated, 
since for these judgments, the lower 
the score, the larger the apparent size 
of the top stimulus, and hence the 
greater the magnitude of the illusion.
The results for the C (Control) condi- 
tion are, however, decidedly out of 
line, since instead of yielding the high- 
est mean, this condition emerges as 
intermediate between the two random 
and the two nonrandom stimulus 
fields; in fact, the difference between 
H-R and C is significant at the .01
level (in the wrong direction)! Fur- 
thermore, while the two nonrandom- 
panel means clearly differ from the 
two random-panel means as expected, 
the two high-density means are both 
higher (though not significantly) than 
the two low-density means, whereas 
the opposite was anticipated (cf. 
Table 4). 

Age.—There appeared to be no 
consistent differences between the age 
groups, nor did this variable interact 
with the stimulus variable. 

Direction.—Although this variable 
did not enter into the analysis of 
variance, inspection of the data shows 
again a very marked anticipation 
effect. This effect also tended to 
decrease with age, but the main 
difference appeared to be between the 
adults on the one hand and the three 
groups of children on the other. 

Position of standard.—The mean for 
all judgments made with the variable 
at the top was 7.44, as against a mean 
of 7.36 for the transformed settings 
made with the standard at the top. 
While this difference was not sta- 
tistically significant (F = 2.21, 


df = 1/80), it is in the direction of the 
“error of the standard,” involving an 
overestimation of the standard stim- 
ulus per se, which has been encount- 
ered previously in the literature 
(Gardner & Long, 1960; Piaget & 
Lambercier, 1943). The interaction 
of this effect with age likewise was 
nonsignificant; it might be noted 
nevertheless that the effect appeared 
to be most marked at the fourth grade, 
while the eighth graders and adults 
failed to exhibit any trace of it. 

Order.—The order-of-judgment variable
again failed to affect the judgments
significantly.


Verbalizations Given to the H-R and 
H-NR Stimulus Fields 


Of the 72 school children asked for 
an interpretation of the H-R and H- 
NR fields, 43 responded to the H-R 
and 48 to the H-NR panels. Of these 
91 responses, 78 clearly referred to a 
scene seen in depth, the most common 
response being a floor, or some variant 
thereof (e.g., a highway). Interest- 
ingly enough, 9 responses made refer- 
ence to a vertical plane (skyscraper, 
building), of which 6 came from the 
eighth graders. Otherwise no notable 
age differences were found; even the 
failure-to-respond rate was essentially 
the same for all groups. Nor were 
there any very consistent differences 
between the two panels: the H-NR 
panel elicited substantially more depth 
responses on the part of the eighth 
graders, but the two younger groups 
gave slightly more depth responses to 
the H-R panel. 

All in all, the results suggest that 
perspective drawings of this type are 
effective in conveying a sense of phe- 
nomenal depth even to the youngest 
of the Ss included in this study—a 
conclusion which is in line with the 
observed effect of these drawings on 
the perceptual judgments. 




Discussion 


The discussion of the results of this 
investigation will focus on four separate 
points: 

1. With respect to the distance judg- 
ments, the effects of the perspective 
drawings conformed very neatly to our 
expectations, showing that information 
to depth contained in the field, as mani- 
pulated in this study, represents a major 
determinant of the perceptual distortions 
produced in such drawings. The size 
judgments, however, yielded much less 
conclusive results, and did not appear to 
be consistently related to the stimulus- 
information variable. 

In attempting to account for these 
somewhat discrepant results, it may be 
helpful to examine the way in which S 
would in fact let the stimulus field affect 
him in making his judgment. It is ap- 
parent that for the distance judgments 
the background forms an essential part 
of S’s field of attention, since his task, 
to bisect the distance between the two 
screws, requires him to scan back and 
forth along a considerable portion of the 
stimulus field. In the case of the size 
judgments, on the other hand, it would 
be much easier for S to ignore the field 
separating the two stimulus objects to be 
compared. The incisions made into the 
field around the objects, and the location 
of the upper stimulus standing on the top 
border of the field may also have con- 
tributed to the perceptual isolation of the 
stimuli from the background fields, which 
would have mitigated their influence. 
Admittedly, it is difficult to account on 
this basis for the significant differences 
between some of the stimulus fields that 
were found, unless one assumes that, for 
whatever reason, the intrusion of the 
field into the size comparisons was es- 
pecially slight for the two randomly 
textured panels. 

2. The results with respect to the age 
variable can only be regarded as incon- 
clusive. Certainly there was little sug- 
gestion of an increase in the effects of 
perspective with age, except for the very 
slight and nonsignificant increase from 
the first to the fourth grade. It seems,
therefore, that if this illusion is a product
of learning, whether in the sense of 
associative or assumptive processes or in 
the sense of increasing experience in re- 
sponding to spatial relationships within 
a field, such learning must run its course 
fairly early in life. The significant de- 
crease in the effects of perspective on the 
distance judgments for the adults, on the 
other hand, appears to reflect a more
active attempt on the part of these Ss to 
counteract this illusion, of which most of 
them were well aware. In this connec- 
tion it should be noted that the difference 
between adults and children may have 
been at least in part a matter of intel- 
lectual level, rather than age, the adults 
clearly representing a more select group 
in this respect than the children. 

3. An interesting finding was that for 
both distance and size judgments the 
control panel itself gave rise to a constant 
error in the direction of the perspective 
illusion. A variety of factors could have 
contributed to this result: the border of 
the field present in the control panel 
might by itself have conveyed a sense of 
depth; perseverative effects from other 
panels previously exposed might have 
led S to perceive the control panel in 
terms of depth; relative height in the 
field may be interpreted as depth, even 
in the absence of other cues, just as in 
other studies (Smith, 1958; Weinstein, 
1957) this variable has been shown to 
represent an effective cue by itself, lead- 
ing to a certain amount of constancy in 
size judgments made from photographs. 

4. Finally, since one might consider 
this study as representing a beginning to- 
wards an informational approach to the 
study of space perception, a brief analysis 
of the possibilities as well as the limita- 
tions of such an approach in this area 
appears to be warranted.


Given the selection procedures employed
in the construction of the stimulus fields, the
specification of their formal informational
content (independent of their role in suggesting
depth) is relatively straightforward. As
regards the density variable, the two random
fields, for which p (the probability of a cell's
being filled in) was, respectively, .06 and .20,
contain an average of .34 and .72 bits per cell,
respectively. Similarly the two NR panels,
for which p, for the restricted set of columns 
from which cells were selected, was .18 and 
.60, contain, respectively, .69 and .97 bits per 
cell for each filled column, or .23 and .32 
bits per cell for the total field. 

The regularity variable, on the other hand, 
can best be expressed in terms of the redun- 
dancy, relative to the corresponding R panels, 
introduced by the selection procedures. 
Thus, taking the R panels as a baseline, L-NR 
would show a redundancy of 32% (1— 
.23/.34), while that of H-NR would be 55% 
(1—.32/.72). It is interesting to note that 
for the distance judgments the effect of 
redundancy was indeed more pronounced for 
the high-density than for the low-density 
panels (cf. Table 1). 
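The informational values quoted above can be checked with the standard binary entropy formula. The sketch below is an illustration under two assumptions: that "bits per cell" means the Shannon entropy of a cell filled with probability p, and that one third of the columns were eligible in the NR panels (inferred from the ratios .06/.18 and .20/.60, not stated in the text). The function names are hypothetical.

```python
import math


def binary_entropy(p):
    """Shannon information, in bits, of a cell filled with probability p."""
    if p in (0.0, 1.0):
        return 0.0
    return -(p * math.log2(p) + (1 - p) * math.log2(1 - p))


def redundancy(h_restricted, h_baseline):
    """Redundancy of a restricted field relative to its random baseline."""
    return 1 - h_restricted / h_baseline


FILLED_COLUMN_FRACTION = 1 / 3  # assumption inferred from the p values

# Random (R) panels: every cell is filled with probability p.
h_low_r, h_high_r = binary_entropy(0.06), binary_entropy(0.20)

# Nonrandom (NR) panels: entropy within eligible columns, then averaged
# over the whole field (cells in ineligible columns carry no information).
h_low_nr = binary_entropy(0.18) * FILLED_COLUMN_FRACTION
h_high_nr = binary_entropy(0.60) * FILLED_COLUMN_FRACTION

# Redundancy computed, as in the text, from the rounded tabled values.
red_low = redundancy(0.23, 0.34)
red_high = redundancy(0.32, 0.72)
```

The computed entropies agree with the printed figures to within about .01 bit (for example, roughly .33 rather than .34 for p = .06), a difference that may reflect rounding or the tables used at the time.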

At first sight, however, the foregoing anal- 
ysis may appear paradoxical; increasing in- 
formational content by increasing the number 
of cells increases the perspective effect; at the 
same time, increasing redundancy, which is 
equivalent to a decrease in informational 
content relative to the baseline, also increases 
the effect. In order to resolve this paradox, 
it is essential to distinguish between the in- 
formational content of the stimulus array, 
which represents essentially the structural 
complexity of the array, and the “information 
to depth” provided by such an array. In 
order to elucidate this point, let us examine 
more closely the actual role played by the information
variable as manipulated here. Increasing
the number of cells filled in, or more
particularly the proportion of cells, up to .5,
increases the effect, by providing the observer
with a greater amount of visual information
as regards the progressive deformation of the
field from the bottom to the top of the panel.²
Increasing the regularity of the arrangement
of the cells by restricting them to a limited
set of columns, while decreasing the information,
or introducing redundancy, likewise
heightens the effect due to the “overdetermination”
of the location of the vanishing
point, towards which all the columns converge.
These considerations underlie our perhaps
somewhat capricious distinction between
“amount” and “redundancy” of information,
to deal with the density variable and the
regularity variable, respectively, as well as our
use of the admittedly imprecise term “information
to depth.”

² For values of p > .5, the figure-ground
relations would simply be reversed: the information
would in effect be concentrated in
the white cells, so that the illusion would
decrease again until, for p = 1.0, the situation
would be formally equivalent to that for
p = 0.0, i.e., to the control panel. This
situation is of course faithfully reflected in the
concomitant changes in H, the measure of the
informational content.


Speaking more generally, it becomes 
apparent that any application of an in- 
formational model in this area cannot 
proceed blindly, but must consider the 
particular ways in which informational 
content is varied, and their bearing on 
the perceptual situation and on S's task. 
Particularly is this true with respect to 
the role of redundancy, which can prob- 
ably operate in very different directions, 
depending on the way it is imposed. For
instance, if the restrictions imposed in the 
selection of cells for the NR fields had 
involved the rows rather than the 
columns, the formal amount of redun- 
dancy thus introduced would have been 
the same, yet the effects of this redun- 
dancy on the judgments in this task 
would probably have been less pro- 
nounced. 

Nevertheless, judiciously applied, the 
concepts and principles of information 
theory should prove rewarding in carry- 
ing the study of space perception beyond 
the investigation of isolated cues to the 
kind of parametric and systematic anal- 
ysis of the information in the stimulus 
array on which Gibson has repeatedly 
insisted. A limited example of this 
point from our experiment is the treat- 
ment of texture density and linear 
perspective in terms of the more general 
concept of informational content, which 
the present approach has made possible. 


SUMMARY 


This experiment investigated the effects of 
different stimulus fields, made up of per- 
spective drawings varying in the amount and 
regularity of the elements subjected to per- 
spective deformation, on the judgment of 
relative size and distance in the plane of the 
drawings. Four age groups, varying from 
first-grade children to college-age adults, were 
used as Ss. The results obtained confirmed 
the prediction that, as the amount and re- 
dundancy of information to depth contained 
in the field increased, the apparent midpoint
of a segment of a line through the vanishing
point would be displaced towards the top of
the field. The results for the size judgments
were less consistent. The only age difference
appeared on the distance judgments, where
adults exhibited smaller effects than children 
between 7 and 14 yr. of age. The implica- 
tions of the experiment for an informational 
approach to the study of space perception 
are briefly considered. 


REFERENCES 


EDWARDS, A. L. Experimental design in
psychological research. (Rev. ed.) New York:
Rinehart, 1960.

GARDNER, R. W., & LONG, R. I. Errors of
the standard and illusion effects with the
inverted-T. Percept. mot. Skills, 1960, 10,
47-54.

GIBSON, J. J. The perception of the visual
world. Boston: Houghton Mifflin, 1950.

GIBSON, J. J. A theory of pictorial perception.
Audiovis. commun. Rev., 1954, 1,
3-23.

GIBSON, J. J. Pictures, perspective, and
perception. Daedalus, 1960, 89, 216-227.

GLASSER, O. Optical illusions. In O.
Glasser (Ed.), Medical physics. Chicago:
Yearbook, 1944. Pp. 824-827.

PIAGET, J., & LAMBERCIER, M. Recherches
sur le développement des perceptions: III.
Le problème de la comparaison visuelle en
profondeur (constance de la grandeur) et
l'erreur systématique de l'étalon. Arch.
Psychol., Genève, 1943, 29, 253-308.

SMITH, O. W. Judgments of size and distance
in photographs. Amer. J. Psychol., 1958,
71, 529-538.

SMITH, O. W., SMITH, P. C., & HUBBARD, D.
Perceived distance as a function of the
method of representing perspective. Amer.
J. Psychol., 1958, 71, 662-674.

WEINSTEIN, S. The perception of depth in
the absence of the texture-gradient. Amer.
J. Psychol., 1957, 70, 611-615.

WOHLWILL, J. F. Developmental studies of
perception. Psychol. Bull., 1960, 57, 249-
288.


(Received August 17, 1961) 




Journal of Experimental Psychology
1962, Vol. 64, No. 3, 311-313


STIMULUS GENERALIZATION AS A FUNCTION OF UCS
INTENSITY IN EYELID CONDITIONING

JOHN J. PORTER ¹


State University of Iowa 


One of the implications of the be- 
havior theory developed by Hull and 
Spence (Hull, 1943, 1952; Spence, 
1956), based on studies of classical 
and instrumental conditioning, is that 
generalization performance curves for 
groups at different drive levels will 
tend to converge. The derivation 
of this interaction is as follows: 
$E_s = H \times D_s$, $E_w = H \times D_w$; then,
$E_s - E_w = H(D_s - D_w)$, where
$D_s$ = strong drive and $D_w$ = weak
drive while $E_s$ and $E_w$ are the corresponding
excitatory potentials. If $H$
is the habit strength developed to the
training stimulus (S) and $H'$ that
developed to the generalized stimulus
(S′), then the above derivation implies
that a greater difference in excitatory
potential will be expected at S than
at S′, since $H'$ is smaller than $H$.
Thus an interaction is implied.
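The convergence argument can be illustrated numerically. The sketch below assumes only the multiplicative relation E = H × D given above; the habit and drive values are hypothetical and chosen purely for illustration, with the generalized habit strength set lower than the trained one.

```python
def excitatory_potential(habit, drive):
    """Excitatory potential under the multiplicative rule E = H x D."""
    return habit * drive


def drive_difference(habit, strong_drive, weak_drive):
    """Difference in excitatory potential between strong and weak drive."""
    return (excitatory_potential(habit, strong_drive)
            - excitatory_potential(habit, weak_drive))


# Hypothetical values, not taken from the experiment.
H_TRAINED = 0.8       # habit strength at the training stimulus S
H_GENERALIZED = 0.4   # smaller habit strength at the generalized stimulus S'
D_STRONG, D_WEAK = 2.0, 1.0

gap_at_s = drive_difference(H_TRAINED, D_STRONG, D_WEAK)
gap_at_s_prime = drive_difference(H_GENERALIZED, D_STRONG, D_WEAK)
```

With these values the gap between drive groups is twice as large at S as at S′, so the two generalization curves draw together as the test stimulus departs from the training stimulus. Any positive values with the generalized habit below the trained habit give the same ordering.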


Studies of stimulus generalization under 
two levels of motivation in rats by Brown 
(1942) and in humans by Rosenbaum (1953) 
yielded converging generalization curves 
when these authors’ time measures were later 
transformed to speed measures. Newman 
(1955) studied stimulus generalization in the 
rat under different drive levels. Although 
Newman found evidence of convergence with 
both speed and extinction measures, the inter- 
action was not statistically significant. Both 
Jenkins, Pascal, and Walker (1958) and 
Thomas and King (1959) used pigeons to 
study generalization at different drive levels 
in the Skinner box. Jenkins et al. (1958) 
found divergent relative stimulus generaliza- 
tion gradients under conditions of constant 
drive differences while Thomas and King 
(1959) found that three of their four drive 
groups yielded converging stimulus general- 
ization gradients during extinction. 

¹ Now at the University of Wisconsin, Milwaukee.

The author is greatly indebted to Kenneth
W. Spence for advice and assistance throughout
the course of this investigation.


The present experiment measured 
the predicted performance conver- 
gence of high- and low-drive groups 
with either a 1500-cps or 400-cps tone 
after all Ss were trained at one drive 
level with the 1500-cps tone alone. 
An extinction test was used in order 
to avoid confounding generalization 
effects with differential reinforcement 
effects due to differences in puff 
intensities. 


METHOD 


Subjects.—The Ss were 160 women from a
course in introductory psychology at the
State University of Iowa. Forty-one women
were discarded for not meeting a condi- 
tioning criterion of more than 8 CRs 
and less than 36 CRs on Trials 41-80. This 
conditioning criterion was employed to avoid 
both ceiling and floor effects when Ss were 
shifted to either a higher or lower drive condi- 
tion during the extinction trials. Six women 
who gave CRs to the CS alone on test trials 
were discarded as were 7 others who gave 
50% or more responses that met the criterion 
of voluntary responses used in this laboratory 
(Spence & Ross, 1959). Four additional Ss 
were discarded due to E error and 2 due to 
equipment malfunction. The 100 remaining 
Ss were randomly assigned to four groups. 
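The subject accounting above can be verified with a few lines of arithmetic. The sketch is illustrative only; the dictionary labels and the function name are not the author's, and the criterion is read literally as more than 8 and fewer than 36 CRs on Trials 41-80.

```python
TESTED = 160

# Numbers of Ss discarded, by reason, as reported in the text.
DISCARDED = {
    "failed conditioning criterion": 41,
    "gave CRs to CS alone on test trials": 6,
    "50% or more voluntary-form responses": 7,
    "experimenter error": 4,
    "equipment malfunction": 2,
}

remaining = TESTED - sum(DISCARDED.values())
per_group = remaining // 4  # four groups of equal size


def meets_conditioning_criterion(crs_on_trials_41_80):
    """True if S gave more than 8 and fewer than 36 CRs on Trials 41-80."""
    return 8 < crs_on_trials_41_80 < 36
```

The 60 discards leave the 100 Ss reported, or 25 per group.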

Apparatus and recording method.—The Ss 
were seated in an adjustable dental chair in a 
sound shielded room which was illuminated 
by a shielded 7.5-w. bulb. The Ss’ room was 
separated from the recording and control 
room by a third intervening room. The 
equipment used to record eyelid responses was 
the same as that used in previous studies from 
the Iowa laboratory (cf. Spence & Taylor, 
1951). The CS employed during acquisition 
was the onset of a 1500-cps tone of 70 db. 
generated by a Hewlett-Packard audio oscil- 
lator and delivered by a 6-in. loudspeaker 
4 ft. behind S. During extinction the same 
1500-cps tone served as CS for two groups 
while the remaining two groups received a 
400-cps tone of like intensity. The acquisition
UCS was a 50-msec., .6-psi, air puff
delivered to the right eye through a .062-in. 
diameter orifice by a 110-v. ac solenoid valve. 
During extinction two groups received a 
.33-psi puff while the other two received a 
2.0-psi puff 2500 msec. after CS onset. A 
dimly illuminated 2.25-in. diameter circular 
milk-glass disk located 4 ft. in front of S 
served as a fixation point. 

Procedure.—Each S was instructed to
blink once to the ready signal and then to look 
at the disk until the tone went off. After the 
instructions had been read to each S she 
then received three presentations of the CS
alone and one presentation of the UCS alone. 
The intervals between the ready signal and 
the onset of the CS were 2, 3, or 4 sec. ran- 
domly varied. Intertrial intervals of 15, 20, 
or 25 sec., given according to a fixed schedule, 
and averaging 20 sec. were used. A CR was 
recorded whenever the record showed a 
deflection of 1 mm. or more in the interval 
200-500 msec. following CS onset. 
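The scoring rule for a CR can be stated as a simple predicate. This is a sketch of the criterion as described, with one assumption flagged in the code: the text does not say whether the ends of the 200-500 msec. interval are included, and they are treated as inclusive here. The function name is hypothetical.

```python
MIN_DEFLECTION_MM = 1.0
WINDOW_MSEC = (200, 500)  # measured from CS onset; endpoints assumed inclusive


def is_conditioned_response(deflection_mm, latency_msec):
    """Score a recorded eyelid deflection as a CR.

    A CR requires a deflection of at least 1 mm. beginning 200-500 msec.
    after CS onset.
    """
    start, end = WINDOW_MSEC
    return deflection_mm >= MIN_DEFLECTION_MM and start <= latency_msec <= end
```

Since the UCS follows the CS by 500 msec. on reinforced trials, the window closes at UCS onset, so that responses elicited by the puff itself are not scored as CRs.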

Experimental design.—An 80% partial
reinforcement schedule was used during train- 
ing in order to provide for greater resistance 
to extinction than is obtained with continuous 
reinforcement. The reinforced and non- 
reinforced trials followed a prearranged se- 
quence restricted by the provision that no 
more than 2 nonreinforced trials occurred in 
each block of 10 trials. On reinforced trials
the UCS onset followed the CS by 500 msec.;
on nonreinforced trials the UCS began 2500
msec. after the CS onset. McAllister (1953a,
1953b) has shown that little or no conditioning
occurs at 2500-msec. intervals, and that the
CR extinguishes when the interval is shifted 
from 500 msec. during acquisition to 2500 
msec. during extinction. On both reinforced 
and nonreinforced trials the CS extended 50 
msec. beyond the UCS onset and both CS 
and UCS terminated simultaneously. 
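The trial structure can be sketched as follows. The actual prearranged sequence is not reported, so a seeded random shuffle within blocks is used here as a stand-in; the function names are hypothetical. Note that with 80% reinforcement over 80 trials (16 nonreinforced trials) and at most 2 nonreinforced trials per block of 10, every block must contain exactly 2.

```python
import random


def reinforcement_schedule(n_trials=80, block_size=10,
                           nonreinforced_per_block=2, seed=0):
    """Return a list of booleans (True = reinforced trial).

    Each block holds exactly `nonreinforced_per_block` nonreinforced trials;
    their positions within the block are shuffled (an assumption standing in
    for the unreported prearranged sequence).
    """
    rng = random.Random(seed)
    schedule = []
    for _ in range(n_trials // block_size):
        block = ([True] * (block_size - nonreinforced_per_block)
                 + [False] * nonreinforced_per_block)
        rng.shuffle(block)
        schedule.extend(block)
    return schedule


def ucs_onset_msec(reinforced):
    """CS-UCS interval: 500 msec. when reinforced, 2500 msec. otherwise."""
    return 500 if reinforced else 2500


def cs_duration_msec(reinforced):
    """The CS extends 50 msec. beyond UCS onset on every trial."""
    return ucs_onset_msec(reinforced) + 50


training = reinforcement_schedule()
```

On this reading the CS lasts 550 msec. on reinforced trials and 2550 msec. on nonreinforced trials, ending together with the 50-msec. puff.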

Four groups of 25 Ss were conditioned and 
extinguished. All groups received identical 
(.6 psi, 1500 cps) 80% reinforcement condi- 
tions for 80 trials followed by 40 extinction 
trials under one of four conditions. An ex- 
tinction test was used in order to avoid con- 
founding of the results through further 
differential performance build-up under the 
new puff strengths of 2 psi and .33 psi. 
Group I received 40 extinction trials with a 
2-psi puff and the original 1500-cps tone. 
Group II differed from I only in receiving a
.33-psi puff during extinction. Group III
received 40 nonreinforced trials with a 2-psi 
puff and a 400-cps tone. Group IV differed 
from III only in receiving a .33-psi puff 
during extinction. 




RESULTS AND DISCUSSION 


Performance had stabilized at about 
60% responding at the point where 
extinction began. Inspection of the 
data and an analysis of variance over 
the last 10 acquisition trials (71-80) 
revealed no significant performance 
differences between the four acquisition
groups; therefore these four
groups were treated as one group. 

Three different response measures 
were used during extinction and three 
different blocks of trials were ana- 
lyzed. Extinction results were ana- 
lyzed in terms of total number of 
responses, percentage of responses, 
and superthreshold excitatory poten- 
tial (£z) as defined by Spence (1956). 
Each of the above three measures was
taken on Trials 1-10, 1-20, and
1-40.

Figure 1 presents the percentage of 
CRs made during Extinction Trials 
1-40. The same picture resulted 
when the other two response measures 
were used. Examination of Fig. 1 
revealed the predicted performance 
convergence during extinction for the 
high- and low-drive groups. A simi- 
lar graphical convergence was ob- 
tained on Trials 1-10 and 1-20. 
Thus there was a smaller difference
at the generalized stimulus value than
at the original stimulus between the
two drive groups. Statistically, such
a condition is represented by a significant
interaction between groups.
This interaction was significant
(P < .05, F = 4.35 for percentage)
over Trials 1-40 for all three performance
measures. However this interaction
was not significant (P < .20)
for Trials 1-10 and 1-20 for any of the
three performance measures.

FIG. 1. Mean percentage of conditioned
responses for Extinction Trials 1-40.


Given the interaction over Trials 1-40, 
the problem becomes one of accounting 
for the lack of a significant interaction 
over Trials 1-10 and 1-20 within the 
framework of the theory used here. As 
developed earlier, the theory would pre- 
dict the presence of the sought after 
interaction within, at the most, a few 
trials after the change to extinction con- 
ditions. Since it required 40 trials to 
obtain the predicted interaction during 
extinction it is necessary to examine the 
effect of inhibition (I) on the predicted 
interaction. In Hull-Spence theory I is
assumed to be a function of the number
of nonreinforced trials and is assumed to
subtract from the quantity (D × H).
If I is added to the two equations used
earlier to derive the difference equation
it may be seen that I cancels out of the
difference equation. This leaves H and
D, or H and D, as the differential factors
in this equation. It should be understood
that the cancellation of I in the
difference equation does not mean that 
I does not act on performance, but that 
it does not act differentially. Therefore 
the interaction predicted by the theory 
remains unchanged. 
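
The cancellation can be written out as a short sketch. It assumes the usual Hull-Spence form in which excitatory potential is drive times habit minus inhibition; the subscripts h and l (high and low drive) are labels introduced here for illustration, since the two earlier equations are not reproduced in this section.

```latex
\begin{align*}
E_h &= D_h \times H - I, \\
E_l &= D_l \times H - I, \\
E_h - E_l &= (D_h \times H - I) - (D_l \times H - I) = (D_h - D_l) \times H .
\end{align*}
```

Under this assumption both drive groups have had the same number of nonreinforced trials, so I takes the same value in both equations. It lowers each group's performance but drops out of their difference.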

Theoretical considerations aside, ex- 
amination of the range of responses over 
extinction trial blocks revealed decreas- 
ing variability within groups as later 
extinction trial blocks were examined. 
This finding suggests that the greater 
variability of the data during Trials 
1-10 and 1-20 may have masked the 
interaction effect. 


SUMMARY


This study investigated the interaction 
between drive level and original and general- 
ized stimulus conditions during extinction. 



One hundred Ss were first conditioned for 80 
trials to respond to a .6-psi air puff on an 80% 
reinforcement schedule with a CS of 1500 cps. 
The Ss were then divided into four groups 
which were extinguished with either a 1500- 
or 400-cps tone and a .33- or 2.0-psi air puff. 
On all nonreinforced trials during acquisition 
and extinction the UCS was presented, but 
2500 msec. after the CS. 

The results confirmed (P < .05 for Extinction
Trials 1-40) the hypothesized interaction
between drive level and original and
generalized stimulus conditions predicted by
Hull-Spence theory. The effect was not
significant over Trials 1-10 or 1-20.


REFERENCES 


Brown, J. S. The generalization of approach
responses as a function of stimulus
intensity and strength of motivation. J.
comp. Psychol., 1942, 33, 209-226.

Hull, C. L. Principles of behavior. New
York: Appleton-Century, 1943.

Hull, C. L. A behavior system. New
Haven: Yale Univer. Press, 1952.

Jenkins, W. O., Pascal, G. R., & Walker,
R. W. Deprivation and generalization.
J. exp. Psychol., 1958, 56, 274-277.

McAllister, W. R. The effect on eyelid
conditioning of shifting the CS-UCS interval.
J. exp. Psychol., 1953, 45, 423-428. (a)

McAllister, W. R. Eyelid conditioning as
a function of the CS-UCS interval. J. exp.
Psychol., 1953, 45, 417-422. (b)

Newman, J. R. Stimulus generalization of
an instrumental response as a function of
drive strength. Unpublished doctoral dissertation,
University of Illinois, 1955.

Rosenbaum, G. Stimulus generalization as a
function of level of experimentally induced
anxiety. J. exp. Psychol., 1953, 45, 35-43.

Spence, K. W. Behavior theory and conditioning.
New Haven: Yale Univer. Press,
1956.

Spence, K. W., & Ross, L. E. A methodological
study of the form and latency of
eyelid responses in conditioning. J. exp.
Psychol., 1959, 58, 376-381.

Spence, K. W., & Taylor, J. A. Anxiety
and strength of UCS as determiners of the
amount of eyelid conditioning. J. exp.
Psychol., 1951, 42, 183-188.

Thomas, D. R., & King, R. A. Stimulus
generalization as a function of level of
motivation. J. exp. Psychol., 1959, 57,
323-328.


(Received August 18, 1961) 


Journal of Experimental Psychology 
1962, Vol. 64, No. 3, 314-317


EXPERIMENTAL EXTINCTION AS A FUNCTION OF 
NUMBER OF REINFORCEMENTS¹


JAMES R. ISON 
University of Rochester 


Overlearning-reversal experiments 
by Reid (1953), Pubols (1956), and 
Capaldi and Stevenson (1957) in 
brightness discrimination and by Pu- 
bols (1956) and Ison and Birch (1961) 
in spatial discrimination have demon- 
strated facilitation of reversal learning 
after overlearning compared to con- 
trol groups reversed at criterion. A 
similar experiment by Birch, Ison, 
and Sperling (1960) in differential 
conditioning demonstrated more rapid 
extinction of the formerly positive 
response for the overlearning group
and it was concluded that, contrary to 
the results obtained in the free re- 
sponding lever press apparatus (Miles, 
1956; Perin, 1942; Williams, 1938), 
resistance to extinction of a running 
response is not a monotonically in- 
creasing function of the number of 
reinforced trials (N_r). This conclusion
was supported, in part, in a
runway experiment by North and
Stimmel (1960) which demonstrated
greater resistance to extinction in a
group given 45 reinforcements as compared
to groups given 90 or 135. The
purpose of the present experiment is
to provide further evidence on this
relationship in the straight runway
apparatus.


METHOD 


Subjects.—The Ss were 75 male hooded
rats, approximately 100 days old, obtained
from the colony maintained by the Psychology
Department of the State University of Iowa.
They were randomly assigned to six groups
of 12 or 13 Ss which received either 10, 20, 40,
60, 80, or 100 rewarded acquisition trials.

¹ This research was performed while the
author was a Rackham Postdoctoral Fellow at
the State University of Iowa. The author
wishes to thank K. W. Spence for his generous
assistance.

Apparatus.—A straight alley was housed 
within a two-unit, black-draped enclosure 
4 ft. high, 4 ft. wide, and 11} ft. long. A ply- 
wood panel 4 ft. from the end of the en- 
closure completely separated the runway 
section from the goal box except for a hole in 
the base through which passed the alleyway. 
The alleyway (covered with glass) was 4 in. 
high, 3} in. wide; the start box was 9 in. 
long, the runway 72 in., and the goal box 18 in. 
Guillotine retrace doors separated the start 
box and the goal box from the runway. The 
entire apparatus was painted flat black and 
was illuminated by three 75-w. Lumline
lights attached to the ceiling of the enclosure
directly over the goal box and at distances of
1 and 4 ft. from the start box. These bulbs
were screened to give an incident light of 
approximately 3 ft-c (range, 2.9 to 3.3) on 
the alleyway. Infrared photobeams per- 
mitted the measurement of running speed 
over two 1-ft. segments of the alleyway 
beginning 1 and 4 ft. from the start box. 
The apertures for the lights were } in. in 
diameter and covered with painted plastic,
thus providing little differential stimulation.
Mercury switches in the start-box and goal- 
box doors permitted the measurement of 
running speed over the entire alleyway. All 
times were recorded on electronic clocks and 
the operation of the timing circuits was silent. 

Procedure.—The Ss were allowed 10 gm. 
of food powder each day in wet mash, 
presented $ hr. after each daily treatment. 
They were placed on this schedule and 
handled for 2 min. each day for 14 days prior 
to acquisition training. Two trials were
given on Days 1 and 2 and 5 trials per day
thereafter except for Extinction Day 1, which
contained the final rewarded trial followed by
5 nonrewarded trials. The reward was 0.4
gm. of food powder in 0.3 ml. of water, given 
in a glass dish 4 in. from the goal-box end 
wall. In extinction the empty glass dish was 
present, and S was detained in the goal box 
for 30 sec. The minimum intertrial interval 
in both acquisition and extinction was 18 min. 
during which S was detained in a wooden and
wire mesh box with water available. Extinction
was carried to a minimum of 80 trials,
continuing if necessary until S took longer
than 120 sec. to enter the goal box.


RESULTS 


Four measures are reported: the
number of trials to various extinction 
criteria, running speed in early ex- 
tinction, the number of avoidance 
responses made in extinction, and the 
number of trials to the first avoidance 
response. 

Trials to criterion.—A criterion trial
was the first trial on which S exceeded 
a criterion number of seconds to enter 
the goal box after the start-box door 
was opened. Four criteria were 
chosen, 10, 20, 40, and 120 sec., and 
the mean numbers of trials to each of 
these are presented in Fig. 1. With 
the criteria of 10 and 20 sec., the 
groups did not differ (F < 1.00). 
With the 40-sec. criterion the differ- 
ence among groups was significant at 
the .01 level (F = 3.69, df = 5/69) 
and on the 120-sec. criterion the differ- 
ence was significant at the .001 level 
(F = 7.48, df = 5/69). On both of
these latter criteria the relationship
between N_r and trials to criterion was
negative.

FIG. 1. The mean number of trials to
criteria of 10, 20, 40, and 120 sec. between E's
opening the start box and S's entering the
goal box.

FIG. 2. Mean running speed in sliding
blocks of two trials over the first 3 days of
extinction. (The first trial of each day is
omitted.)
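
The trials-to-criterion measure defined above can be sketched in a few lines. This is only an illustration of the definition (the first trial on which the goal-box entry latency exceeds the criterion); the function name and the latency values are hypothetical and are not data from the experiment.

```python
def trials_to_criterion(latencies_sec, criterion_sec):
    """Return the 1-based number of the first trial whose latency exceeds
    the criterion, or None if the criterion is never exceeded."""
    for trial, latency in enumerate(latencies_sec, start=1):
        if latency > criterion_sec:
            return trial
    return None


# Hypothetical start-box-to-goal-box latencies (sec.) for one S in extinction.
latencies = [3.2, 4.1, 8.5, 12.0, 9.7, 25.3, 41.0, 130.0]

# The four criteria used in the text: 10, 20, 40, and 120 sec.
by_criterion = {c: trials_to_criterion(latencies, c) for c in (10, 20, 40, 120)}
```

Because the same latency record is scored against each criterion, the stricter criteria can only be reached on the same or a later trial than the more lenient ones.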

Running speed in extinction.—In
Fig. 2 is depicted the running speed
(over the entire alleyway) of the six
groups in the first 3 days of extinction.
The other speed measures showed essentially
identical results. On the
initial point Groups 20, 40, 60, and
100 were but little different whereas
Groups 10 and 80 were slower. In
subsequent trials Group 100 showed 
the greatest decrement and, with 
Group 80, was responding at the 
slowest speeds at the end of the 3 days. 
Group 10 decreased the least and the 
other three groups fell roughly in 
order between Groups 10 and 80. 
The groups, ranked in mean response
speed per trial, were Group 20
(M = .60 fps); Group 40 (M = .55
fps); Group 10 (M = .52 fps);
Group 60 (M = .50 fps); Group 100
(M = .41 fps); and Group 80 (M = .37
fps). A trial by trial mixed analysis
of variance (Lindquist, 1953) yielded
a significant Groups effect (F = 3.64,
df = 5/69, P < .01), a significant
Trials effect (F = 77.77, df = 12/828,
P < .001), and a significant Groups
× Trials interaction (F = 1.96,
df = 60/828, P < .01).

FIG. 3. The mean total number of extinction
trials on which an avoidance response
occurred and the mean number of trials to the
first avoidance response.
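
The degrees of freedom reported for the mixed analysis can be checked with a short sketch. It assumes 75 Ss in six groups (an N implied by the error df of 69) and 13 trial points (implied by the Trials df of 12); neither figure is stated in this form in the text, and the function name is illustrative.

```python
def mixed_anova_df(n_subjects, n_groups, n_trial_points):
    """Degrees of freedom for a groups x trials mixed design
    (groups between Ss, trials within Ss)."""
    between = n_groups - 1
    ss_within_groups = n_subjects - n_groups   # error df for Groups
    trials = n_trial_points - 1
    within_error = ss_within_groups * trials   # error df for Trials terms
    return {
        "groups": (between, ss_within_groups),
        "trials": (trials, within_error),
        "groups_x_trials": (between * trials, within_error),
    }


# Assumed values: 75 Ss, 6 groups, 13 trial points.
df = mixed_anova_df(n_subjects=75, n_groups=6, n_trial_points=13)
```

With these assumed values the sketch gives 5/69, 12/828, and 60/828, which agree with the df reported for the Groups, Trials, and Groups × Trials effects.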

Avoidance responses in extinction.—
An avoidance response was recorded
whenever S turned and moved in the
direction of the start box. In Fig. 3
two different measures are shown.
One is the mean number of trials
before the first avoidance response
occurred. A simple analysis of variance
of these data yielded a significant
Groups effect (F = 4.10, df = 5/69,
P < .005); the relationship between
this variable and N_r was negative.
The second is the mean number of
trials on which an avoidance response
occurred in the 80 extinction trials.
An analysis of these data yielded a
significant Groups effect (F = 5.14,
df = 5/68,² P < .001); the relationship
between this variable and N_r
was positive.

² One S was dropped after Extinction Trial
30 because of an experimental error. This
reduced the df for this comparison to 68.


DISCUSSION


Under the conditions of this experiment,
trials to extinction criteria of 40
and 120 sec. were negatively related to
N_r and running speed in the first 3 days
of extinction was nonmonotonically related
to N_r. These data support the
conclusions of Birch, Ison, and Sperling
(1960) in their account of overlearning-reversal
problems and confirm and extend
the findings of North and Stimmel
(1960).

This relationship between trials to
criterion and N_r is to be contrasted with
the negatively accelerated increasing
function typically obtained in the Skinner
box. One possible reason for this
difference is that in the present experiment
and in that of North and Stimmel
(1960) the reward magnitude (W_r) was
large; whereas, in the Skinner box the
reward was relatively small. Several
experiments have suggested that N_r and
W_r interact in determining resistance to
extinction, the relationship between R_n
and W_r being positive at small N_r
(Zeaman, 1949) but negative at large N_r
(Armus, 1959). This interaction might
be reversible, i.e., at large W_r, R_n is
negatively related to N_r; whereas, at
small W_r, R_n is positively related to N_r
over at least part of the range. A second
possibility is that the form of the function
is determined by some characteristic of
the investigated response. The one
overlearning-reversal experiment which
did not use a running response (McCulloch
& Pratt, 1934) found that extended
training retarded the subsequent reversal,
which is contrary to the results of the
later studies. Whether nonmonotone or
negative relationships are peculiar to the
running response and whether they can
be obtained with other responses, given
appropriate values of N_r and W_r, are
subject to further investigation.

The positive relationship between the
number of avoidance responses and N_r
and the negative relationship between
trials to the first avoidance response and
N_r are consistent with interference
theories of extinction stressing the acquisition
of frustration-instigated avoidance
responses which compete with the
approach response (e.g., Birch, 1961;
North & Stimmel, 1960). Following
Amsel (1958) and Spence (1960), the
magnitude of frustration elicited on nonreinforced
trials is assumed to be in part
a positive function of the number of prior
reinforcements, which, with the further
assumption that the strength of the
avoidance response is positively related
to frustration magnitude, is sufficient to
account for these two relationships.


SUMMARY 


The relationship between the resistance
to extinction of a running response and the
number of acquisition trials (N_r) was investigated.
Six groups of rats received either
10, 20, 40, 60, 80, or 100 rewarded trials
followed by 80 nonrewarded extinction trials
at five trials per day with an intertrial interval
of 18 min. The mean numbers of trials to
extinction criteria of 40 and 120 sec. were
negatively related to N_r and running speed
in early extinction was nonmonotonically
related to N_r. These data were contrasted
with those previously obtained in the Skinner
box. In addition, the mean number of trials
on which avoidance responses occurred in
extinction was positively related to N_r and
the mean number of trials to the first avoidance
response was negatively related to N_r.
These latter relationships are consistent with
interference theories of extinction which
stress the acquisition of competing avoidance
responses.

REFERENCES 

Amsel, A. The role of frustrative nonreward
in noncontinuous reward situations. Psychol.
Bull., 1958, 55, 102-119.

Armus, H. L. Effect of magnitude of reinforcement
on acquisition and extinction
of a running response. J. exp. Psychol.,
1959, 58, 61-63.

Birch, D. A motivational interpretation of
extinction. In M. R. Jones (Ed.), Nebraska
symposium on motivation: 1961.
Lincoln: Univer. Nebraska Press, 1961.
Pp. 179-196.

Birch, D., Ison, J. R., & Sperling, S. E.
Reversal learning under single stimulus
presentation. J. exp. Psychol., 1960, 60,
36-40.

Capaldi, E. J., & Stevenson, H. W. Response
reversal following different amounts
of training. J. comp. physiol. Psychol.,
1957, 50, 195-198.

Ison, J. R., & Birch, D. T maze reversal
following differential endbox placement.
J. exp. Psychol., 1961, 62, 200-202.

Lindquist, E. F. Design and analysis of
experiments in psychology and education.
Boston: Houghton Mifflin, 1953.

McCulloch, T. L., & Pratt, J. G. A study
of the presolution period in weight discrimination
by white rats. J. comp. Psychol.,
1934, 18, 271-290.

Miles, R. C. The relative effectiveness of
secondary reinforcers throughout deprivation
and habit strength parameters. J.
comp. physiol. Psychol., 1956, 49, 126-130.

North, A. M., & Stimmel, D. T. Extinction
of an instrumental response following a
large number of reinforcements. Psychol.
Rep., 1960, 6, 227-234.

Perin, C. T. Behavior potentiality as a
joint function of the amount of training and
degree of hunger at the time of extinction.
J. exp. Psychol., 1942, 30, 93-113.

Pubols, B. H. The facilitation of visual and
spatial discrimination by overlearning.
J. comp. physiol. Psychol., 1956, 49, 243-248.

Reid, L. S. The development of noncontinuity
behavior through continuity
learning. J. exp. Psychol., 1953, 46, 107-112.

Spence, K. W. Behavior theory and learning.
Englewood Cliffs, N. J.: Prentice Hall,
1960.

Williams, S. B. Resistance to extinction as
a function of the number of reinforcements.
J. exp. Psychol., 1938, 23, 506-522.

Zeaman, D. Response latency as a function
of amount of reinforcement. J. exp.
Psychol., 1949, 39, 466-483.


(Received August 21, 1961) 


Journal of Experimental Psychology 
1962, Vol. 64, No. 3, 318-324


EYE-MOVEMENT LATENCY, DURATION, AND RESPONSE
TIME AS A FUNCTION OF ANGULAR
DISPLACEMENT¹


ALBERT E. BARTZ²
Applied Research Laboratory, University of Arizona 


Characteristics of eye movements
and reaction time to visual stimuli
have both been studied systematically
since the beginning of the twentieth
century. Various components of the
visual response were systematically
investigated by Dodge and others
(Diefendorf & Dodge, 1908; Dodge &
Cline, 1901). Using photographic recording
techniques, they found that
the average latency was about 200
msec., while the eye movement duration
was 29 msec. for a 5° movement,
and increased to 100 msec. for a 40°
movement. Essentially the same results
were obtained by Miles (1936)
and Hackman (1940).

It must be noted that these time
intervals do not reflect the time involved
in the process of “seeing” an
object in the periphery. After the
eye has fixated upon the peripheral
stimulus, the observer still must
process the new information and make
some response.

More recently other investigators
have been concerned with the total
response time (RT) when there is
more than a simple movement involved.
Hyman (1953) found that
the total RT increased when the task
required S to identify the specific
location of the stimulus. Words were
assigned to various lights and the RT
was measured by a voice key set off
when S pronounced the correct word
for the stimulus location. It should
be noted that this increase in RT
occurred even though S was not
specifically instructed to move his
eyes since Hyman’s stimulus lights
subtended a maximum of only 2.5°
of visual angle. This type of RT is
more closely related to the problem of
seeing, since the total visual reaction
must include an identification of what
is seen. As was expected Hyman
found this identification type of RT
to be longer, the lengthening being a
function of the statistical probability
that a stimulus would appear in the
specific location identified. Hyman’s
vocal RTs varied from 300 to 750
msec.

¹ This paper is based upon a thesis submitted
to the Graduate School of the University
of Arizona in partial fulfillment of the


Highway Research Board for Contract AFE
8093 that made the project possible, and to
R. W. Lansing, R. L. Lucas, and H. Tucker
for their criticism and suggestions.

² Now at Concordia College, Moorhead,
Minnesota.

However, this complex response
still does not represent accurately the
process of “seeing” an object in the
periphery. To see an object in the
periphery S must not only identify
the location and swing his eyes to it,
but also must interpret the stimulus.
The present research was designed for
two purposes. Experiment I involved
the investigation of RTs as functions
of angular displacement from the line
of regard and the number of stimuli
to which S must attend. Experiment
II was designed to isolate and
measure the various components of
the total RT. By using the electrical
method for recording eye movements
(Ford & Leonard, 1958; Mowrer,
Ruch, & Miller, 1936), it was possible
to isolate the latency, eye-movement
duration, and the time required for
interpreting the stimulus.


EXPERIMENT I
Method 


Subjects.—The Ss were 3 20-yr.-old male 
volunteer undergraduates. They were free 
from pertinent visual defects as measured by 
an Orthorater. 

Apparatus.—The peripheral stimuli were 
arranged in a semicircle about S, and were at 
40°, 20°, 10°, 5°, and 2.5° right and left. 
There was also one stimulus at the center or 
0°. A point between S’s eyes was the center 
of a circle 6 ft. in radius, and the 11 stimuli 
were at eye level.

The stimuli were the digits 4, 5, 6, and 9 
presented by Burroughs Type BD200S Nixie 
indicator tubes. The height of each numeral 
was .305 in., subtending a visual angle of 14′
at a distance of 6 ft. These four digits were 
chosen on the basis of preliminary tests as 
giving a good response for triggering the 
voice relay circuit. 
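
The stated visual angle can be checked from the numeral height and viewing distance. The sketch below uses the standard visual-angle relation (twice the arctangent of half the size over the distance); the function name is illustrative and the relation is supplied here, not taken from the text.

```python
import math


def visual_angle_arcmin(size, distance):
    """Visual angle, in minutes of arc, subtended by an object of the given
    size at the given distance (both in the same units)."""
    angle_deg = math.degrees(2 * math.atan(size / (2 * distance)))
    return angle_deg * 60


# Numeral height .305 in. viewed at 6 ft. (72 in.), as given in the text.
numeral_angle = visual_angle_arcmin(0.305, 6 * 12)
```

The result is between 14 and 15 minutes of arc, which is in line with the value of about 14′ given for the numerals.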

The indicator tubes were mounted on a 
curved panel painted a flat gray to minimize 
glare. The only illumination in the room was 
a fluorescent source located 6 ft. behind and 
3 ft. above S, giving an illumination on the 
panel of 2.78 mL. 

To keep S looking at the center of the 
display prior to the presentation of a stimulus, 
a tracking task requiring continuous monitor- 
ing was used. This task required S to follow 
a light moving in a triangular pattern with 
another light controlled by a three-button 
switch. The triangle subtended 1.5° at S's 
eye, and S could perform satisfactorily while 
fixated on the center of the triangle. 

The S sat in an armchair with a headrest, 
preventing any horizontal head movements, 
and wore a headset with an attached carbon 
microphone. The microphone triggered the 
voice relay circuit whenever S responded 
verbally to the number presented in any one 
of the indicator tubes. This circuit tripped 
a latching relay that stopped the timer. 

The E had controls to select an indicator 
light at any of the 11 positions and any of the 
4 numerals appearing in it. 

Procedure.—In both the training and experimental
sessions, the procedure for the
presentation of the stimuli was the same.
The S entered the experimental room which
was light-proofed to exclude any extraneous
illumination that might reflect from the
curved surfaces of the indicator tubes. After
the headrest and microphone were adjusted,
10 warm-up trials were given before the
session began.

At the start of each trial S began tracking 
the center display of lights. At intervals of 3, 
4.5, or 6 sec. after the start of the tracking 
task, a number in one of the indicator tubes 
came on and the tracking lights extinguished. 
The S then moved his eyes to the position 
of the stimulus and verbalized the number 
into the microphone. This response stopped 
the timer and extinguished the number in the 
indicator tube. After a 5-sec. rest period the 
next trial was begun, and S resumed his 
tracking task. There were two 1-min. rest 
periods during the experimental session. 

Experimental design.—To insure reliability 
of results, all Ss were highly trained prior to 
the beginning of the experimental sessions. 
For the training trials the indicator lights in 
the 20° right and left positions were used. 
Each S made 144 responses per session, 72 to 
each position. The four numerals appearing 
in the indicator lights were randomized among 
the 72 stimuli for each position. The training 
sessions were concluded when both the means 
and SDs became asymptotic. This occurred 
on Day 16 for 1 S and on Day 19 for the 
other 2. After the training trials were con- 
cluded each S experienced two sessions of 
responding to all 11 lights. 

The experimental trials were run for 12 
days and were initiated on the day following 
the training sessions. In order to determine 
how RT varies as a function of the number of 
possible stimuli, it was necessary to divide 
Exp. I into Sequences A and B. The stimuli
in Sequence A consisted of the indicator 
lights at the 20° and 10° right and left posi- 
tions. Each of the 16 possible combinations 
of position and indicator numeral appeared 
three times in each group of 48 trials, and was 
randomized throughout each group. The 
entire session of 144 trials consisted of three 
of these groups of 48. Sequence A was 
presented on Experimental Days 1, 2, 11, 
and 12. 

TABLE 1

MEAN RTs AND SDs (.01 SEC.) TO STIMULUS POSITIONS IN SEQUENCE B: EXP. I

                 Left             Right
Position     Mean     SD      Mean     SD
40°          91.12    8.3     90.05    7.8
20°          77.70    7.2     77.14    6.2
10°          72.82    7.4     72.13    5.9
5°           70.35    7.4     69.99    7.3
2.5°         67.19    7.8     66.81    iy
0°           58.43    6.9     -        -

The stimuli in Sequence B consisted of the
indicator lights in all 11 positions. To
simplify the data reduction the light at the
center position was considered as two stimuli,
with one-half of the responses counting on the
left side of the visual field and the other half
counting on the right side. As a result there
were 48 possible combinations of position and
indicator numeral. Each of the combinations
appeared once in each group of 48 trials and
was randomized throughout. Again, each
experimental session consisted of three groups
of 48 trials. Sequence B was presented on
Experimental Days 3 through 10.
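
The two trial schedules described above can be sketched as follows. This is a minimal illustration, not the author's procedure: the position labels (including "0L" and "0R" for the center light counted as two stimuli), the function names, and the use of a seeded shuffle are all assumptions made here.

```python
import random

NUMERALS = (4, 5, 6, 9)

# Hypothetical labels: degrees plus side; the center light counts as two stimuli.
SEQUENCE_A = ("20L", "10L", "10R", "20R")
SEQUENCE_B = ("40L", "20L", "10L", "5L", "2.5L", "0L",
              "0R", "2.5R", "5R", "10R", "20R", "40R")


def make_group(positions, repeats, rng):
    """One randomized group of trials in which every position x numeral
    combination appears `repeats` times."""
    trials = [(p, n) for p in positions for n in NUMERALS] * repeats
    rng.shuffle(trials)
    return trials


def make_session(positions, repeats, rng, groups=3):
    """A session made of three independently randomized groups."""
    session = []
    for _ in range(groups):
        session.extend(make_group(positions, repeats, rng))
    return session


rng = random.Random(0)                           # seed chosen arbitrarily
session_a = make_session(SEQUENCE_A, 3, rng)     # 16 combinations x 3 per group
session_b = make_session(SEQUENCE_B, 1, rng)     # 48 combinations x 1 per group
```

Both schedules come to 144 trials per session, built from three groups of 48, which matches the session length given in the text.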


Results 


The data from Sequence B, with all
11 positions, were analyzed by means
of a four-factor (Position, Numeral,
S, and Day) analysis of variance.
All of the main factors were significant
beyond the .001 level. Mean RTs to
the lights at various positions in the
periphery and their SDs are given in
Table 1.

The four numerals, or the four vocal
responses required (4, 5, 6, and 9),
yielded significantly different mean
RTs. An examination of the means
showed that the vocal responses 4 and
5 were significantly faster than the
response 6 or 9. The significant Days
effect was due to lengthened RTs
occurring on Days 3 and 4.

The Position × Numeral interaction
was significant at the .001 level.
This indicated that at some positions
certain digits yielded faster RTs than
at other positions. An inspection of
the means showed that in 8 of the 11
positions the fastest mean response
was made to the numeral 4. However,
in the other three positions, 40°,
20°, and 10° left, the fastest response
was made to the numeral 5.

The significant Position × S interaction
indicated that at some positions
certain Ss performed better than
others. Inspection of the means
showed that 1 S was faster at the 40°
and 20° positions and slower on the
other positions.

The Numeral × S interaction indicated
that some Ss responded faster
to certain numerals.

The significant Position × Numeral
× Day and the Numeral × S × Day
interactions indicated that the significant
Position × Numeral and Numeral
× S interactions varied as a function
of the day on which the responses
were made. The results of the analysis
of variance are shown in Table 2.

The data from Sequence A, with
only four positions used (20° and 10°
right and left), were analyzed by a
similar analysis of variance. The
results of this analysis were identical
with that of Sequence B, with the exception
of the main effect of Day.
There was no significant difference between
responses made on the 4 days
of Sequence A, i.e., Days 1, 2, 11, and
12. This also indicated that the
counterbalancing technique used to
offset possible transfer effects was
successful.

FIG. 1. Response time as a function
of number of possible stimuli.

TABLE 2

ANALYSIS OF VARIANCE OF RTs: EXP. I

Source              df          MS            F
Position (P)        11      10,137.64     1,062.65**
Numeral (N)          3       1,891.67       198.29**
S                    2       8,683.35       910.20**
Day (D)              7         113.29        11.88**
P × N               33          34.21         3.59**
P × S               22          64.23         6.73**
P × D               77          10.71         1.12
N × S                6         138.83        14.55**
N × D               21          14.95         1.25
S × D               14          61.36         6.43**
P × N × S           66          15.86         1.66*
P × N × D          231           7.78
P × S × D          154          10.33         1.08
N × S × D           42          16.07         1.68*
P × N × S × D      462           9.54
Total            1,151
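
The internal arithmetic of Table 2 can be checked with a short sketch. It rests on an inference, not on anything stated in the text: the tabled F values for the main effects appear to equal each mean square divided by the four-way interaction mean square (9.54). The dictionary and function names are illustrative.

```python
# (df, MS) for each source as printed in Table 2.
TABLE_2 = {
    "P": (11, 10137.64), "N": (3, 1891.67), "S": (2, 8683.35), "D": (7, 113.29),
    "PxN": (33, 34.21), "PxS": (22, 64.23), "PxD": (77, 10.71),
    "NxS": (6, 138.83), "NxD": (21, 14.95), "SxD": (14, 61.36),
    "PxNxS": (66, 15.86), "PxNxD": (231, 7.78), "PxSxD": (154, 10.33),
    "NxSxD": (42, 16.07), "PxNxSxD": (462, 9.54),
}


def f_ratio(source, error="PxNxSxD"):
    """F for a source, ASSUMING the four-way interaction is the error term."""
    return TABLE_2[source][1] / TABLE_2[error][1]


def interaction_df(*factors):
    """df of an interaction as the product of its factors' df."""
    product = 1
    for factor in factors:
        product *= TABLE_2[factor][0]
    return product


total_df = sum(df for df, _ in TABLE_2.values())
```

Under that assumption the main-effect ratios reproduce the printed F values to two decimals, the interaction df equal the products of the component df, and the df sum to the printed total of 1,151 (12 position codes × 4 numerals × 3 Ss × 8 days, less 1).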

The use of these two sequences was 
to enable a comparison of responses 
made to 11 possible positions (Se- 
quence B) as against 4 possible posi- 
tions (Sequence A). Shown in Fig. 1 
is the comparison of the responses at 
the 20° and 10° left and right positions 
from both sequences. As is evident 
from the graph, the mean RTs were 
faster for the smaller number of 
possible stimulus positions. All dif- 
ferences were significant at the .05 
level. Also shown in Fig. 1 are the 
data from a situation in which only 
the 20° left and right positions were 
used. These means were taken from 
the last 5 days of the training trials. 


EXPERIMENT II 
Method 


Apparatus.—The same apparatus was used
for presenting the stimuli as in Exp. I. To
record the eye movements necessary for
measuring the components of the total response,
electrodes were placed behind the
external canthi of S's eyes. The output from
the electrodes was fed to a Grass Model P-5
preamplifier, and the output of the preamplifier
terminated at an oscilloscope. The
upper trace of this dual channel oscilloscope
was a record of S's eye movements. For the
lower trace the input was from the first stage
of amplification of the electronic voice key.
The sweep was triggered when an indicator
light came on. A Dumont oscilloscope
camera was used to photograph the tracings.

Procedure.—Experiment II was begun on 
the day following the close of Exp. I. As 
before, S attended to the tracking task until 
an indicator light came on, moved his eyes 
to the stimulus, and responded verbally to the 
indicator numeral. 

Experimental design.—Because of the time 
required to manipulate the camera for record- 
ing the CRT trace, it was necessary to reduce 
the number of responses during the experi- 
mental session. It was also necessary to 
alter the order of presentation of the indicator 
lights. The order was arranged so that only 
every third response was recorded by the 
oscilloscope camera. Since the main interest 
was in responses to stimuli involving eye 
movement, only those responses to the 40°, 
20°, 10°, and 5° right and left positions were 
recorded. To insure high reliability in the 
vocal response, all responses recorded were 
to the stimulus 5. The remaining three 
numerals were divided equally among the un- 
recorded stimuli. 

Each S made a total of 96 responses at each 
session. Of these 32 were recorded, 4 for 
each of the eight positions. The Ss were not 
aware that only some responses were being 
recorded. After 4 days of testing, Ss made 


[Figure 2: the three components LATENCY, MOVEMENT, and VOCALIZATION plotted for LEFT and RIGHT positions.]

Fig. 2. Portion of response occupied
by three visual components.




a total of 384 recorded responses, or 48 for 
each of the eight positions. 
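
The response counts given in this design can be checked with a few lines of arithmetic. The sketch below is only an illustration; the variable names are hypothetical, and the number of Ss is not stated in this passage, so it is inferred here from the totals rather than taken from the report.

```python
# Counts quoted in the Experimental design paragraph.
responses_per_session = 96   # all responses made by one S in one session
recorded_per_session = 32    # responses photographed in one session
positions = 8                # 40, 20, 10, and 5 degrees, left and right
days = 4                     # days of testing
total_recorded = 384         # total recorded responses reported

# Every third response was recorded.
recording_ratio = responses_per_session // recorded_per_session

# Four recorded responses per position per session.
per_position_per_session = recorded_per_session // positions

# Recorded responses contributed by one S over the 4 days.
recorded_per_subject = recorded_per_session * days

# ASSUMPTION: the number of Ss is inferred from the totals, not quoted.
implied_subjects = total_recorded // recorded_per_subject

# Recorded responses per position over the whole experiment.
per_position_total = total_recorded // positions

print(recording_ratio, per_position_per_session,
      recorded_per_subject, implied_subjects, per_position_total)
```

Under these figures the total of 384 is consistent with 48 recorded responses at each of the eight positions.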


Results 


The proportions of the total RTs 
accounted for by the three components 
of latency, movement, and vocaliza- 
tion are shown in Fig. 2. As was 
expected, both eye-movement latency 
and eye-movement duration increased 
as the angle from the line of regard 
increased. However, vocalization 
time (the interval between the cessa- 
tion of the eye movement and S’s 
vocal response) also increased with 
angle. The means and SDs for 
latency, duration, and vocalization are 
shown in Table 3. 

No significant practice effect ap- 
peared during the 4 experimental 
days. As in Exp. I there were sig- 
nificant subject differences, appearing 
mostly in the latency and vocalization. 


Discussion 


The type of RT that was investigated 
in this experiment involved much more 
than a simple eye-movement latency, 
so it was logical to expect greater RTs 
than the latencies reported by Dodge, 
Miles, or Hackman. Their interest was 
not in the speed of “seeing,” but only in
the time that was required for the eyes to 




begin moving to a peripheral stimulus. 


In Exp. II it was found that the average
eye-movement latency agreed quite well
with these previous studies. The over-
all mean latency to stimuli at all angles
was .213 sec. (SD = .041 sec.). This
coincides very well with previous data.
However, as mentioned earlier, data on 
the latency of the ocular reaction do
not accurately reflect the process re-
quired to see objects in the periphery.
The S must get his eyes in motion, swing 
his eyes to the new object, and then 
make his response. 

It can be seen from Fig. 2 that it is 
inaccurate to state an “average” value 
for eye-movement latency, since the
time required for the eyes to begin their 
movement was a function of the angle at 
which the new stimulus was located. 
The fact that eye-movement latency 
increases as the angle from the center 
line of regard increases was noted earlier 
by White, Eason, and Bartlett (1962). 

It is further apparent that RTs must 
increase as a function of the angle 
through which the eyes must move. 
The actual movement of the eye takes 
longer as the angle from the center line 
of regard increases. However, this time
interval was extremely small, accounting 
for only 5% to 10% of the total RT. 

A close inspection of the total RT as a 
function of position shown in Table 1 
yielded an interesting observation. The 
large differences between mean RTs at 


TABLE 3

MEANS AND SDs FOR EYE-MOVEMENT LATENCY, DURATION,
VOCALIZATION, AND TOTAL RESPONSE IN EXP. II

            Latency        Duration       Vocalization   Total
Position    Mean   SD      Mean   SD      Mean   SD      Mean   SD
40° L       [?]    [?]     [?]    [?]     [?]    [?]     [?]    [?]
20° L       21.2   4.4     5.7    0.8     50.5   5.9     77.4   8.0
10° L       19.6   2.4     3.8    0.6     47.9   5.8     71.5   9.5
5° L        [?]    [?]     3.1    0.5     45.6   5.7     68.5   6.5
5° R        20.8   2.5     [?]    [?]     [?]    [?]     [?]    [?]
10° R       19.5   2.5     3.8    0.6     46.6   6.1     69.9   6.7
20° R       20.7   3.9     5.7    0.9     [?]    [?]     [?]    6.9
40° R       24.2   4.2     8.7    1.2     57.5   8.1     90.4   8.7

[?] = entry illegible in the source.






the various positions could not be fully 
explained by differences in eye-move- 
ment latency or duration. For example, 
the mean RT to the 40° left position was 
.911 sec. and that of the 20° left position
was .777 sec., for a difference of .134 sec.
The difference in eye-movement latency
to the two positions was .036 sec., and
the difference in eye-movement duration 
was only .030 sec. Thus latency and 
movement accounted for less than half of 
the original difference between the total 
RTs to the two positions. 
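
The arithmetic behind this comparison can be laid out explicitly. The following is a minimal sketch that uses only the values quoted in the paragraph above; the variable names are hypothetical and chosen for illustration.

```python
# Values quoted in the text, in seconds.
rt_40_left = 0.911      # mean total RT, 40 degrees left
rt_20_left = 0.777      # mean total RT, 20 degrees left
latency_diff = 0.036    # difference in eye-movement latency
duration_diff = 0.030   # difference in eye-movement duration

# Difference between the two total RTs.
total_diff = round(rt_40_left - rt_20_left, 3)

# Part of that difference carried by latency plus movement.
explained = round(latency_diff + duration_diff, 3)

# Remainder, which the text attributes to vocalization time.
residual = round(total_diff - explained, 3)

# Proportion of the total difference carried by latency and movement.
share_explained = explained / total_diff

print(total_diff, explained, residual, round(share_explained, 3))
```

Latency and movement together contribute .066 sec. of the .134 sec. difference, which is slightly under one half, leaving .068 sec. to the remaining component.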

Since two of the three components 
have been accounted for, the difference 
must be due to the third component, 
the vocalization time. As defined ear- 
lier, vocalization was the time interval 
between the completion of the eye move- 
ment and S’s vocal response. As can be 
seen from Table 3, the time required to 
make this response after S was looking at 
the signal varied as a function of in- 
dicator light position. In terms of the 
experimental situation, it took longer for 
S to “recognize” the numeral presented 
and verbalize the response when the 
stimulus was at a greater angle in the 
periphery. 

There are several possible explanations 
that may be advanced to account for this 
observation. With the type of recording 
method used, it was difficult to distin- 
guish very small eye excursions in com- 
parison to the gross movement at the 
onset of the signal. Therefore, if at the 
end of a large angular movement, the 
eye hunts for an exact fixation, this 
hunting would probably occupy several 
degrees or less. Small hunting excur- 
sions could not be read from the type of 
records taken. It is possible that this 
hunting time occupies proportionately 
more time as the angle increases. This 
hunting may be comparable to the vari- 
able error discussed by Woodworth 
(1899) in simple motor tasks. He found 
that variable error for controlled rapid 
movements increased with amplitude or 
distance traveled. Fitts (1954) has sug- 
gested that in the concept of fixed in- 
formation-transmission capacity of the 
motor system, such increased variability 
is due to decreased information that the 




movement provides. It is possible that 
the hunting may be the variable error of 
a simple motor response. 

Another possible explanation is in 
terms of accommodation. As the eyes 
move, the muscles exert different pat- 
terns of tension upon the two eyes, and 
therefore some accommodation may be 
required in fixating upon a peripheral 
stimulus. Because of binocular con- 
vergence this effect is probably magnified 
for stimuli close to the eyes. Although 
the stimuli were 6 ft. from S, sufficient 
accommodation may be required to make 
the effect appreciable. 

In an experiment mentioned previ- 
ously (Hyman, 1953), it was reported 
that RT increased as a function of the 
probability that a stimulus would ap- 
pear in a specific location. It would 
follow from this evidence that RT in 
general will increase with an increase in 
the number of possible stimuli to which 
S must react. As was noted in Fig. 1 
the RTs for 11 possible stimuli were 
significantly longer than for 4 possible 
stimuli. (A decrease was also noted in 
the training trials when 2 possible stimuli 
were used.) 

In the 8 days with 11 stimuli, there 
appeared to be no further learning taking 
place. With the Days main effect of 
Sequence A nonsignificant, it can be as-
sumed that no learning in that situation
took place over a 12-day period. How- 
ever, considering the data of Mowbray 
and Rhoades (1959), it is entirely possible 
that RTs could be significantly shortened 
with practice over a long period of time. 

Although some investigators (Hack- 
man, 1940) have found shorter RTs to 
stimuli on the right side, there were no 
significant differences in this experiment 
between RTs to stimuli on the left and 
right sides. 

In the general use of the term, the 
process of “seeing” refers to a wide 
variety of functions. The usual figure 
given for RTs to a visual signal is about 
one-fifth of a second. In such situations 
the practiced S makes some kind of 
manual response to the onset of a visual 
signal. However, a review of the liter- 
ature shows a wide range of values for 




RT—obviously a function of the experi- 
mental conditions. In this study, re- 
quiring more than a simple reaction, the 
range in RTs was from .584 sec. for the 
central stimulus to .906 sec. for extreme 
positions, with an overall mean of .727
sec. 


SUMMARY 


The present research was initiated with 
two purposes in mind: (a) to determine the 
speed of seeing in a complex visual task (Exp.
I), and (b) to isolate and measure the various 
components of the total response (initial 
latency, travel time of the eye, and the 
response time for interpreting the signal). 

Results of Exp. I showed that RT in-
creased as the angle from the center line of
regard increased. There was no significant
difference between pairs of means for right
and left sides. It was also found that response
time increased as the number of possible signals
increased. In Exp. II, the time required for
each of the three components of the response
increased as the angle increased. Several
interpretations of the positive relationship
between angle and the time required for S to
make his vocal response after his eyes had
reached the signal were considered.


REFERENCES 


DIEFENDORF, A. R., & DODGE, R. An
experimental study of the ocular reactions
of the insane from photographic records.
Brain, 1908, 31, 451-489.

DODGE, R., & CLINE, T. S. The angle
velocity of eye movements. Psychol. Rev.,
1901, 8, 145-157.

FITTS, P. M. The information capacity of
the human motor system in controlling the
amplitude of movement. J. exp. Psychol.,
1954, 47, 381-391.

FORD, A., & LEONARD, J. L. Techniques for
recording surface bioelectric direct cur-
rents. USN Electron. Lab. res. Rep., 1958,
No. 839.

HACKMAN, R. B. An experimental study of
variability in ocular latency. J. exp.
Psychol., 1940, 27, 546-558.

HYMAN, R. Stimulus information as a de-
terminant of reaction time. J. exp. Psy-
chol., 1953, 45, 188-196.

MILES, W. R. The reaction time of the eye.
Psychol. Monogr., 1936, 47 (2, Whole No.
212).

MOWBRAY, G. H., & RHOADES, M. V. On the
reduction of choice reaction times with
practice. Quart. J. exp. Psychol., 1959,
11, 16-23.

MOWRER, O. H., RUCH, T. C., & MILLER,
N. E. The corneo-retinal potential differ-
ence as the basis of the galvanometric
method of recording eye movements.
Amer. J. Psychol., 1936, 114, 423-428.

WHITE, C. T., EASON, R. G., & BARTLETT,
N. R. Latency and duration of eye
movements in the horizontal plane. J.
Opt. Soc. Amer., 1962, 52, 210-213.

WOODWORTH, R. S. The accuracy of volun-
tary movement. Psychol. Monogr., 1899,
3 (2, Whole No. 13).


(Received August 24, 1961) 


Journal of Experimental Psychology 
1962, Vol. 64, No. 3, 325-326 


SUPPLEMENTARY REPORT: DIRECTION OF CHANGE IN CS IN
EYELID CONDITIONING ¹


FRANK A. LOGAN AND ALLAN R. WAGNER
Yale University 


The assumption that the important param- 
eter of the CS is the amount of change from 
the pre-CS condition to the CS condition 
(e.g., Logan, 1954; Perkins, 1953) implies that 
a decrease in intensity should be as effective 
a CS as the corresponding increase in in- 
tensity. The assumption that the absolute 
value of the CS has a motivational (dynamo- 
genic) property (e.g., Hull, 1952) implies that 
an increase in intensity should be more effec- 
tive. Kish (1955) found tone-off to be a less 
effective CS than tone-on for avoidance 
conditioning in rats but Schwartz and Good- 
son (1958), using a comparable situation, 
found these events to be equally effective. 
Hansche and Grant (1960) concluded that 
light-off was as effective as light-on for eyelid 
conditioning under a procedure in which the 
light was off between trials for all Ss. The 
present study compares an increase with a 
decrease in intensity between two nonzero 
values treated symmetrically. 

Method.—The general features of the eye- 
lid conditioning apparatus, recording equip- 
ment, and procedures have been described 
elsewhere (Dufort & Kimble, 1958). The CS 
was provided by a circular milk glass disk, 
2.25 in. in diameter, set in a flat black ground, 
and illuminated from behind by General 
Electric NE30 neon bulbs. The onset of the 
CS was either an increase from two to four 
bulbs or a decrease from four to two bulbs. 
In each case, the CS intensity lasted for 
600 msec. during the last 100 msec. of which 
a 2-lb. air-puff UCS was delivered to the corner
of the eye. The non-CS intensity remained 
on during the intertrial interval which aver- 
aged 20 sec. in length. 

Five test trials of CS or UCS alone were 
followed by 60 conditioning trials. During 
these trials the CS for half of the Ss was an 
increase while for the other half it was a 
decrease in illumination. All Ss were then 
given 20 additional conditioning trials with 
the opposite CS. The results from 16 female 
student nurses were combined with those from 
40 male undergraduates since they were 
virtually identical. 

Results and discussion.—The results are
shown in Fig. 1. Both the increase and de- 


1 Supported in part by Grants G-9014 and G-13080 
from the National Science Foundation. 


[Figure 1: percent CR (ordinate) plotted against blocks of ten trials (abscissa) for two groups, Increase then Decrease and Decrease then Increase.]

Fig. 1. Percentage of conditioned responses during
training. (The solid curve refers to Ss for whom the CS
was an increase from the between-trials intensity
during the first 60 trials and a decrease during the next
20 trials, while the dashed curve refers to Ss who
received these CS conditions in the reverse order.)


crease in intensity were clearly and equally 
effective CSs in producing a relatively high 
level of conditioning. Although the null 
hypothesis cannot be proven statistically, 
the standard error of the difference between 
the groups at the end of training was only 7% 
and hence it is unlikely that the true differ- 
ence deviates very much from zero. The 
data thus indicate the greater relative im- 
portance of the change parameter of the CS 
rather than its absolute intensity. 

The degree of transfer when the direction 
of change was reversed is remarkable. In- 
deed, a slight drop in performance would be 
expected because of the “extinction trial” 
given inadvertently when the non-CS in- 
tensity was reversed between the last acquisi- 
tion trial and the first reversal trial. This 
finding suggests that generalization should 
be viewed in terms of a surface including 
the non-CS condition as well as the CS condi- 
tion. However, it will require a large para- 
metric study adequately to characterize this 
surface. 


REFERENCES 


DUFORT, R. H., & KIMBLE, G. A. Ready signals and
the effect of interpolated UCS presentations in eye-
lid conditioning. J. exp. Psychol., 1958, 56, 1-7.

HANSCHE, W. J., & GRANT, D. A. Onset versus termi-
nation of a stimulus as the CS in eyelid conditioning.
J. exp. Psychol., 1960, 59, 19-26.

HULL, C. L. A behavior system. New Haven: Yale
Univer. Press, 1952.

KISH, G. B. Avoidance learning to the onset and
cessation of conditioned stimulus energy. J. exp.
Psychol., 1955, 50, 31-38.

LOGAN, F. A. A note on stimulus intensity dynamism
(V). Psychol. Rev., 1954, 61, 77-80.

PERKINS, C. C., Jr. The relation between conditioned
stimulus intensity and response strength. J. exp.
Psychol., 1953, 46, 225-231.

SCHWARTZ, M., & GOODSON, J. E. Direction and rate
of conditioned stimulus change in avoidance per-
formance. Psychol. Rep., 1958, 4, 499-502.


(Received June 30, 1961)


Journal of Experimental Psychology
1962, Vol. 64, No. 3, 326-327


SUPPLEMENTARY REPORT: THE EFFECT OF STIMULUS DURATION AND 
LUMINANCE ON VISUAL REACTION TIME ¹


DAVID RAAB AND ELIZABETH FEHRER
Brooklyn College 


Raab, Fehrer, and Hershenson (1961) 
found that simple reaction time (RT) was 
independent of stimulus duration over the 
range of 10 to 500 msec. Luminance, on the 
other hand, was found to be an important 
determiner of RT. Since intensity rather 
than total energy (intensity times duration) 
determined RT, it is obvious that the critical 
duration (CD) for RT is 10 msec. or less for 
the three luminance levels (3000, 30, and 
0.3 ft-L) investigated. 

The term critical duration has been bor- 
rowed from visual threshold studies, which 
have shown reciprocity (Bunsen-Roscoe law) 
up to a CD of approximately 100 msec., 
beyond which temporal integration ceases 
and the threshold is defined solely in terms 
of luminance. 
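
The notion of a critical duration can be made concrete with an idealized two-branch rule: below the CD, luminance and duration trade off reciprocally, and at or beyond it only luminance matters. The sketch below is an illustration under that idealization only. The function name and the numerical values used are hypothetical and are not data from any study cited here.

```python
def threshold_luminance(duration_msec, critical_duration_msec, plateau_luminance):
    """Luminance needed to reach threshold under an idealized reciprocity rule.

    Below the critical duration the product luminance x duration is constant
    (Bunsen-Roscoe reciprocity). At or beyond it, the threshold is the
    plateau luminance regardless of duration.
    """
    if duration_msec <= 0:
        raise ValueError("duration must be positive")
    if duration_msec < critical_duration_msec:
        # Halving the duration doubles the luminance required.
        return plateau_luminance * critical_duration_msec / duration_msec
    return plateau_luminance


# Hypothetical example: CD of 100 msec., plateau threshold of 1.0 unit.
for duration in (10, 50, 100, 500):
    print(duration, threshold_luminance(duration, 100, 1.0))
```

Applied to RT rather than to threshold, the same logic says that once flash duration exceeds the CD, further prolonging the flash should leave RT unchanged.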

It seemed worthwhile to determine the 
CDs in the mediation of RT for the lumi- 
nances previously studied and the relation 
between RT and stimulus duration below 
these critical values. In the experiment to be 
reported, the six durations ranged from 0.5
to 20 msec., and thus overlapped the range 
used previously. Two additional inter- 
mediate luminances were included. 

Method.—Target flashes were generated
and RTs measured by the same equipment 
as that employed in our previous study. A 
single Tektronix wave-form generator pro- 
vided the gating pulses for the glow modu- 
lator tube; pulse durations were switched 
between trials, as required.

In order to generate flashes having wave 
forms as rectangular as possible, the driving 
pulses were shaped to “overvolt” the glow 
modulator tube, and the tube itself was placed 
next to an ultraviolet source. With these 
arrangements, flash energy was found to be 
¹ Supported by grants from the National Science Foundation [grant number illegible] and from the
National Institute of Neurological Diseases and Blind-
ness (3-1029) and by funds provided by Brooklyn
College. The data were gathered by Carlos Goldberg
and Naomi Maizel as part of an honors course.


proportional to flash duration within 0.5 db. 
from 0.5 to 20 msec. The circular target, 
1 cm. in diameter, subtended 1° 10’ of arc and 
was viewed binocularly. 

Two senior honors students and the 2 
authors served as Ss. Each S served in 30 
experimental sessions. Computations are 
based on data of the last 25 sessions. Only 
one luminance was used in a given session; 
the five luminances were counterbalanced over 
test days for each S. Each session consisted
of four blocks of 18 trials each, in which each 
combination of the six durations and three 
foreperiods appeared once in random order. 
Only the four longer durations could be 
explored for the 0.3 ft-L luminance, since 
this light was below foveal threshold when 
presented for 0.5 or 1 msec. 
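
The session structure described above (four blocks of 18 trials, each block containing every duration-foreperiod combination once in random order) can be sketched as follows. This is an illustration only. The six durations are read from the text and the axis of Fig. 1; the foreperiod values are not given in this report, so the labels used here are placeholders, and the function name is hypothetical.

```python
import itertools
import random

# Six flash durations in msec., as used for the four brighter luminances.
DURATIONS_MSEC = [0.5, 1, 2, 5, 10, 20]

# ASSUMPTION: the three foreperiods are not specified; these are placeholders.
FOREPERIODS = ["short", "medium", "long"]


def make_session(rng, n_blocks=4):
    """Return a list of blocks; each block holds all 18 combinations, shuffled."""
    session = []
    for _ in range(n_blocks):
        block = list(itertools.product(DURATIONS_MSEC, FOREPERIODS))
        rng.shuffle(block)  # independent random order within each block
        session.append(block)
    return session


session = make_session(random.Random(0))
trials_per_duration = {
    dur: sum(1 for block in session for d, _ in block if d == dur)
    for dur in DURATIONS_MSEC
}
print(len(session), len(session[0]), trials_per_duration)
```

With this arrangement each duration occurs 12 times in a session, three times in each of the four blocks.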

Each session began with 5 min. of dark 
adaptation. Four practice trials preceded 
the recorded trials. The four blocks were 
separated by 1-min. rest periods. 

The 12 RTs obtained in a session for each 
of the six durations were reduced to 10 by 


[Figure 1: reaction time in msec. (ordinate, 160 to 280) plotted against flash duration in msec. (abscissa, 0.5 to 20).]

Fig. 1. Reaction time as a function of stimulus
duration. (The parameter is flash luminance in ft-L.
Each data point is the mean for 4 Ss.)




discarding the longest and the shortest RT. 
Testing over 25 days (5 at each luminance) 
thus yielded means for each luminance- 
duration combination based on 50 trials. 
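
The trimming rule just described can be written out in a few lines. The sketch below is an illustration only: the 12 RT values are invented for the example, and the function names are hypothetical.

```python
def trim_extremes(rts):
    """Drop the single longest and single shortest RT from a list."""
    if len(rts) < 3:
        raise ValueError("need at least three RTs to trim both extremes")
    ordered = sorted(rts)
    return ordered[1:-1]


def mean(values):
    return sum(values) / len(values)


# HYPOTHETICAL data: 12 RTs (msec.) for one duration in one session.
session_rts = [212, 198, 205, 240, 201, 199, 207, 210, 195, 203, 208, 202]

kept = trim_extremes(session_rts)   # 12 RTs reduced to 10
sessions_per_luminance = 5          # 25 test days over 5 luminances
trials_per_cell = len(kept) * sessions_per_luminance

print(len(kept), trials_per_cell, mean(kept))
```

Discarding one value from each tail limits the influence of an occasional anticipatory or very slow response while keeping 10 of the 12 trials in every session.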
Results and discussion.—Mean RTs for
the 4 Ss combined are plotted in Fig. 1.
Each data point is thus based on 200 RTs.
For the two highest luminances, duration
is unrelated to RT over the range studied. 
For the 30 ft-L flash, there was a 10-msec. 
increase in RT when its duration was re- 
duced from 5 to 0.5 msec. For the two lowest 
luminances, stimulus duration has a far more
marked effect on RT, RT being obviously 
an accelerated function of flash briefness. 
Our results show that the CD for moder-
ately intense stimuli (3000 and 300 ft-L)
is remarkably brief, being less than 0.5 msec. 
At 30 and at 3 ft-L, CD lies between 2 and 5
msec. For the weakest target, the CD lies 
between 10 and 25 msec. The present study 
shows a small decrease in RT as duration 
increased from 10 to 20 msec. In the previous 
study, a smaller decrease occurred between 
10 and 25 msec., but there was no further 
decrease when this stimulus was prolonged 
beyond 25 msec.
These CDs for RT are far shorter than 
the 100-msec. value previously reported for 
absolute threshold (e.g., Baumgardt & Hill-
mann, 1961) or the minimal value of 30
msec. reported by Graham and Kemp (1938) 
for the incremental threshold at their highest 
background luminance. The three dependent 
variables, RT, RL, and DL, are thus differ- 
ently related to stimulus duration, with the 
CD being obviously shortest for RT. 
Although luminance differences are con- 
founded with test days (i.e., only one lumi- 
nance was studied in a given test session), 
the effect of luminance on RT is pronounced 


Journal of Experimental Psychology
1962, Vol. 64, No. 3, 327-328


and is apparent at all durations studied. 
That RT decreases when luminance is in- 
creased is consistent with earlier findings 
(see Woodworth & Schlosberg, 1954). But 
the form of the relation between luminance 
and RT will depend on stimulus duration 
unless each stimulus duration is greater than 
the CD. In other words, our data could be 
replotted to display six different luminance- 
RT functions, one for each flash duration. 

Our results show that although the overt 
response to a target flash may not appear 
until much later, the minimal latency of that 
response is determined very shortly after 
stimulus onset. The finding that increasing 
duration may cease to be effective long before 
the criterion response appears parallels the 
classical observation of this fact made by 
Hartline (1934). The fact that RT is deter- 
mined by so brief a “package” of luminous 
energy is consistent with our earlier finding 
that RT is independent of the growth (with 
duration) of phenomenal brightness. In 
addition, it helps to explain why retroactive 
(metacontrast) masking of a flash does not 
affect its RT (Fehrer & Raab, 1962). 


REFERENCES 


BAUMGARDT, E., & HILLMANN, B. Duration and size
as determinants of peripheral retinal response. J.
Opt. Soc. Amer., 1961, 51, 340-344.

FEHRER, E., & RAAB, D. Reaction time to stimuli
masked by metacontrast. J. exp. Psychol., 1962, 63,
143-147.

GRAHAM, C. H., & KEMP, E. H. Brightness discrimi-
nation as a function of the duration of the increment
in intensity. J. gen. Physiol., 1938, 21, 635-650.

HARTLINE, H. K. Intensity and duration in the excita-
tion of single photoreceptor units. J. cell. comp.
Physiol., 1934, 5, 229-247.

RAAB, D., FEHRER, E., & HERSHENSON, M. Visual
reaction time and the Broca-Sulzer phenomenon.
J. exp. Psychol., 1961, 61, 193-199.

WOODWORTH, R. S., & SCHLOSBERG, H. Experimental
psychology. (Rev. ed.) New York: Holt, 1954.


(Received June 30, 1961) 


SUPPLEMENTARY REPORT: MEANINGFULNESS AS A DIFFERENTIATION VARIABLE
IN THE VON RESTORFF EFFECT ¹


HAROLD ROSEN, DONALD H. RICHARDSON, AND ELI SALTZ
Center for the Study of Cognitive Processes, Wayne State University 


The present study was designed to test 
an aspect of the theory for the von Restorff 
effect proposed by Saltz (1960). This theory 
defines a differentiation construct in terms of 
two variables: (a) Similarity between an item 
¹ This study is part of a program of research,
supported by a National Science Foundation grant to
Eli Saltz, concerned with the influence of differentiation
on verbal learning.


and other items in a list, and (b) amount of 
prior reinforcement (e.g., “familiarization”)
of the item. Isolation techniques typically 
involve differentiation of an item by reducing 
its similarity to other items in the list. How- 
ever, if all the items in a list are already highly 
differentiated, reduction of similarity would 
be expected to have a smaller effect than if 




TABLE 1

MEAN INTRALIST RANK AND MEAN NUMBER OF
CORRECT ANTICIPATIONS OF THE ISOLATED
AND CONTROL TERMS OVER 15
LEARNING TRIALS

                        Ranks           Correct Anticipations
List      Item          Mean    SD      Mean    SD
Low m     Isolated      4.53    2.24    5.73    3.53
          Control       6.53    1.62    2.12    2.95
High m    Isolated      5.00    1.63    7.70    3.94
          Control       5.45    1.89    6.48    3.75


the other items in the list were low in dif-
ferentiation.

This deduction was tested by comparing
the effects of isolation of an item in a serial
list of high-meaningfulness items with the
effects obtained by isolating an item in a serial
list of low-meaningful items, since Noble
(1953) has demonstrated that meaningfulness
is related to amount of prior experience with
an item. Isolation was accomplished by
typing one word of the list in red, all the
other words being typed in black.

Method.—The Ss were 132 students in
introductory psychology at Wayne State
University, and were randomly assigned to
conditions except that Ns of conditions were
made equal.

Two basic lists were used, each consisting
of nine items taken from Noble's (1952) mean-
ingfulness scale. One list consisted of low-
meaningful items (m range = 1.05 to 1.50):
MEARDON, BYSUSS, VOLVAP, LATUK, GOKEM,
POLEF, SAGROLE, WELKIN, NARES. The high-
meaningful items (m range = 7.39 to 9.61)
in a second list were: INSECT, JEWEL, HEAVEN,
OFFICE, WAGON, DINNER, MONEY, ARMY,
KITCHEN. The item in Position 5 was typed
in red and served as the isolate in the experi-
mental condition. All other items were typed
in black. The control lists were typed en-
tirely in black. Each of the two basic lists
was organized into six different random orders,
with a different term serving as isolate in each
order. An S learned one of these orders as a
serial list.

Items were exposed on a Lafayette memory
drum at a 2-sec. rate with a 4-sec. intertrial
interval. After an initial trial in which S
pronounced aloud the items in order of their
appearance, S was given 15 anticipation trials.

Results.—The relative effects of isolation
for high- and low-meaningful material were
evaluated by determining the effect of isola-
tion on each S's serial position curve. For
each S, the items in the list were ranked from
greatest number of correct anticipations in


15 trials (Rank 1) to least correct anticipa- 
tions (Rank 8). The mean ranks of the iso- 
lated and control items, summarized in Table 
1, indicate that isolation has a much greater 
relative effect in a list of low-m than a list 
of high-m items. The main effects due to 
isolation are significant beyond the .001 level 
(F = 14.36, df = 1/128) and the Isolation 
X Meaningfulness interaction is significant 
beyond the .025 level (F = 5.77, df = 1/128).
Table 1 shows that the interaction effect also 
occurs for mean number of correct anticipa- 
tions over 15 trials. However, the anticipa- 
tion data are less persuasive than the rank 
data, since the anticipation data could be an 
artifact due to the rapid learning of high-m 
items, both isolated and control. In brief,
the original prediction based on the differen-
tiation formulation is sustained.

Saltz and Newman (1959) found that iso- 
lation resulted in an increased tendency for 
the isolated term to be emitted as an intrusion 
response to other stimuli in the list. The
intrusion data in the present study were 
analyzed by means of a chi square test since 
the data were markedly skewed: the median 
number of intrusions was zero in the low-m 
control groups, and close to zero in the other 
groups. For low-m lists, 58% of the Ss in the 
isolated condition emitted the isolated term 
as an intrusion at least once during learning; 
only 33% of the control Ss emitted the control 
item. This difference is significant (P < .05).
The corresponding percentages in the high-m 
conditions are 58% and 67%, producing a
nonsignificant difference opposite in direction 
from that for low m. Saltz (1960) has hy- 
pothesized that differentiation increases the 
tendency for a response to be emitted, and 
that this tendency is basic to the relatively 
high intrusion rate for the isolated term. In 
terms of this position, the fact that the high-m 
isolation condition produces no greater 
tendency toward intrusions than the high-m 
control condition is consistent with the posi- 
tion stated previously in this paper: in a list 
of highly meaningful items, the terms are 
already relatively differentiated, and so 
isolation will contribute relatively little 
additional differentiation. 
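
The chi square comparisons of intrusion rates can be approximated from the percentages reported above. The sketch below is an illustration under stated assumptions, not a reproduction of the original computation: cell counts are reconstructed by assuming 33 Ss in each of the four conditions (132 Ss with equal Ns), no continuity correction is applied, and the function name is hypothetical.

```python
def chi_square_2x2(a, b, c, d):
    """Pearson chi square (1 df, no continuity correction) for the table
    [[a, b], [c, d]]."""
    n = a + b + c + d
    numerator = n * (a * d - b * c) ** 2
    denominator = (a + b) * (c + d) * (a + c) * (b + d)
    return numerator / denominator


N_PER_GROUP = 33        # ASSUMPTION: 132 Ss divided equally over 4 conditions
CRITICAL_05 = 3.841     # chi square value at the .05 level with 1 df


def counts(percent):
    """Number of Ss with and without an intrusion, reconstructed from a percentage."""
    with_intrusion = round(percent / 100 * N_PER_GROUP)
    return with_intrusion, N_PER_GROUP - with_intrusion


# Low-m lists: 58% (isolated) versus 33% (control).
chi_low = chi_square_2x2(*counts(58), *counts(33))

# High-m lists: 58% (isolated) versus 67% (control).
chi_high = chi_square_2x2(*counts(58), *counts(67))

print(round(chi_low, 3), chi_low > CRITICAL_05)
print(round(chi_high, 3), chi_high > CRITICAL_05)
```

Under these assumptions the low-m comparison exceeds the .05 critical value and the high-m comparison does not, which agrees with the pattern reported in the text.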


REFERENCES 


NOBLE, C. E. An analysis of meaning. Psychol. Rev.,
1952, 59, 421-430.

NOBLE, C. E. The meaning-familiarity relationship.
Psychol. Rev., 1953, 60, 89-98.

SALTZ, E. Similarity and differentiation in verbal
learning. Paper read at Psychonomics meeting,
Chicago, 1960.

SALTZ, E., & NEWMAN, S. E. The von Restorff isola-
tion effect: Test of the intralist association assump-
tion. J. exp. Psychol., 1959, 58, 445-451.


(Received August 1, 1961)


Journal of 


Experimental Psychology 


Vol. 64, No. 4


OCTOBER 1962 




USE OF TEMPERATURE STRESS WITH COOL AIR 
REINFORCEMENT FOR HUMAN 
OPERANT CONDITIONING ¹


GORDON L. PAUL, CHARLES W. ERIKSEN, AND LLOYD G. HUMPHREYS


University of Illinois 


The attempts that have been made 
to investigate operant conditioning in 
normal human Ss have dealt primarily 
with verbal behavior. The general 
approach has been to manipulate the 
frequency of usage of word classes 
using as a reinforcement various signs 
of social approval such as “umhm,” 
“good,” etc. However, Buchwald 
(1960) has raised doubts as to the 
effectiveness and unambiguity of these 
signs of social approval as reinforcers 
and other Es (Eriksen, 1960; Krieck- 
haus & Eriksen, 1960; Levin, 1961) 
have found that changes in S's verbal 
behavior do not convincingly occur in 
the absence of S’s ability to verbalize 
relevant intentions and hypotheses 
concerning relationships between the 
experimental variables and changes in 
his behavior. 

Attempts to obtain operant condi- 
tioning of nonverbal responses have 
been reported by Verplanck (1955, 
1956) and Hefferline, Keenan, and 
Harford (1959). However, Ver-
planck’s results have come under seri-
ous question as a result of an attempt
to replicate his procedures reported by
Azrin, Holz, Ulrich, and Goldiamond
(1961) and the results of Hefferline
et al. are difficult to interpret owing
to inadequate questioning of Ss for
relevant verbalizable hypotheses.

¹ This research was supported by Mental
Health Grant M-1206, National Institutes of
Health, Public Health Service.

Interpretation of these studies is 
also complicated by the failure of the 
investigators to include appropriate 
control groups. To show clearly that 
operant conditioning occurred, these 
studies would have required a control 
group, treated identically with the 
experimentals and receiving the same 
number of reinforcements with the 
exception that the reinforcements for 
the controls would have been ran- 
domly administered and not con- 
tingent upon any particular response. 
In the study reported below we 
have attempted to remedy the defi- 
ciencies in the previous attempts at 
operant conditioning of nonverbal 
responses in human Ss. By the use of 
a new technique we were able to ad- 
minister positive primary reinforce- 
ment to human Ss under conditions 
directly comparable to those obtaining 
in animal experimentation. Three 




different classes of motor responses 
were studied and particular care was 
taken to assess the extent to which 
behavior modification depended upon 
awareness as signified by S's ability 
to verbalize relevant mediational steps 
that intervened between the experi-
mental variables and his behavior.


METHOD 


education, and speech during the summer 
session at the University of Illinois.² As an
added inducement, 5 dollars in services at a
local beauty salon was offered volunteers.
Procedure.—Subjects were run individu- 
ally. After S had changed from street clothes 
to a playsuit or swimsuit she was told that 
this was an experiment on the effects of heat 
and isolation 
Further, that the study was government sup- 
ported and was being conducted as part of a 
“lady in space project.” The Ss were then 


They 
were then told that since this study was only 
concerned with psychological reactions, an


During this preliminary period E also
determined whether S had any physical
restriction which would be a basis for ex-
clusion from the study and then gave S a brief
legend what she would do for the rest of 


she entered the “space” chamber. After
riding the ergocycle S was immediately
directed into the chamber (maintained at a
temperature of 105° F. and relative humidity
of 85%). She was told to sit with her face
directly in front of an insulated pipe which
protruded into the chamber since cool air,
high in oxygen content, would be blown
through the pipe from time to time.

² Female Ss were used since several males
(run on a pilot basis), to demonstrate the
cultural stereotype of “masculinity,” refused
air although they greatly desired it. This
confounding was rare in females.

G. L. PAUL, C. W. ERIKSEN, AND L. G. HUMPHREYS

The chamber door was then closed and 
bolted. Further conversation was by means 
of an intercommunication system with S
being observed through a one-way mirror in
one porthole of the chamber. The S was told
that she should ask any questions she might 
have immediately following the instructions, 
since no communication could take place once 
the “sonar” signals had begun. A standard 
set of instructions was then read to S further 
explaining the “increasing” heat in the 
chamber. The S was told: “You will probably 
find the air most beneficial if you breathe it 
directly.” Also included were detailed in- 
structions on the performance of a pseudo 
task consisting of differentiating between 
sonar signals and forming patterns with 
colored washers on a peg board: “Make as 
many different designs as you can. If you 
drop a washer, replace it with one of ap-
propriate color from the stack on your right.
Completely fill one-half of the board and then 
work back to the other half. Continue this 
process until I tell you the time is up. Are 
there any questions?”

Approximately 5 min. elapsed from the 
time S first entered the chamber through 
completion of the instructions. Following the
instructions a time clock and tape recorder 
which presented the signals for the pseudo 
task were started. The E observed S for the 
following 5 min. and determined the specific 
response within the various response classes 
which was to be conditioned. 

The recording apparatus was then started 
and the initial rate of the operant was re-
corded by throwing a toggle switch each time
the selected response occurred. After 5 min.
of recording the operant level, a switch was
thrown which connected a programmer and 
timer into the circuit marking the beginning 
of the conditioning phase. During this phase,
which lasted for 35 min., 10 sec. of cool air 
(primary reinforcement) and a red light 
(secondary reinforcement) were automatically 
timed and administered on a programed 
schedule activated by the same switch which 
recorded responses. 

HUMAN OPERANT CONDITIONING

A decreasing ratio reinforcement schedule
was used in which the first 20 responses
received 1:1 reinforcement, the next 15 responses
received 2:3 reinforcement, the next
14 received 1:2 reinforcement, and the remaining
responses received 1:3. The latter
schedule applies to responses in which only
one program step could be made in every
10-sec. period. The record of responses on the
other hand was kept on each specific response
regardless of time interval with the exception
of single responses lasting for a period of more
than 10 sec. which were then recorded as two
responses. This procedure was adopted to
maximize unawareness.
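
The decreasing ratio schedule just described can be sketched in a few lines. This is only an illustration: the text gives the ratios (1:1, 2:3, 1:2, 1:3) and the block lengths (20, 15, 14, remainder) but not which responses within each ratio block were reinforced, so the sketch assumes the earliest ones were. The function name `is_reinforced` is hypothetical.

```python
# Phases of the decreasing ratio schedule as (number of program steps,
# reinforcements per cycle, responses per cycle), taken from the text.
PHASES = [(20, 1, 1), (15, 2, 3), (14, 1, 2)]
FINAL_RATIO = (1, 3)  # applies to all remaining program steps


def is_reinforced(step: int) -> bool:
    """Return True if the step-th program step (1-indexed) is reinforced.

    Assumption (not stated in the source): within each cycle of k
    responses, the first r responses are the reinforced ones.
    """
    if step < 1:
        raise ValueError("step must be >= 1")
    offset = step - 1
    for length, r, k in PHASES:
        if offset < length:
            return offset % k < r
        offset -= length
    r, k = FINAL_RATIO
    return offset % k < r


# Reinforcements delivered over the first 49 program steps: 20 + 10 + 7.
first_49 = sum(is_reinforced(s) for s in range(1, 50))
```

Under this assumption a subject who makes 49 program steps receives 37 reinforcements, after which one step in three is reinforced.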

Following the 35 min. of conditioning, the 
programmer and timer were disengaged and a
10-min. extinction phase was initiated in
which responses were recorded as above. 

Upon completion of the conditioning phase,
S was assisted out of the chamber and
escorted directly to a shower room, with 
instructions to return to the laboratory for a 
short interview. 

Control Ss were tested for each response 
class, receiving exactly the same treatment as 
experimental Ss with the exception of the 
conditioning phase. During this 35-min. 
period a technician operated a separate switch 
which activated the programmer and timer 
that delivered rewards. These rewards were 
administered in accordance with a master 
program which had been prepared from the 
average number of program reinforcements 
over time that had been delivered to the 
experimental Ss in the same response class. 
Thus the control Ss received essentially the 
same number of rewards at the same times as 
the experimental Ss but the rewards were not 
contingent upon any specific response of 
theirs. The record of responses was taken in 
the same manner for both experimental and 
control Ss. 
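
One way to picture the master program for the control Ss is as a per-minute average of the reinforcements delivered to the experimental Ss of the same response class. The sketch below is an assumption about the bookkeeping, not the authors' procedure: the 1-min. bin width, the input format, and the name `master_program` are all hypothetical.

```python
def master_program(reinforcement_times, session_minutes=35):
    """Average number of reinforcements per minute across experimental Ss.

    reinforcement_times: one list per experimental S, holding the onset
    time of each reinforcement in minutes from the start of conditioning
    (a hypothetical input format).
    """
    if not reinforcement_times:
        raise ValueError("need at least one experimental S")
    totals = [0] * session_minutes
    for times in reinforcement_times:
        for t in times:
            # Ignore anything outside the 35-min. conditioning phase.
            if 0 <= t < session_minutes:
                totals[int(t)] += 1
    n = len(reinforcement_times)
    return [total / n for total in totals]


# Two made-up experimental records, for illustration only.
example = master_program([[0.5, 0.9, 3.2], [0.1, 0.4, 3.9, 4.5]])
```

A technician working from such a program would deliver rewards at the averaged rate for each minute, independently of what the control S was doing.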

Upon S’s return from the shower room the 
following questions were asked and answers 
recorded verbatim: 


1. Would you tell me, in your own words, 
exactly what we're studying? 

2. Did you like or look forward to the
cool air and oxygen? Relatively how much?

3. Were you able to concentrate on the 
task the entire period? 

4. How well do you believe you did?

5. Do you feel you had control of the 
cool air at any time? How? When? 

6. Would you mind telling me specifically 
what you thought about when you were in 
the chamber. 

7. Did anything you did at any time
during the period have any influence upon
when the light or air came on? When?

8. If I were to tell you that something 
you did determined when the air came on 
what would you guess it might be? 

9. Would you describe any emotions you 
may have experienced during the period. 




Running notes of S's behavior in the 
chamber were kept, specifically noting actions 
correlated with the operant or onset of 
reward. A record was also kept of S’s per- 
formance on the pseudo task to provide some 
“feedback” and in combination with the 
buffer items in the postexperimental interview 
to maintain face validity of the “space” 
experiment. 

The reliability of E in recording responses
was determined for each of the three response 
classes by having 2 other Os, the environ- 
mental laboratory technician and a graduate 
student in psychology, simultaneously ob- 
serve Ss for an entire period and record the 
operant responses with paper and pencil by 
1-min. intervals. 

Three general categories of responses were 
studied: hand movements (Group H), face 
and mouth movements (Group FM), and foot 
movements (Group F). Subjects were ran- 
domly assigned to one of the response class 
groups with the specific response in this 
category that was to be reinforced (e.g., 
Group H, touch chin with right hand; Group 
FM, press lips; Group F, tap ball of right foot 
on deck) determined during the 5-min. 
observation period that preceded the de- 
termination of the operant level of the 
response. Twelve Ss were run in each re- 
sponse category with an additional 4 control 
Ss for each category. In the course of 
experimentation an additional 4 Ss were 
excluded: 2 because the experimental period 
had to be terminated before completion; 1 be-
cause she did not like the reinforcement; and
1 who immediately became aware of the 
contingency of the reinforcement and her 
response and stopped responding because she 
“thought it would ruin the experiment.” No 
Ss were initially aware of the true nature of 
the study. 

Apparatus.—The experiment was con-
ducted in the Physical Environment Lab-
oratory at the University of Illinois. The 
laboratory housing all equipment was main- 
tained at a temperature of 70° F. and 35% 
relative humidity.³

The ergocycle which Ss rode prior to
entrance into the chamber was the Illinois
electrodynamic bicycle ergometer with an
armature load of 2400 ft-lb at 50 rpm. The
chamber was a 6-man low pressure chamber,
soundproof and cork insulated, with cylindrical
external dimensions of 12 × 7 ft., built for
the United States Air Force by the Pittsburgh
Des Moines Steel Company. The
internal compartment was a 6-ft. sphere.
The environmental controls were of the
Johnson Service Company. Conditions inside
the chamber were maintained at a temperature
of 105° F. (±2°) and relative
humidity of 85% (±5%). The air
velocity within the chamber determined by
an Anemotherm air meter varied from 10 to 18
ft. per min. at different points.

³ Lawrence Siler, technician in charge of
the Environmental Unit, maintained a
vigilant watch on all equipment used, assuring
constant environmental conditions throughout
the experiment.

TABLE 1

RELIABILITY COEFFICIENTS BETWEEN E
AND 2 Os BASED ON SUCCESSIVE
5-MIN. PERIODS

Group     E-O1     E-O2     O1-O2
H         .999     .998     .999
FM        .947     .915     .947
F         .972     .923     .965

Rewards were administered through a 1-in.
insulated pipe, 4 ft. long. Twin blowers drew
in air from the laboratory (70° F. and 35%
humidity) and blew it through the pipe,
emerging at S's face at 78° F. with a velocity
of 710 ft. per min. Operant responses were
recorded on a Hunter stylus recorder which
was wired directly to the control switch.
The mechanism which timed and administered
rewards, also wired in series to the control
switch, included a Hunter Model III electronic
timer and a Ridgely automatic programmer.
A 25-w. red light inside the chamber
was wired to operate simultaneously with
the activation of the blowers and to serve as a
secondary reinforcer filling the short time
interval between activation of the blowers
and arrival of the cool air on S (less than 1
sec.).
The apparatus for the pseudo task con-
sisted of a 1 × 3 ft. pressed wood board half
of which was painted white with 42 bolts 
mounted on its face. The bolts contained 


The tape was presented over a
tape recorder which was wired into the
intercommunication system with which the
chamber was equipped.


RESULTS 


Before turning to the results the
reliability of E in recording the occurrence
of responses needs to be
documented. Table 1 presents the
product-moment coefficients obtained
between 2 independent Os and E.
These figures were based on a single S
from each experimental group with
the total responses in successive 5-
min. periods as the basic unit.

On the basis of Ss’ verbalizations 
during the postexperimental inter- 
view, they were classified into aware 
and unaware subgroups within each 
response category immediately upon 
completion of questioning. Those Ss 
were classified as aware who gave an 
affirmative answer to Questions 5, 7, 
or 8 in the postexperimental interview 
and who could further state the 
correct contingency between the oper- 
ant response and the reward, or name 
a response the occurrence of which 
would have led to an increase in the 
operant. In the case of these latter
correlated hypotheses the accuracy of 
S’s verbalization was checked against 
the running notes that had been made 
of her behavior during the experiment. 
Subjects who gave negative answers 
to these three questions or who could 
not state the correct or correlated 
contingency were classified as un- 
aware. Table 2 shows the number of 
aware Ss in each of the three response 
groups and also the question of the 
three listed which elicited the ver- 
balization of the correct or correlated 
contingency.
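
The classification rule stated above can be restated as a small decision function. The sketch is only a paraphrase of the rule in code; the record layout and the names `Interview` and `classify` are hypothetical, and the check of correlated hypotheses against the running notes is represented by a single flag.

```python
from dataclasses import dataclass, field


@dataclass
class Interview:
    """Postexperimental interview record for one S (hypothetical layout)."""
    # Question number -> True if S answered affirmatively (Questions 5, 7, 8).
    affirmative: dict = field(default_factory=dict)
    # S could state the correct response-reward contingency.
    states_correct_contingency: bool = False
    # S named a correlated response, confirmed against the running notes.
    states_correlated_response: bool = False


def classify(interview: Interview) -> str:
    """Return 'aware' or 'unaware' following the rule given in the text."""
    said_yes = any(interview.affirmative.get(q, False) for q in (5, 7, 8))
    can_state = (interview.states_correct_contingency
                 or interview.states_correlated_response)
    return "aware" if said_yes and can_state else "unaware"


# An S who answers yes to Question 7 and names the correct contingency.
example = classify(Interview({5: False, 7: True, 8: False}, True, False))
```

Both conditions must hold: an affirmative answer without a statable contingency leaves S in the unaware subgroup.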


TABLE 2

NUMBER OF AWARE Ss IN EACH GROUP
DESIGNATED BY THE QUESTION
ELICITING EVIDENCE OF
AWARENESS

               Question
Group      5       7       8
H          3       2       1
FM         1       2       4
F          0       1       2




Evidence of conditioning was eval- 
uated separately within each of the 
three response groups by a simple 
analysis of covariance (aware, un- 
aware, and control subgroups) using 
the average number of responses 
during the last 15 min. of the condi- 
tioning phase for each subgroup ad- 
justed according to the average num- 
ber of responses occurring during the 
operant period. The summary of 
these covariance analyses is given in 
Table 3. The adjusted and unad- 
justed means are presented in Table 4. 

These analyses indicate that condi- 
tioning was obtained in Groups H and 
FM but, as seen in Table 4, apparently 
only in Ss who became aware of the 
contingency between their response 
and the reward. 

Curves of the average response rate
per minute over successive 5-min.
periods are shown in Fig. 1 for aware,
unaware, and control Ss within each
of the response groups. These plots
reveal, without adjustment for operant
level, that the two groups demonstrating
conditioning show a steady
increase of response for the aware Ss
during conditioning and a decline
during extinction. No such change is
present for the unaware Ss or the
controls.

TABLE 3

COVARIANCE ANALYSES ON THE MEAN
RESPONSES DURING THE LAST 15
MIN. OF CONDITIONING ADJUSTED
TO OPERANT LEVEL

Group   Source       df       MS        F
H       Subgroups     2     446.9     4.06*
        Error        12     110.1
FM      Subgroups     2    1463.7     7.93**
        Error        12     184.5
F       Subgroups     2      43.7      .18
        Error        12     237.5

* P < .05.
** P < .01.
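
The F ratios in Table 3 follow directly from the tabled mean squares, since each F is the subgroup mean square divided by the error mean square. The short check below recomputes them from the values printed in the table; the variable names are illustrative only.

```python
# Mean squares from Table 3 as (subgroups MS, error MS) for each group.
TABLE_3 = {
    "H": (446.9, 110.1),
    "FM": (1463.7, 184.5),
    "F": (43.7, 237.5),
}


def f_ratio(ms_effect: float, ms_error: float) -> float:
    """F is the ratio of the effect mean square to the error mean square."""
    return ms_effect / ms_error


# Rounded to two decimals, as reported in the table.
ratios = {group: round(f_ratio(*ms), 2) for group, ms in TABLE_3.items()}
```

With 2 and 12 df, only the ratios for Groups H and FM reach the significance levels marked in the table.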

There was some concern that variations
in the daily environment of Ss
might have some effect on the results
since the value of the reward depended
for the most part upon the effect of
heat and humidity within the chamber.
Temperature and humidity data
for the hour on which each S arrived
for the experiment were obtained from
the Illinois State Climatologist. The
ranges of the mean temperature and
humidity for each subgroup were
79.5-83.25° F. and 43.0-52.25%, respectively.
There appeared to be no
differences attributable to external
environmental conditions over these
slight ranges.

TABLE 4

MEAN OPERANT LEVEL AND MEAN AND ADJUSTED MEAN RESPONSES
DURING LAST 15 MIN. OF CONDITIONING

                           Operant Level     Last 15 Min. of Conditioning
                                               Obtained        Adjusted
Group   Subgroup    N      Mean     SD       Mean     SD         Mean
H       Aware       6       5.2    1.14      30.7    12.09       29.9
        Unaware     6       4.8    2.19      15.7     8.84       15.8
        Control     4       4.5    2.69      11.8    10.92       12.8
FM      Aware       7       1.4    1.99      53.1    14.50       55.1
        Unaware     5       7.8    2.11      28.0    18.86       28.2
        Control     4       8.8    3.17      31.0    20.23       26.8
F       Aware       3      14.3    7.41      48.7    17.99       44.9
        Unaware     9       8.4    5.87      44.2    14.02       45.6
        Control     4      10.2    2.48      40.2    11.09       40.0

FIG. 1. Mean responses per minute over
successive 5-min. periods. (The times shown
on the abscissa represent midpoints of the
successive 5-min. intervals from completion of
instructions; see text.)

The success of this technique and 
the apparatus used was vouched for 
by all Ss save one. In response to 
Question 2, in which Ss were asked 
how well they liked the cool air, 
responses such as: “Very, very much”; 
“It was ecstasy”; “Desperately”; 
“Like a Manhattan”; “Couldn't have 
survived without it” were typical. In 
pilot studies in which Ss were in- 
formed as to how they could obtain 
the cool air, some Ss gave as many as 
eight responses per min. when six
responses per min. were sufficient to
maintain a constant supply of cool air.
It was not uncommon for Ss to verbally
request air, particularly during
the extinction period, and E suffered
some verbal abuse from Ss in all
groups when these requests were not
fulfilled.


DISCUSSION


The results obtained in Groups H and 
FM require little interpretation. When 
plotted cumulatively, the extinction 
curves of the aware Ss still show positive 
although reduced slope. This is not 
surprising since Ss had been on a partial 
reinforcement schedule just prior to the 
extinction phase and the strength of the 
reward was great. Some confounding 
may also be present, since the controls 
in all three groups increased in response 
rate during extinction. Also to be noted 
is that operant levels did change as a 
part of a general change in activity 
during the period, exclusive of experi- 
mental manipulation, Fig. 1. The reason 
that Ss in Group H were generally lower 
in response rate may be due to the fact 
that the pseudo task required those Ss 
to be using their hands constantly. 

The results of this study give added 
support to the increasing evidence
against “learning without awareness.”
Only those Ss who could verbalize a 
contingent relation between their be- 
havior and the reinforcement demon- 
strated learning. Since learning did 
occur for these Ss it would appear that 
the reinforcement was effective. Fur- 
ther, the fact that somewhat less than 
50% of the Ss became aware of the 
contingency would suggest that the task 
allowed room for “learning without 
awareness” had such a process been 
operative.

A finding that strikes us as quite 
remarkable was that over half of the Ss
did not learn. Results such as this are 
not surprising in studies utilizing verbal 
reinforcements where the “rewards” are 
secondary and have constituted a part 
of the general social background of Ss 
for many years. However, in the present 
case the reward was primary and dra- 
matic and gained the attention of all Ss 
to such a degree that it seems surprising
that humans could spend 35 min. with
the occurrence of this reward contingent 
upon their behavior and not learn the 
contingency. 

A factor that may at least partially 
account for Ss not learning or becoming 
aware of the contingency of the reward 
upon their behavior is the high operant 
level of the responses selected for con- 
ditioning. This is particularly true in 
Group F where only 3 Ss became aware 
of the contingency. Since the reward 
lasted for 10 sec., most Ss in this group 
would receive nearly continuous rein- 
forcement by merely maintaining their 
operant level. In fact Table 4 demon- 
strates that the mean number of re- 
sponses of the control Ss in Group F was 
greater than that of the aware Ss in 
Group H. 

The data forcefully demonstrate the 
need for appropriate control Ss in work 
on operant conditioning since it is 
obvious that parameters other than the 
experimentally controlled reinforcement 
exert an influence on Ss’ behavior. Also, 
the thorough exploration of Ss’ verbalized 
hypotheses and other behavior is shown 
to be an absolute requirement in deter- 
mining awareness. Only 4 Ss of 16 who 
were ultimately classified as aware 
volunteered the correct or correlated 
contingency in response to the initial 
question.

It appears that the technique utilized 
in this study offers a means to replicate 
on humans the behavioral changes found 
in subhuman animals. The high tem- 
perature and humidity, supplemented by 
appropriate instructions, allows for the 
manipulation of a reinforcement which 
definitely seems comparable in reward 
value to those administered to deprived 
animal Ss. Isolation within the sound- 
proof chamber also affords a more com- 
parable situation, and one in which most 
stringent environmental and task con- 
trols are built in. Some limitations are 
placed on a complete survey of operant 
conditioning phenomena owing to the 
duration of the reward (determined in 
pilot work as the optimum); however,
this limitation appears to be slight when
compared with the ease of administration
and overall time saved by the procedure. 


SUMMARY 


This study was concerned with a technique 
for producing operant conditioning of human 
motor responses with special emphasis on 
conditioning without awareness. Female Ss 
who thought they were participating in a 
“space” experiment were enclosed in a 
chamber at 105° F. and relative humidity 
of 85%. A 10-sec. stream of cool air served 
as a reward for operant responses. Two of the
three experimental groups (designated by 
response class) demonstrated conditioning 
and intensive investigation of Ss’ awareness 
revealed that conditioning was apparent only 
in those Ss who were able to verbalize 
mediational steps that intervened between 
the experimental variables and the changes in 
their behavior. 


REFERENCES 


AZRIN, N. H., HOLZ, W., ULRICH, R., &
GOLDIAMOND, I. The control of the content
of conversation through reinforcement.
J. exp. Anal. Behav., 1961, 4, 25-30.

BUCHWALD, A. M. Supplementary report:
Alteration in the reinforcement value of a
positive reinforcer. J. exp. Psychol., 1960,
60, 416-417.

ERIKSEN, C. W. Discrimination and learning
without awareness: A methodological survey
and evaluation. Psychol. Rev., 1960,
67, 279-300.

HEFFERLINE, R. F., KEENAN, B., & HARFORD,
R. A. Escape and avoidance conditioning
in human subjects without their observation
of the response. Science, 1959, 130,
1338-1339.

KRIECKHAUS, E. E., & ERIKSEN, C. W. A
study of awareness and its effects on
learning and generalization. J. Pers.,
1960, 28, 503-517.

LEVIN, S. M. The effects of awareness on
verbal conditioning. J. exp. Psychol.,
1961, 61, 67-75.

VERPLANCK, W. S. The control of the content
of conversation: Reinforcement of
statements of opinion. J. abnorm. soc.
Psychol., 1955, 51, 668-676.

VERPLANCK, W. S. The operant conditioning
of human motor behavior. Psychol. Bull.,
1956, 53, 70-83.


(Received September 20, 1961) 


Journal of Experimental Psychology 
1962, Vol. 64, No. 4, 336-345 


SIMULTANEOUS CONTRAST AS A FUNCTION OF 
TEST-FIELD AREA¹


A. LEONARD DIAMOND 
Psychological Research Center, University of Hawaii 


The phenomenon of simultaneous 
contrast is illustrated in the change in 
brightness of a visual field without a 
corresponding change in the lumin- 
ance of that field. That is, if next to 
a small illuminated square, which we 
shall call the test field, we place an- 
other illuminated square of much 
greater luminance, which we shall 
term the inducing field, we will per- 
ceive a decrease in the brightness of 
the test-field square even though the 
test field is held constant in luminance. 

A number of parameters of this phe- 
nomenon have been investigated in 
previous experiments. The lumin- 
ance of the test and inducing fields 
(Diamond, 1953; Heinemann, 1955), 
the separation between test and induc- 
ing fields (Fry & Alpern, 1953; 
Leibowitz, Mote, & Thurlow, 1953), 
and the area of the inducing field 
(Diamond, 1955) have all been found 
to be pertinent variables in the con- 
trast effect. 

It is now of interest to know how 
the variation of the test-field area will 
affect simultaneous contrast. Specif- 
ically, a theoretical formulation (Dia- 
mond, 1960; also see Discussion 
below), which describes the relation- 
ships between the above-mentioned 
parameters, would necessarily predict 
little or no change in test-field bright- 
ness as the test-field area is varied in 
such a way that its center remains at 
a constant distance from the center of 
the inducing field. The present experiment
is designed to test this prediction.

¹ This work was supported by a research
grant NSF-G9588 from the National Science
Foundation.

Our general method is as follows 
(see Fig. 1): To S's right eye is 
presented an inducing field (i), a 
rectangle twice as wide as it is long; 
below the inducing field is the test field (t),
equally as wide as the inducing field 
but variable in height, and thus in 
area. To S’s left eye is presented the 
match field (m), which is either kept 
equal in size to the test field, as the 
test-field area is varied, or is held 
constant in size. (As we shall see, 
whether we vary the match-field size 
or hold it constant makes little differ- 
ence in the results.) As the test-field 
area is varied, the distance between 
the centers of the test and inducing 
fields is held constant. The brightness 
of the test field is thus measured as a 
function of its area for three different 
values of test-field luminance (.69, 
1.60, and 2.68 log mL.) over a wide 
range of inducing-field luminances 
(−∞ to 2.79 log mL.).


METHOD 
Apparatus 


Description of S's view.—A modification of
the apparatus used by Diamond (1955) is
employed in the present experiment. The
patterns seen by S are different for each eye
(see Fig. 1).

The pattern, R, to the right eye only,
includes: (a) a test-field rectangle (t), 33’ in
visual angle along the horizontal, and variable
in its vertical extent from zero to 33’; (b)
above the test field with its center held at a
constant distance from the center of the test
field, an inducing-field rectangle (i), also 33’
wide but with a fixed vertical dimension of
16.5’; and (c) a small fixation point (P)
located 21’ to the left of the test field. The
second pattern, L, presented to the left eye
only, includes: (a) a match-field rectangle (m),
33’ wide and variable in height; and (b) a small
fixation point (P) located 21’ to the right of
the match field.

In the binocular view, S is instructed to 
fuse left- and right-eye fixation points into the 
point P so that the match, test, and inducing 
fields are held in constant position relative to 
one another. Stimulation may be considered 
essentially foveal since the visual angle be- 
tween the fused fixation point and furthest 
corner of any field is never more than 70’. 

In order to make it easier for S to fuse the 
left- and right-eye fixation points, a circular 
prism, the refracting angle of which is 5°, is 
placed in each eyepiece of the apparatus. By 
rotating each eyepiece, one clockwise and the 
other counterclockwise, S can optically rotate 
both left- and right-eye patterns such that he 
can vary the horizontal separation between 
them and thus more easily superimpose (fuse) 
the left- and right-eye fixation points. No 
systematic changes in the brightness of either 
right- or left-eye patterns occurred as a result 
of prism rotation. 

Apparatus controls.—The apparatus is
designed to control the following variables: 
(a) the luminances of the test, inducing, and 
match fields; and (b) the areas of the test and 
match fields. Luminance controls are dis- 
cussed in detail by Diamond (1955). The 
general arrangement is as follows: The right- 
and left-eye patterns (L and R in Fig. 1) are 
presented to S along two separate optical 
paths, one to each eye. The light source of 
each path is a 150-w. tungsten filament 
projection lamp, in front of which is a section 
of heat-absorbing glass. The light, diffused 
by flashed opal glass, travels through its 
particular pattern (L or R). The luminance 
in either path can be continuously varied by 
fixed filters and a fixed and movable Polaroid. 
Made parallel by 4-diopter lenses, the light 
finally travels through 3-mm. artificial pupils 
into the eyes of S who is seated in a light-tight 
cubicle. The entire left optical path is 
adjustable horizontally for interpupillary 
distance by means of a screw arrangement
attached to the optical bench. 

The luminances of the match, test, and
inducing fields are calibrated by means of 
binocular matches to fields of similar shapes 
and areas and whose luminances had been 
determined by a MacBeth illuminometer. 

FIG. 1. The S's binocular view to left (L)
and right (R) eyes. (The match field, m,
was held either at its maximum of 33 min.
or equal in area to the variable test field, t.)

Test-field area is controlled by the use of
six thin metal masks. In each of these masks
is cut a test-field rectangle of a particular
vertical dimension (5.5’, 11.0’, 16.5’, 22.0’,
27.5’, and 33.0’ in visual angle) and constant
horizontal dimension (33.0’). An inducing-field
rectangle of constant dimensions
(16.5’ × 33.0’) is also cut in each mask such
that its center is always at the same distance
(24.75’) from the test-field center. This is
illustrated as Pattern R in Fig. 1.
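
Holding the center-to-center distance fixed means that the gap between the facing edges of the two fields shrinks as the test field grows. The sketch below works this out from the dimensions given in the text; it assumes the six masks are spaced in equal 5.5’ steps from 5.5’ to 33’, and the name `edge_gap` is illustrative only.

```python
INDUCING_HEIGHT = 16.5   # vertical dimension of the inducing field, min. of arc
CENTER_DISTANCE = 24.75  # distance between field centers, min. of arc

# Assumed: six test-field heights in equal 5.5' steps from 5.5' to 33'.
TEST_HEIGHTS = [5.5 * k for k in range(1, 7)]


def edge_gap(test_height: float) -> float:
    """Vertical gap between the facing edges of the inducing and test fields."""
    return CENTER_DISTANCE - INDUCING_HEIGHT / 2 - test_height / 2


# Gap in min. of arc for each mask, keyed by test-field height.
gaps = {height: edge_gap(height) for height in TEST_HEIGHTS}
```

On these figures the gap is 13.75’ for the smallest test field and zero for the largest, so the 33’ test field just touches the inducing field.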

Match-field area is controlled as follows: 
A thin metal mask, in which is cut a 33’ 
square, is placed before the diffused light 
source (in the left optical path). Another 
thin piece of straight-edged metal is mounted 
immediately next to the square mask such 
that when the straight edge of this second 
piece of thin metal coincides with the top edge 
of the match field, the match-field area is zero. 
When the straight-edged metal is drawn down 
away from the top of the match field, the 
match-field area increases in successive rec- 
tangular increments. In this way, the match 
field can be set equal in area to any of the six 
test-field areas. 


Procedure 


The experimental method was designed to 
investigate the brightness of the test field as a 
function of its area; the area was varied in six 
steps from 5.5’ to 33’ (vertical dimension). 
This entire function was studied for three 
test-field luminances, .69, 1.60, and 2.68 log 
mL., and for various inducing-field luminances 
as specified in Table 1. 

It was desirable to determine whether 
variation of the match-field area would affect 
the results in any systematic way. Therefore, 
in addition to the variation of the afore- 
mentioned parameters, two conditions of 
match-field area were explored. That is, the 
match-field area was (a) held equal to that of 
the test field as the test-field area was varied 
and (b) held constant at its maximum value, 
i.e., 33’ square for all values of test-field area.
This last condition prevailed for two inducing-field
luminances: zero luminance (−∞ log
mL.), and a luminance approximately equal
to that of the test field, whatever it happened
to be (.69, 1.60, or 2.68 log mL.).

During each experimental session, which 
required between 30 and 60 min., test-field area
and luminance were held at constant values 
for that session, and both match-field area and 
inducing-field luminance were varied. The 
various match-field sizes or inducing-field 
luminances explored during one session were 
counterbalanced. Intersession periods were 
never shorter than 4 hr. Successive test-field 
areas and test-field luminances were taken in 
different sessions and were presented in 
random order. 

Psychophysical method.—Seated in the 
light-tight cubicle, S initially dark adapted 
for 3 min., then light adapted for 3 min. to the 
binocular view, i.e., at a test-field luminance 
and a set area for a particular experimental 
session, the match field at the same brightness 
as the test field, the inducing field at a 
luminance set for a particular experimental 
point. The S then began making brightness- 
equality matches following the psychophysical 
method of adjustments as described by Guil- 
ford (1936). That is, S set the luminance of 
the match field so that it appeared equal to
that of the test field. The E then changed
the luminance of the match field in a random 
manner after each match. The S had to 
adjust the match-field luminance again until 
it seemed equal in brightness to that of the 
test field. In this manner, for each experi- 
mental point as indicated in Table 1, S made 
10 matches. The average of these 10 match- 
field luminances was taken as the brightness 
of the test field for each experimental point. 
This procedure was followed for 2 Ss, JS 
and RH. 


RESULTS 


The data are presented for each S
in Table 1, and averaged in Fig. 2.
Figure 2 shows the log luminance of
the match field (Bₘ) plotted as a
function of the test-field area (as
measured by its vertical dimension).
Log Bₘ may be termed the brightness
of the test field since its value is based
upon an equality judgment between
the match- and test-field brightnesses.
The function is graphed for different
values of test-field luminances (log Bₜ),

TABLE 1 


LOG MATCH-FIELD LUMINANCES (mL.) FOR DIFFERENT TEST AREA
AND INDUCING AND TEST LUMINANCES 


Test Height (Min.) 


Test | Log Ind. 
Luminance Lum ro ie kk, NS Loa DR ae a. 
(mL.) (mL.) f 11.0 à 5 a ce 
JS | RH 
=% A Eh 2 
— On 52 
0.69 0.69 64 
0.702 59 
1.70 46 
2.68 .23 
—% 38 1.31 
ent | 1, AT 
1.60 1.60 31 1.21 
1.69," | 1.42 1.33 
2.23 33 1.25 
2.72 02 99 
-%0 2.19 |2.84 |2.48 
=," | 2.59 |2.99 |2.80 ` 
2.68 | —0.32 | 2.20 [2.74 12.60 $ 
1.22 | 2.35 |2.83 |2.50 i 
2.68 | 2.30 |2.75 |2.47 i 
2.79m" | 2.64 |2.94 |2.42 .80 
test 


* The subscript m indicates a constant maximum match-field area for all


SIMULTANEOUS CONTRAST 339 


[Figure 2 plot: averaged data for 2 Ss in two panels (maximum match field; variable match field). Ordinate: log $B_m$ (mL.); abscissa: height of test field (min.). Separate curves for log $B_t$ = .69, 1.60, and 2.68 mL. at several values of log $B_i$ (mL.).]

FIG. 2. Test brightness as a function of test area for different test
and inducing luminances.


and for different inducing-field lumin-
ances (log $B_i$) and match-field areas.

The effect of test-field area upon 
test-field brightness is minimal. 
There seems to be little or no change 
in the brightness of the test field as its 
area is increased; this holds through- 


out the test-field and inducing-field 
luminance range explored and for both 
conditions of match-field area. In- 
ducing luminance, however, does have 
an effect as seen in previous experi- 
ments (Diamond, 1953, 1955). The 
greater the inducing luminance, 


[Figure 3 diagram: inducing-field "on" fibers, discharge-field "off" fibers, and test-field "on" fibers.]

FIG. 3. Diagram of inhibitory interactions
between test and inducing on fibers and
spontaneously discharging off fibers. (Ar-
rows show directions of main inhibitory
effects. Circles represent any fiber or fibers
in each respective field.)


greater than equality with test lumi- 
nance, the lower the brightness of the 
test field, or the greater the depression 
of test brightness. 

The lines drawn through the experimental
points are theoretical curves, which
can now be discussed.


DISCUSSION


The theory fit to the above data was
originally devised to explain not only the
phenomenon of depression seen in the
present results (and its change, or lack of
change, with test-area change) but also
the phenomenon of enhancement (see
Diamond, 1960). Under certain condi-
tions test brightness can increase, or be
enhanced, especially if the test field is
surrounded by a less bright inducing or
surround field. Why enhancement did not
occur in the present experimental situation
will be explained after the basic
physiological assumptions of the theory are
summarized. In Fig. 3 are diagramed the main
or primary physiological events that, according
to our theory, are basic to psychophysical
depression and enhancement.

For depression of test brightness, the 
test field “on” fibers (which presumably 
mediate psychophysical brightness) are 
inhibited by (a) the inducing on fibers, 
(b) other test on fibers, and (c) the 
spontaneously discharging off fibers. 




For enhancement of test brightness we 
must first note something not indicated 
in Fig. 3; i.e., off fibers exist within the 
test and inducing fields as well as the 
discharge field. Now to explain en- 
hancement in a multiple-field situation; 
i.e., one in which an inducing field as well 
as a test field is present, we must assume 
that when the inducing-field luminance is 
zero, the off fibers immediately surround- 
ing the test field are normally active. 
As the inducing field now increases from 
zero to some value below test-field lumin- 
ance the off fiber activity within the 
inducing-field borders becomes dimin- 
ished. Enhancement in the multiple- 
field situation is therefore explained as a 
“disinhibition” of test-field activity; i.e.,
inhibition by the inducing field of dis-
charge activity releases the test field from
discharge inhibition.

This, of course, is a verbal description 
of the theory behind the explanation of 
Fig. 2 data. The curves drawn through 
the data are mathematically determined. 
The mathematical function fitted to the
data is based upon preliminary assump- 
tions which we must first discuss. A 
more complete and detailed discussion of 
the assumptions is available in Diamond 
(1960). 

Our theory then assumes a number of 
physiological events, concerning both on 
and off fibers in the retina when it is 
illuminated by the test circle. Most of 
these assumptions are based upon physio- 
logical findings in animals and are as 
follows. 


On Fiber Frequency 


The frequency $f_t$ of a stimulated on
retinal fiber (as described by Hartline, 
1938): 


1. directly determines the strength of the 
brightness response, or 


$A_t = k_1 f_t$ [1]


where $A_t$ represents the brightness response,
$k_1$ is a proportionality constant, and $f_t$ is
the frequency of the on fiber. This rela-
tionship has been suggested by Adrian's
(1928) demonstration of the similarity be-
tween the brightness-duration curves in
man and the frequency-duration curves
taken from the optic nerve of the eel.

2. is directly proportional to a power 
function of the luminance of the stimulating 
light, as suggested in Diamond's (1960) fit 
of Hartline and Graham's (1932) Limulus 
data, or 


$f_t = k_2 B_t^{a}$ [2]


where $B_t$ is the luminance of light striking
an on fiber.

3. is inversely proportional to the fre- 
quency of a nearby on fiber, as demon- 
strated by Hartline and Ratliff (1957) in 
the Limulus, or 


Ja ere [3] 


where fi is the frequency of a nearby on 
fiber within the test field. Since according 
to Equation 2 above frequency is propor- 
tional to luminance then 
ky 
fy = Ba 4] 
when By, is the luminance striking the near- 
by on fiber. Within the test field this is 
equal to Ba. 

4. is inversely proportional to the number 
of nearby on fibers as demonstrated by 
Hartline, Wagner, and Ratliff (1956) in the 
Limulus, or 


Aas [5] 


where E;, represents the total number of 
nearby on fibers within the test field. 

5. is directly proportional to the distance 
between the test and nearby on fibers, as 
demonstrated by Hartline, Wagner, and 
Ratliff (1956), or 


ty = korne [6] 


where rı, s is the separation between the two 
on fibers in the test field. It becomes 
convenient and actually desirable, as 
pointed out by Diamond (1955), to combine 
Equations 5 and 6 such that 


Cum = [7] 


where F: + is the average separation between 
all test fibers and Crs: therefore describes 
the combined effect of all the individual 
nearby on fibers; it then follows that 


kni be 
Ía Be Cai [8] 


Equations 2 through 8 are also applicable 
to the interaction between a test on fiber 
and inducing on fiber (see Fig. 3). Thus 


maT, [9] 


where Cj,. represents the combined effect 
of all on fibers from the inducing field upon 
those in the test field, 

6. is inversely proportional to the fre- 
quency of a nearby off fiber, as suggested 
in Granit’s (1955) descriptions of the 
mutual antagonism between on and off 
fibers, or 


Shim ae [10] 


where fe is the frequency of an off fiber. 

7. is inversely proportional to the number 
of nearby off fibers at particular distances 
away or 


Le era (11) 


where Ca, is the combined effect of all the 
off fibers in the discharge field on the on 
fiber frequency in the test field. With 
respect to the frequency of an on retinal 
fiber in the test field, Equations 1 through 
11 may be combined into the following 
formula: 
K.Bè 

A E KK Cret Edea 74 
The subscripts of the proportionality con- 
stants (K) are chosen to coincide with 


those used in the more general brightness 
theory by Diamond (1960). 
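
As a rough illustration of the first two assumptions only (brightness response proportional to on-fiber frequency, and frequency proportional to a power of the stimulating luminance), the following Python sketch composes them. The constants `k1` and `k2` and the exponent `a` are placeholder values, not the fitted constants of the theory, and the inhibitory terms of assumptions 3 through 7 are deliberately left out.

```python
def on_fiber_frequency(luminance, k2=1.0, a=0.5):
    """Assumption 2: frequency is proportional to a power of luminance.

    k2 and a are hypothetical placeholders, not fitted values.
    """
    if luminance < 0:
        raise ValueError("luminance must be non-negative")
    return k2 * luminance ** a


def brightness_response(frequency, k1=1.0):
    """Assumption 1: brightness response is proportional to frequency."""
    return k1 * frequency


def uninhibited_brightness(luminance, k1=1.0, k2=1.0, a=0.5):
    """Compose assumptions 1 and 2, ignoring every inhibitory term."""
    return brightness_response(on_fiber_frequency(luminance, k2, a), k1)
```

Under these two assumptions alone, log response plotted against log luminance is a straight line with slope `a`; the inhibitory terms are what bend the response away from that line.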


Off Fiber Frequency 


The frequency, fo, of an off fiber in the 
discharge field, which we assume to be 
spontaneously active in the dark accord- 
ing to data taken from the cat eye by 
Barlow, FitzHugh, and Kuffler (1954):


1. is directly proportional to some “in- 
ternal driving force” (comparable to the 
external luminance effect on on fibers) 
which we shall assume to exist. Such a 
mechanism is suggested by experiments, 
described by Granit (1955), which show 
both on and off activity to increase in the 
retina as a result of central (reticular 
formation) stimulation. Thus 


fo = kuD [13] 


where D represents this “driving force.” 




2. is inversely proportional to the fre- 
quency of a nearby on test fiber, according 
to Granit’s on-off antagonism findings cited 
above, or 

kis 
fo = E 
3. is inversely proportional to number 


and distances away of nearby on test fibers, 
or 


[14] 


[15] 


where Cra is the combined effect of all the 
on fibers in the test field upon the off fibers 
in the discharge field. This assumption has 
not been tested experimentally.

4. is inversely proportional to the amount 
of light impinging upon the off fiber, as 
demonstrated by Hartline (1938). This 
light could be direct or scattered from the 
test beam stimulating the on fibers. The
retinal effectiveness of scattered light in the 
human eye has been demonstrated by 
Boynton and Riggs (1951). Thus, 


f= 3, [16] 
With respect to the frequency of an off 
fiber in the retinal discharge field, therefore, 


Equations 13 through 16 may be com-
bined into the following formula:


K;pt 
KiBiC ya [17] 
Equations 14 through 16 are also ap- 
plicable to the interaction between an in- 


ducing on fiber and a discharge off fiber (see 
Fig. 3). Thus 


to 


Í 


K,Dt 
KsBiCia [18] 


where C;,a represents the combined effect 
of all on fibers from the inducing field upon 
the off fibers in the discharge field. 


It should be noted that the inhibi- 
tion of the discharge field by the in- 
eTA wba teed) 2A 


A= 


The S's brightness response to the match field can 


K.Bè 
KıBè(K:Ci,4) + KBK Cy, + 




ducing field is effectively greater when 
an inducing square, for example, is 
near the test square. This is because 
discharge elements near the test field 
are initially more effective than those 
far away. When the inducing field 
inhibits near discharge elements, the 
effect of spontaneous discharge on the 
test field is much more reduced than 
when the far discharge elements are 
inhibited. Therefore the effective 
discharge frequency is a function of 
the separation between test and in- 
ducing fields, or 


Fit 


Fee 


fe Se ky; [19] 


where 7; is the average separation 
between all elements in the inducing 
field and those in the test field. That 
fi is set in ratio to #,,, is empirically 
required for a satisfactory fit of certain 
data (see Diamond, 1960, pp. 183-
184). A completely rational account
of the functional relationship between 
fo and Fi, awaits further knowledge of 
separate and relative effects of light 
scatter in the intact human eye and 
on-off antagonism. 

Equations 17 through 19 may be 
combined to include the effect of the 
inducing field upon the off frequency, 
or 
uA KD — c0 
K:BèC,a + Kpa Cid 


i,t 


If we now combine Equations 1, 12, 
and 20 then 


ae: 21 
KsD*(KiCa,1) [2i 


KıBiCia + KB: tasted 
a 


be described in the 


same manner minus the inducing field effect, or 


K.Bmt 
A, = = 2 
KD KC) [22] 


K Bm (KaCm,m) + 


KBC's a 


Since in the present experiment, the S is required to match the test and match 


field in brightness such that 


$A_t = A_m$ [23]


then if accordingly we set Equations 21 and 22 equal 


KBr’ 


KsD!4(K6Ca,m)® 


K:Bm®!4(KoC m,m) + RB Caa 


[24] 


K.B 4 


KiB (K011) + KsBiP!4 (Kali) + 


In Table 2 are presented the values of 
the terms in Equation 24. The pro- 
cedures for determining values for the 
terms were as follows: The values of 
the luminance (B) terms were taken 
directly from the luminance values of 
the different fields. The area-separa- 
tion (C) terms were calculated accord- 
ing to Equation 10 above. The di- 
mensions of the discharge area were 
arbitrarily chosen; since only minimal 
inhibitory effects occur between fields 
separated by more than 4.5° (Fry & 
Alpern, 1953) of visual angle and at 
certainly no more than 9° (Leibowitz 
et al., 1953); spontaneous discharge 
elements further than 9° from the 
center of the test field were not con- 
sidered to be effective. 

The value of D (spontaneous dis- 
charge activity) is arbitrarily taken as 
the value 1. This turns out to be 
empirically satisfactory and must do 
until such time as physiological and/or 
psychophysical manipulation of dis-
charge activity reveals its actual
value. 

The value of the a, b, c, e, f, and K 
terms were determined empirically 


KsD*!4(K6Cu.t)* 
FrtCra 


f: 
KıB:C*t a + KsBi (2G ) 
ae 


(by trial and error fitting). The
value of K, is not included in Table 2
since this term cancels out in Equa-
tion 24.

It should be noted that in Equation
24 and Table 2 are listed a total of 18
fitted constants. Were these con- 
stants chosen completely without 
restriction, any number of a variety of 
functions could be generated, so that 
the fit of our physiological theory to 
the 16 functions in our present experi- 
ment (see Fig. 2) would be meaning- 
less. The constants, however, were 
not chosen arbitrarily but were those 
fit by Diamond (1960) to more than 
45 additional functions having to do 
with brightness contrast and pre- 
dicted by the same theory as outlined 
above. 

As noted above, enhancement as a 
function of test area is not clearly 
evident in the data. Furthermore, 
although it is included as a basic com- 
ponent of the present theory it is not 
predicted for the stimulus conditions 
existing in the present experiment. 
That is, the off fibers in the discharge
field (see Fig. 3) are already inhibited
by the suprathreshold test light,
according to Equation 17 above.
The increase in test area therefore has
no further effect on the discharge off
fibers thereby resulting in little or no
disinhibition or enhancement.


TABLE 2

VALUES OF TERMS IN EQUATION 24

[Table body illegible in the scan. Column groups: luminance (log mL.), area-separation (log C), exponents, and proportionality constants (log K).]


SUMMARY 


The brightness of a test-field rectangle was 
studied as a function of its area and the 
luminance of a nearby inducing-field rectangle. 
The area of the test field was varied by in- 
creasing its vertical dimension from 5.5’ to 
33.0’ in visual angle. A binocular matching 
technique was used in which S indicated the
test-field brightness by the luminance of a
match field set equal in brightness to the test
field.

Two experiments were performed. In the 
first the match area was held at a constant 
maximum value. In the second the match 
area was allowed to vary along with the test 
area. The dependent variable for both ex- 
periments was match-field luminance and the 
independent variables, test area, test lumin- 
ance, and inducing luminance. Very little if 
any change occurred in the test-field bright-
ness as a function of test-area variation in
either experiment. This held true for a
wide range of test and inducing luminances. 
As inducing luminance was increased, how- 
ever, test brightness became depressed. A 
mathematical theory fit to the data was based 
upon physiological inhibitive interaction 
among individual cone receptors in the retina. 


REFERENCES 


ADRIAN, E. D. The basis of sensation. London:
Christopher, 1928.

BARLOW, H. B., FITZHUGH, R., & KUFFLER,
S. W. Resting discharge and dark adaptation
in the cat. J. Physiol., 1954, 125, 28-29.

BOYNTON, R. M., & RIGGS, L. A. The effect
of stimulus area and intensity upon the
human retinal response. J. exp. Psychol.,
1951, 42, 217-226.

DIAMOND, A. L. Foveal simultaneous brightness
contrast as a function of inducing and
test-field luminances. J. exp. Psychol.,
1953, 43, 189-195.

DIAMOND, A. L. Foveal simultaneous contrast
as a function of inducing-field area.
J. exp. Psychol., 1955, 50, 144-152.

DIAMOND, A. L. A theory of depression and
enhancement in the brightness response.
Psychol. Rev., 1960, 67, 168-198.

FRY, G. A., & ALPERN, M. The effect of a
peripheral glare source upon the apparent
brightness of an object. J. Opt. Soc. Amer.,
1953, 43, 189-195.

GRANIT, R. Receptors and sensory perception.
New Haven: Yale Univer. Press, 1955.

GUILFORD, J. P. Psychometric methods. New
York: McGraw-Hill, 1936.

HARTLINE, H. K. The response of single
optic nerve fibers of the vertebrate retina.
Amer. J. Physiol., 1938, 121, 400-415.

HARTLINE, H. K., & GRAHAM, C. H. Nerve
impulses from single receptors in the eye.
J. cell. comp. Physiol., 1932, 1, 277-295.

HARTLINE, H. K., & RATLIFF, F. Inhibitory
interaction of receptor units in the eye of
Limulus. J. gen. Physiol., 1957, 40, 375-376.

HARTLINE, H. K., WAGNER, H. G., & RATLIFF,
F. Inhibition in the eye of Limulus.
J. gen. Physiol., 1956, 39, 651.

HEINEMANN, E. G. Simultaneous brightness
induction as a function of inducing- and
test-field luminances. J. exp. Psychol.,
1955, 50, 89-96.

LEIBOWITZ, H., MOTE, F. A., & THURLOW,
W. R. Simultaneous contrast as a function
of separation between test and inducing
fields. J. exp. Psychol., 1953, 46, 454-456.


(Received August 5, 1961) 


Journal of Experimental Psychology
1962, Vol. 64, No. 4, 346-351 


MEDIATED SATIATION IN VERBAL TRANSFER ¹


LEON A. JAKOBOVITS AND WALLACE E. LAMBERT
McGill University 


The role of mediation in associative 
processes has long been recognized by 
psychologists (see Atherton & Wash- 
burn, 1912) despite early failures to 
demonstrate its influence in controlled 
experiments (e.g., Howe, 1893).
Peters (1935) was among the first to
report positive results in controlled
designs, and recently, several in-
vestigators (McGehee & Schulz, 1962;
Russell & Storms, 1955) showed con- 
clusively that mediation in paired- 
associate learning can be demon- 
strated in the laboratory. In these 
studies the existence of mediation in 
verbal learning was inferred from 
transfer designs permitting positive 
generalization of acquisition gradients. 
The present paper represents an at- 
tempt to study mediation in verbal 
transfer using generalization of inhibi- 
tion gradients. In other words, a 
design was used which, on the basis 
of previously established facts, should 
lead to facilitative effects in paired- 
associate learning but, because the 
inferred mediators are “tampered 
with,” inhibition instead of facilita- 
tion is expected to be transferred. 
Since the present design produces 
inhibition by the direct manipulation 
of the mediators, it has an advantage 
over previous approaches in which the 
mediated effect remained at the im- 
plicit level. 


¹ This research was supported by the
Canadian Defense Research Board Grant
D77-9401-10 and in part by a subvention
from the Carnegie Corporation. We are
grateful to Rabindra Kanungo for helpful
suggestions.




METHOD 


Design.—The design of the present study 
was parallel to those of Russell and Storms 
(1955) and McGehee and Schulz (1962). In 
Group E (experimental), Ss learned two 
paired-associate lists: List 1 established A-B 
connections between nonsense syllables and 
meaningful words; List 2 was composed of 
A-D pairs, where the relation between B and 
D was such that D was the most common 
associate to C and the latter was the most 
common associate to B, The middle element, 
C, acts as the mediating link in the forward 
associative chain, B > C > D, and provides 
facilitation in the acquisition of the A-D list. 
In the present case, however, the C word was 
“satiated” according to a technique described 
by Lambert and Jakobovits (1960) who 
showed that continuous repetition of a word 
results in a decrement in the intensity of its 
connotative meaning. It was expected that 
the decreased meaningfulness of the C word 
would reduce ifn, “ectiveness to act as a 
mediator in thej- 7C => D chain, reducing 
the facilitation éffèct during the subsequent 
acquisition of the A-D pairs. 

In Group C (control), Ss also learned two 
lists: List 1 established A-X associations be- 
tween the same nonsense syllables but other 
meaningful words than those used for Group 
E; List 2 was composed of the same A-D pairs 
as used for Group E. However, no associa- 
tive relation existed between the X and D 
words. The words were the same as those 
used in the Russell and Storms (1955) study 
where a complete description of the procedure 
for selecting words can be found. 
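
The two-list design can be summarized in a small Python sketch. The word triples are the examples printed in Table 1; the dictionary layout and the function names are illustrative assumptions and not part of the original procedure.

```python
# Stimulus syllable A -> (List 1 response B, inferred mediator C, List 2 response D).
# Examples taken from Table 1; the data layout itself is hypothetical.
CHAINS = {
    "GEX": ("JUSTICE", "PEACE", "WAR"),   # Mediator Nonsatiated example
    "YOV": ("SOLDIER", "ARMY", "NAVY"),   # Mediator Satiated example
}

# Control responses X that have no associative relation to D (Table 1 examples).
CONTROL_X = {"GEX": "HOUSE", "YOV": "CHEESE"}


def build_lists(group):
    """Return (List 1, List 2) as stimulus -> response mappings for a group."""
    if group not in ("E", "C"):
        raise ValueError("group must be 'E' or 'C'")
    list1, list2 = {}, {}
    for stimulus, (b_word, _mediator, d_word) in CHAINS.items():
        # Group E learns A-B first; Group C learns A-X first.
        list1[stimulus] = b_word if group == "E" else CONTROL_X[stimulus]
        # Both groups learn the same A-D pairs as List 2.
        list2[stimulus] = d_word
    return list1, list2


def mediators_to_satiate(satiated_stimuli):
    """C words that receive the satiation treatment between the two lists."""
    return [CHAINS[s][1] for s in satiated_stimuli]
```

Only the C words are repeated during satiation; the B and D words that S actually learns are never repeated, which is what makes the manipulation a test of mediation.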

The overall design is illustrated in Table 1.
Group E received the Mediator Nonsatiated
and the Mediator Satiated conditions. Each
of these two conditions consisted of five paired
associates. The A1-BN and A1-DN pairs cor-
respond to the second half of the A-B and A-D
pairs, respectively, in Table 3 of the Russell
and Storms study. The A2-BS and A2-DS
pairs correspond to the first half of the A-B
and A-D pairs, respectively, in their study.
The 10 A-B pairs formed List 1 in the present
study, while the 10 A-D pairs formed List 2.




in each list were presented in a standard
memory drum at a 3:3-sec. rate with a 6-sec.
intertrial interval. The instructions given
were the same as those described in detail
by Storms (1958). Each S saw List 1 for a
maximum of 27 trials or until he met the
criterion of three errorless repetitions, which-
ever came first. (All Ss met the lesser
criterion of one errorless repetition, but 10
failed to meet the criterion of three errorless
repetitions within the maximum of 27 trials.)
Eight minutes elapsed between the last pres-
entation of List 1 and the first presentation
of List 2. During this period, Ss of Group E
received the satiation treatment which in-
volved the five Cs words of the Mediator
Satiated condition (A2-BS), as well as five
filler words (actually, the second half of the
X words in the Russell and Storms table).
First, S rated the 10 words (randomly mixed)
on three scales of the semantic differential
(pleasant-unpleasant, strong-weak, and fast-
slow). Then S repeated aloud each word for
20 sec. before rating it again on the same three
scales. Differences in intensity of ratings
before and after repetitions represent the
semantic satiation scores to be presented be-
low. Following the satiation treatment, List
2 was presented and S learned it to a criterion
of three errorless repetitions and in this case
all Ss reached the criterion in less than 27
trials.


TABLE 1
ILLUSTRATION OF THE DESIGN USED IN THE EXPERIMENT

Condition               List 1    List 2    Inferred Action
Mediator Nonsatiated    A1-BN     A1-DN     A1-BN → CN → DN
Mediator Satiated       A2-BS     A2-DS     A2-BS → CS → DS
Nonmediated Control     A1-X1     A1-DN     A1-X1 → ?, A1-DN
                        A2-X2     A2-DS     A2-X2 → ?, A2-DS

Examples of the Above Conditions in the Same Order

List 1          List 2      Inferred Action
GEX-JUSTICE     GEX-WAR     GEX-JUSTICE → PEACE → WAR
YOV-SOLDIER     YOV-NAVY    YOV-SOLDIER → ARMY → NAVY
GEX-HOUSE       GEX-WAR     GEX-HOUSE → ?, GEX-WAR
YOV-CHEESE      YOV-NAVY    YOV-CHEESE → ?, YOV-NAVY

Note.—The notations A1 and A2 refer, respectively, to the first 5 and second 5 stimulus members of the 10-item
lists. Similarly, BN or DN and BS or DS refer to the first and second half of the response members.


TABLE 2

NUMBER OF TRIALS TO CRITERION
ON THE VARIOUS CONDITIONS
FOR GROUPS E AND C

                                  Group E          Group C        t between
Condition                       Mean     SD      Mean     SD       Groups
A1-BN and A2-BS (List 1)       19.80 a   5.09      —       —
                                                                    1.21
A1-X1 and A2-X2 (List 1)         —        —      17.95 b   4.22
A1-DN and A2-DS (List 2)       10.77 a   4.85    17.30 b   5.16     4.45**
A1-DN                           7.77     3.45    15.25     5.14     6.13**
A2-DS                          10.30     4.74    14.45     5.16     2.86*

a The difference between these two means is signifi-
cant (t = 11.73; P < .001).
b The difference between these two means is not
significant (t = 0.64; ns).
* P < .01.
** P < .001.
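
The learning criterion described in this procedure (a run of errorless repetitions, within a 27-trial limit) can be expressed as a short Python function. This is a sketch under the assumption that each trial is scored simply by its number of errors; the function and argument names are hypothetical.

```python
def trials_to_criterion(errors_per_trial, run_length=3, max_trials=27):
    """Return the trial on which `run_length` consecutive errorless
    repetitions are first completed, or None if the criterion is not
    met within `max_trials` trials.
    """
    streak = 0
    for trial, errors in enumerate(errors_per_trial[:max_trials], start=1):
        # An errorless repetition extends the run; any error resets it.
        streak = streak + 1 if errors == 0 else 0
        if streak == run_length:
            return trial
    return None
```

Calling it with `run_length=1` gives the lesser criterion of one errorless repetition; a return value of `None` corresponds to an S who failed to reach criterion within the maximum number of trials.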

The Ss in Group C received the Non-
mediated Control condition shown in Table 1.
The five A1-X1 pairs and the five A2-X2 pairs
formed List 1, which corresponds to the A-X
column of the Russell and Storms table. The
A1-DN and A2-DS pairs which formed List 2
were the same as those for Group E. The
procedure used with Group E was duplicated
here except for the fact that no satiation
treatment was administered, and Ss were
engaged in neutral conversation during the
8 min. which separated the two lists.

In summary, the following comparisons
can be noted: the Mediator Nonsatiated
condition in the present study corresponds to
the “Chained” condition of Russell and
Storms (1955) or the “Mediated” group of
McGehee and Schulz (1962). The Non-
mediated Control condition corresponds to the
“Unchained” or “Nonmediated” conditions
respectively in those two studies; the Medi-
ator Satiated condition represents the pro-
active interference design of the present study.
The results are presented in terms of differ-
ences in acquisition scores of List 2 under the
three conditions.

Subjects.—The Ss were 50 male English-
speaking cadets of the Royal Canadian Air
Force enrolled in a 6-wk. training course at a
base near Montreal. They were asked to
volunteer for the experiment by their in-
structor. The testing was done individually
at the training base during regular work
hours. The first 30 Ss formed Group E, the
last 20, Group C.


RESULTS


Comparison between the two groups
on the number of trials required to
reach the criterion of one errorless
repetition (met by all Ss) for their
particular first list is presented at the
top of Table 2. There is no signifi-
cant difference between the two
groups, indicating that they are of
TABLE 3

COMPARISONS BETWEEN THE MEDIATOR NONSATIATED (A1-DN) AND MEDIATOR
SATIATED (A2-DS) CONDITIONS OF LIST 2 FOR GROUPS E AND C

                                  Means for Group E               Means for Group C
Measures                       A1-DN     A2-DS     Diff.       A1-DN     A2-DS     Diff.
First 5 diff. correct           3.13      1.87      1.26**      2.45      2.55     −0.10
  responses                    (0.62)    (0.62)    (1.72)      (1.07)    (1.07)    (2.18)
Trials to 1 errorless           7.77     10.30     −2.53**     15.25     14.45      0.80
  repet.                       (3.47)    (4.73)    (3.28)      (5.14)    (5.15)    (4.05)
Total correct anticipa-        33.07     26.70      6.37**     43.30     44.10     −0.80
  tions a                     (15.40)   (11.83)    (6.41)     (17.10)   (15.50)   (14.52)
Total number of                 3.97      6.30     −2.33*      12.30     12.80     −0.50
  intrusions b                 (3.43)    (4.60)    (3.50)      (6.45)    (7.65)    (8.07)
Total number of                17.67     21.71     −4.04**     30.25     28.95      1.30
  omissions a                 (11.43)   (13.83)    (4.78)      (7.55)    (8.85)    (7.98)
Total correct                  47.90     40.83      7.07**     56.10     57.25     −1.15
  anticipations b             (16.20)   (11.83)    (7.3?)     (14.85)   (13.78)   (13.19)

Note.—Numbers in parentheses are SDs.
a To a criterion of one errorless repetition.
b To a criterion of three errorless repetitions or 27 trials.
* P < .01, two-tailed t test.
** P < .001, two-tailed t test.


equal learning ability. This conclu- 
sion is correct unless the materials in 
the two lists are not of equal difficulty 
which is unlikely in view of the par- 
ticular selection procedure used by 
Russell and Storms (1955) and the 
fact that McGehee and Schulz (1962), 
using the same meaningful words but 
other nonsense syllables of compa- 
rable association values, found no such 
difference. The two groups differ 
significantly on the acquisition of 
List 2, with Group E showing a 
marked superiority over Group C.
This finding is a replication of Mc-
Gehee and Schulz and other studies in
which the mediated condition was
found to be superior to a nonmediated
condition. The same finding is
pointed up by the fact that the ac-
quisition of List 2 by Group E is
significantly faster than for List 1,
whereas no such facilitation effect is
noticeable for Group C (see Footnotes
a and b to Table 2). Breaking down
the analysis of List 2 acquisition into
A1-DN and A2-DS pairs, it can be seen
that in both cases Group E is signifi-
cantly superior to the control. This
means that the predicted proactive
inhibition effect of the Mediator
Satiated condition (A2-DS) did not re-
sult in absolute negative transfer. In
fact, facilitation was noticed, although
significantly less than in the case of
the Mediator Nonsatiated condition
(A1-DN), as will be indicated below.
The mean semantic satiation score
for Group E on all 10 words was −3.07
(SD = 6.64; t = 2.49; P < .02).
For the five Cs words the mean was
−2.27 (SD = 3.82; t = 3.20; P < .01);
the mean for the five filler words was
−0.80 (SD = 4.09; t = 1.05; ns).
This difference in amount of semantic
satiation shown on the two sets
of words approaches significance
(t = 1.84; P < .10), even though the
product-moment correlation coefficient
between the two sets is significant
(r = .43; P < .02). Although there
is a basis for speculating on the reason 
for the difference noted (e.g., the Cs 
words are related to the B words seen 
in List 1, whereas the filler words are 
not so related), such a discussion is 
not directly relevant to the purpose of 
this study. 
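
The t values quoted for the semantic satiation scores can be checked from the reported means and SDs. The Python sketch below assumes N = 30 (the size of Group E) and assumes that each SD was computed with N in the denominator, so that the standard error is SD/√(N − 1). Neither assumption is stated in the text; they are adopted here because they reproduce the three reported values to two decimals.

```python
import math

N_GROUP_E = 30  # number of Ss in Group E


def t_from_summary(mean_score, sd, n):
    """t for a mean tested against zero, from summary statistics.

    Assumption: `sd` was computed with n in the denominator, so the
    standard error of the mean is sd / sqrt(n - 1).
    """
    return mean_score / (sd / math.sqrt(n - 1))


reported = {
    "all 10 words": (-3.07, 6.64),
    "five Cs words": (-2.27, 3.82),
    "five filler words": (-0.80, 4.09),
}

t_values = {
    label: round(abs(t_from_summary(m, sd, N_GROUP_E)), 2)
    for label, (m, sd) in reported.items()
}
```

Under these assumptions `t_values` comes out as 2.49, 3.20, and 1.05 for the three sets of words.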

Let us then turn to a comparison of 
the acquisition scores between the 
Ay-Dy and A»-Ds pairs of List 2 for 
Group E. Russell and Storms (1955), 
making a similar comparison in their 
study, used two separate criteria. 
One of these was an analysis of the 
first five different correct responses 
made by the Ss, with a view to deter- 
mine whether the response terms of 
the A-D pairs, for which associative 
chaining was possible (here, the Ay-Dy 
pairs), were more easily elicited during 
the early trials. In the present study 
these were compared to the As-Ds 
pairs where the mediator was satiated. 
The other analysis involved subtract- 
ing the total number of correct antici- 
pations by each S for the Mediator 
Satiated pairs (As-Ds) from the cor- 
responding total for the Mediator Non- 
satiated pairs (Ai-Dy). If there is 
inhibition during learning of the Me- 
diator Satiated pairs (i.e, S has a 
smaller number of correct anticipa- 
tions for the A»Ds than the Ai-Dw 
pairs), this difference will be positive. 
In view of Weitz’s (1961) argument 
that conclusions based on verbal 
learning data often depend on the type 
of criterion measure used, it was 
decided to add four other measures in 
the present analysis and these are pre- 
sented in Table 3. It can be seen that 
all six measures used in the compari- 
son support the conclusion that se- 
mantic satiation of the connotative 
meaning of the mediator (C) increases 
the difficulty with which an A-D list 
is acquired after having established
A-B pairs. Furthermore, the pro- 
active inhibition effect is noticeable 
not only as an increase in the number 
of trials required for acquisition, but 
also as an increase in both the number 
of intrusions and omissions made dur- 
ing acquisition. The same analysis 
for Group C is also given in Table 3. 
It will be remembered that Group C 
established A-X connections before 
learning A-D associations, and no sa- 
tiation treatment was given. It can 
be seen that none of the six measures
indicates differential difficulty of
acquisition of the A1-D1 and A2-D2
pairs.
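
The difference-score measure described above can be sketched as follows. This is a minimal illustration only: the function name and the five pairs of scores are hypothetical and are not the study's data, and the t computed here is the ordinary one-sample t on the difference scores, which is assumed (not confirmed by the text) to be the test the authors applied.

```python
import math
import statistics


def difference_score_t(nonsatiated, satiated):
    """One-sample t on per-S difference scores (nonsatiated minus satiated).

    A positive mean difference indicates fewer correct anticipations on the
    Mediator Satiated pairs, i.e., inhibition.
    """
    diffs = [n - s for n, s in zip(nonsatiated, satiated)]
    mean = statistics.mean(diffs)
    sd = statistics.stdev(diffs)  # sample SD, n - 1 in the denominator
    t = mean / (sd / math.sqrt(len(diffs)))
    return mean, sd, t


# Hypothetical totals of correct anticipations for five Ss (illustration only).
a1_d1_totals = [20, 18, 22, 19, 21]  # Mediator Nonsatiated pairs
a2_d2_totals = [17, 18, 19, 15, 20]  # Mediator Satiated pairs
mean_diff, sd_diff, t_value = difference_score_t(a1_d1_totals, a2_d2_totals)
```

With df = n - 1, the resulting t would be referred to the usual t table.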

A closer examination of the data 
lends support to the main finding. If 
the A1-B1 pairs of List 1 had been
originally better learned by
experimental Ss than the A2-B2 pairs, then
the A1-D1 pairs of List 2 would have
been easier to acquire than the A2-D2
pairs. Analysis of the total number
of correct anticipations of the A1-B1
pairs minus the corresponding number
for the A2-B2 pairs during learning of
List 1 (to a criterion of one errorless
repetition) yielded a mean of -2.47
(SD = 15.51; t = 0.88; ns). Thus,
not only is there no reliable difference
in the learning of A1-B1 and A2-B2
pairs in List 1, but also the A1-B1
pairs were somewhat more difficult to
learn as indicated by the minus value
of the mean difference. Conse-
quently, the differential difficulty of 
A-D pairs under the satiation and 
nonsatiation treatments cannot be
attributed to the initial differential
difficulty of A-B pairs. Also, the
fact that Group C did not exhibit
superiority of the A1-D1 over the
A2-D2 pairs of List 2 (Table 3)
indicates that there was no intrinsic
difference between the two sets of A-D
pairs. Finally, r = .42 (P < .02)
between the degree of semantic satiation
shown by each experimental S on the
satiated mediators (C2) and the
extent of inhibition shown in
acquisition of A2-D2 pairs relative to the
A1-D1 pairs. This is a most interest-
ing finding since it shows that the 
extent of proactive inhibition for in- 
dividual Ss caused by satiation of the 
mediator is related to the degree of 
decreased meaningfulness of the me- 
diator itself. 
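
The product-moment correlation reported above relates two scores per S: the degree of satiation on the mediators and the extent of inhibition in acquisition. A minimal sketch of that coefficient is given below; the function name is hypothetical, and no data from the study are reproduced.

```python
import math


def pearson_r(x, y):
    """Product-moment correlation between two equal-length score lists."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Sum of cross-products and sums of squares about the means.
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)
```

Here x would hold each S's satiation score and y the same S's difference score (nonsatiated minus satiated correct anticipations).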


Discussion 


The finding that a significant facilita- 
tion effect is obtained in List 2 with 
Group E, but not with Group C, is a 
replication of the positive transfer effect 
of the mediation paradigm reported by 
several authors and will not be discussed 
further in detail. The specific contribu- 
tion of this paper concerns the other 
finding reported, namely that signifi- 
cantly less facilitation is obtained when 
the mediator is satiated. Two possible 
mechanisms might be operating here: one 
is that the availability of the mediator is 
reduced, making the completion of the 
mediation sequence B → C → D less
probable; the other is that, given the 
assumption (see Staats, 1961) that some 
of the mediation reactions in the mediator 
C are also involved in B and D, the 
inhibition process might generalize to 
these two terms as well. As a result of 
either of these two processes, the sub- 
sequent acquisition of D will have been 
made more difficult. Furthermore, the 
amount of generalized inhibition or sec- 
ondary extinction might be expected to 
be proportional to the degree to which 
the original stimulus word was inhibited. 
The significant positive correlation re- 
ported above is consistent with this 
expectation.

The authors view the present findings 
as further support for their interpretation 
of semantic satiation as a cognitive form 
of reactive inhibition having character- 
istics similar to extinction phenomena 
noted with conditioned responses (Jako- 
bovits & Lambert, 1962a, 1962b; Lam- 
bert & Jakobovits, 1960). The present 
study also shows that the role of
mediation in verbal learning can be studied
from the point of view of generalization 
of inhibitory processes—an approach 
which is complementary to the positive 
semantic generalization paradigms used 
so far by previous investigators. Studies 
on the generalization of semantic in- 
hibition, or, as in the present case, of 
mediated satiation, ought to prove useful 
in eliminating undesirable verbal habits, 
and perhaps even nonverbal habits, and 
may well provide a tool for the experi- 
mental manipulation of implicit verbal- 
izations or meanings in studies of think- 
ing and problem solving. 


SUMMARY 


The present study was concerned with 
mediation in verbal transfer where generalized 
inhibition could be observed from one learning 
task to another. A proactive interference 
paradigm was arranged using the same method 
and material as Russell and Storms (1955) and 
McGehee and Schulz (1962). Assuming that 
mediation follows the sequence B → C → D,
Ss first learned an A-B list, then the meaning 
of the inferred mediator, C, was reduced by a 
satiation procedure, and finally an A-D list 
was learned. It was shown that satiation of 
the mediator resulted in generalization of 
inhibition during learning of the A-D list. 
The results were discussed in the light of the 
existing theoretical models for mediated 
generalization. 




REFERENCES 


Atherton, M. V., & Washburn, M. F.
Mediate association studied by the method
of inhibiting associations: An instance of
the effect of "Aufgabe." Amer. J. Psychol.,
1912, 23, 101-109.

Howe, H. C. "Mediate" association. Amer.
J. Psychol., 1893, 6, 239-241.

Jakobovits, L. A., & Lambert, W. E.
Semantic satiation in an addition task.
Canad. J. Psychol., 1962, 62, 112-119. (a)

Jakobovits, L. A., & Lambert, W. E.
Semantic satiation among bilinguals. J.
exp. Psychol., 1962, 62, 576-582. (b)

Lambert, W. E., & Jakobovits, L. A.
Verbal satiation and changes in the intensity
of meaning. J. exp. Psychol., 1960, 60,
376-383.

McGehee, N. E., & Schulz, R. W.
Mediation in paired-associate learning. J. exp.
Psychol., 1962, 62, 565-570.

Peters, H. N. Mediate association. J. exp.
Psychol., 1935, 18, 20-48.

Russell, W. A., & Storms, L. H. Implicit
verbal chaining in paired-associate learning.
J. exp. Psychol., 1955, 49, 287-293.

Staats, A. W. Verbal habit-families,
concepts, and the operant conditioning of word
classes. Psychol. Rev., 1961, 68, 190-204.

Storms, L. H. Apparent backward
association: A situational effect. J. exp. Psychol.,
1958, 55, 390-395.

Weitz, J. Criteria for criteria. Amer.
Psychologist, 1961, 16, 228-231.

(Received August 9, 1961) 


Journal of Experimental Psychology 
1962, Vol. 64, No. 4, 352-354 


WORK DECREMENT AND REMINISCENCE IN 
PIGEON OPERANT RESPONDING 1


C. ALAN BONEAU AND SEYMOUR AXELROD
Duke University 


The present study was prompted by 
the incidental observation that pi- 
geons in a Skinner box seemed to 
exhibit some of the phenomena of 
human motor learning (see, e.g., 
Kimble, 1949). Specifically, after the 
pigeons had been trained to peck a key 
with a pattern projected on it, and 
not to peck when the key was blank 
(a relatively distributed work sched- 
ule), response rate declined over a 
series of successive positive periods 
(work decrement during relatively 
massed practice) and then increased 
markedly after a negative period in 
which response rate was low or zero 
(reminiscence). The experiment de- 
scribed here was a more systematic 
exploration of these phenomena; addi- 
tionally, the question was asked 
whether the reminiscence effect is at- 
tributable merely to the rest during 
the negative period, or to inhibitory 
properties acquired by the nega-
tive stimulus during discrimination 
training. 


METHOD 


The Ss were 12 naive young-adult white 
Carneau pigeons, maintained at 75% ad lib. 
weight by manipulation of food intake. The 
Ss were first key trained and then given 6
days of training on a 60-sec. variable interval
(VI) schedule for 30 1-min. periods per day.
Following this, Ss were trained for 4 days, 
using a 60-sec. VI schedule for positive periods 
and no reinforcement for negative periods, to 
peck at a disk on which a pattern (circle with 
a gap at the top) was projected, and to with- 
hold response when the disk was illuminated 


1 The authors are indebted to Norman 
Guttman for the use of his apparatus. David 
Wells assisted in running Ss and in analyzing 
data. 


but blank. Response periods lasted 60 sec.,
and alternated with 10-sec. off periods during 
which the box and key were unilluminated. 
Thirty periods per day were run, 15 positive 
and 15 negative, in random sequence with the 
restriction that no more than 2 positives or 
negatives were presented successively. For 
the next 4 (postdiscrimination test) days, 40 
periods/day were run. During Periods 9, 
18, 27, and 36 the key was either (D) dark 
(a rest period) or (L) lighted but blank (the 
negative stimulus) and reinforcement was 
withheld; during the remaining 36 periods, 
the positive stimulus was present and the 
60-sec. VI schedule was maintained. The 
temporal positions of the dark and light 
periods were systematically assigned so as to 
distribute sequence effects. 
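
The restricted random sequence used in discrimination training (15 positive and 15 negative periods, with no more than 2 of either kind in succession) could be generated along the following lines. This is a sketch under stated assumptions: the function name, the "+"/"-" labels, and the weighting by remaining counts are all hypothetical, the method does not sample all admissible sequences with equal probability, and nothing in the text indicates how the authors actually drew their sequences.

```python
import random


def constrained_sequence(n_pos=15, n_neg=15, max_run=2, rng=None):
    """Random order of "+" and "-" periods with no run longer than max_run."""
    rng = rng or random.Random()
    total = n_pos + n_neg
    while True:  # restart whenever the partial sequence reaches a dead end
        remaining = {"+": n_pos, "-": n_neg}
        seq = []
        while len(seq) < total:
            options = [
                kind for kind in remaining
                if remaining[kind] > 0
                and not (len(seq) >= max_run
                         and all(s == kind for s in seq[-max_run:]))
            ]
            if not options:
                break
            weights = [remaining[kind] for kind in options]
            choice = rng.choices(options, weights=weights)[0]
            seq.append(choice)
            remaining[choice] -= 1
        if len(seq) == total:
            return seq


# Example: one day's order of positive and negative periods.
day_order = constrained_sequence(rng=random.Random(0))
```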


RESULTS AND DISCUSSION 


In Fig. 1, average rates of respond- 
ing for the 12 Ss are plotted trial by 
trial for selected days of the training 
procedure. The first panel shows the 
relatively stable rate of responding 
on Day 6 (last) of VI training. The 
second and third panels show the 
rates to both the positive and negative 
stimuli for Day 1 and Day 4 (last) of 
discrimination training. As expected, 
there was a considerable warm-up 
effect at the beginning of each day. 

The fourth panel of Fig. 1 shows the
average rate per trial on the first day
of the postdiscrimination training.
Of interest are the high rates on the
first trials of each block and the
gradual decrease in rate until the
interpolated blackout period or negative
stimulus (Trials 9, 18, 27, and 36).
The decrease is quite orderly.
Individual Ss showed a steady trial-by-
trial decrease in rate, occasionally
with no inversions. Least squares
lines fitted to the last seven points of
the first group of eight trials (to
compensate for warm up), and to all
points for the other blocks of trials,
are also shown, and are the basis for
the determination of reminiscence
scores, defined as the difference
between the extrapolated ninth-trial
performance in one block and the
fitted first-trial performance in the
next.

[Fig. 1 plot not reproduced. Ordinate: average number of responses per min.; abscissa: successive one-minute response periods.]
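
The reminiscence score defined above can be sketched as follows. The function names, the `skip_first` argument, and the sample rates are hypothetical; the sign convention (next block's fitted first trial minus the extrapolated ninth trial, so that recovery after the interpolated period is positive) is an assumption, since the text gives only "the difference between" the two values.

```python
def fit_line(xs, ys):
    """Ordinary least-squares slope and intercept for y on x."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    slope = (sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
             / sum((x - mean_x) ** 2 for x in xs))
    return slope, mean_y - slope * mean_x


def reminiscence(block, next_block, skip_first=0):
    """Fitted first-trial rate of next_block minus extrapolated ninth-trial rate of block.

    block and next_block hold response rates for Trials 1..8 of a block.
    skip_first=1 drops the first point of `block`, as was done for the
    first block of the day to allow for warm up.
    """
    trials = list(range(1, len(block) + 1))
    slope, intercept = fit_line(trials[skip_first:], block[skip_first:])
    extrapolated_ninth = slope * (len(block) + 1) + intercept

    next_trials = list(range(1, len(next_block) + 1))
    next_slope, next_intercept = fit_line(next_trials, next_block)
    fitted_first = next_slope * 1 + next_intercept
    return fitted_first - extrapolated_ninth


# Hypothetical rates (responses per min.), not data from the study.
first_block = [80, 78, 76, 74, 72, 70, 68, 66]
second_block = [78, 76, 74, 72, 70, 68, 66, 64]
score = reminiscence(first_block, second_block, skip_first=1)
```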

Two differences in performance be- 
tween the first day and the last day 
of postdiscrimination (fifth panel) 
should be noted. First, the orderly 
reminiscence phenomenon occurring 
on the first day has disappeared by the 
fourth day. Secondly, there is a 
change in level of response. The aver- 
age amount of reminiscence per bird 
on Day 1 of postdiscrimination, when 
tested against the null-hypothesized 
zero, yields a t of 4.19, significant for
11 df at the .01 level. Response rate 
declined monotonically between the 
last day of discrimination and the last 
day of postdiscrimination (F = 6.10, 
df = 4/44, P < .01). 

As outlined in the procedure, half
of the interpolated 60-sec. rest periods
in postdiscrimination were periods in
which the negative stimulus was
presented; the other half were periods in
which the key was not lighted. The
average difference in reminiscence
over all 4 days following these two
kinds of interpolated periods was .48
responses per min. This difference
results in a t of .09, evidence consistent
with the hypothesis that the
reminiscence is due only to a low level of
responding (rest) during the
interpolated periods.

Fig. 1. Rate of responding (pecks per minute) on selected days of the training procedure.


The experiment demonstrates the oc- 
currence in the Skinner box of a number 
of the phenomena usually associated 
with human motor learning under con- 
ditions of massed and distributed prac- 
tice. The general superiority of dis- 
tributed practice is evident during the 
latter stages of discrimination training 
when Ss are relatively inactive during the 
negative periods. In the postdiscrimina- 
tion periods occur the phenomena of 
work decrement (when the work periods 
are relatively massed) and reminiscence 
following a rest. In addition, the aver- 
age performance under the massed 
practice regime of the postdiscrimination 
period progressively decreases relative 
to distributed practice performance. 
Warm-up decrement is not especially evi- 
dent except at the beginning of each 
day’s performance. A final similarity to 
human motor performance is the decrease 
in the amount of reminiscence as massed 
practice training proceeds. 




SUMMARY 


Pigeons were trained in a Skinner box to 
discriminate a key with and without a figure 
projected upon it and exhibited rapid re- 
sponding to the positive stimulus and little or 
no responding to the negative. No more than 
two positives or negatives were presented 
successively. The Ss were then given se- 
quences of eight positives followed by one 
negative. The resulting performance
exhibited phenomena characteristic of human
motor learning performance under massed
practice (warm-up decrement, temporary and
permanent work decrement, and
reminiscence).


REFERENCE 


Kimble, G. A. An experimental test of a
two-factor theory of inhibition. J. exp. 
Psychol., 1949, 39, 15-23. 


(Received August 11, 1961) 




Journal of Experimental Psychology 
1962, Vol. 64, No. 4, 355-363 


STUDIES OF DISTRIBUTED PRACTICE: XXII. SOME CONDITIONS
WHICH ENHANCE RETENTION 1
BENTON J. UNDERWOOD, GEOFFREY KEPPEL 
Northwestern University 
AND RUDOLPH W. SCHULZ 
University of Iowa 


The accumulated evidence indicates 
that in verbal learning some relatively 
high level of interference must be 
present before differences between 
massed practice (MP) and distributed 
practice (DP) will be observed. This 
law holds for differences in acquisition 
under MP and DP as well as for 
differences in retention following 
learning by MP and DP. However, 
the general statement relating inter- 
ference and DP effects must be accom- 
panied by subsidiary statements which 
relate particular DP phenomena to 
particular loci of the interference. A 
recent study (Underwood & Schulz, 
1961) indicated that DP will facilitate 
acquisition only when interference 
occurs in acquiring or integrating re- 
sponse terms as such; DP will have no 
influence on retention when the inter- 
ference is of this nature. On the 
other hand, when interference occurs 
in associating a response term to a 
stimulus term, DP may enhance re- 
tention but will have little effect on 
learning. It is this second subsidiary 
law with which the present experi- 
ments are concerned since these ex- 
periments explore further the condi- 
tions associated with better retention 
following DP than following MP. 

In the study noted above, Ss 
learned four paired-associate lists in 
which the stimulus terms were non- 
sense syllables and the response terms 

1 This work was done under Contract Nonr- 
1228 (15), Project NR 154-057, between 
Northwestern University and the Office of 
Naval Research. 


were two-syllable adjectives. Be- 
cause of the similarity across lists 
among the syllables, interference in as- 
sociating the response terms with the 
stimulus terms increased as the num- 
ber of lists increased. Each succes- 
sive list required the learning of new 
responses to stimuli which were very 
similar to stimuli in previous lists. 
This similarity may be maximized by 
using the same stimuli in all lists in 
which case the four lists would be 
symbolized as A-B, A-C, A-D, and 
A-E. In line with the second subsi- 
diary law, if DP is given on A-E, 
learning would not be facilitated but 
retention would be; DP appears to 
reduce proactive inhibition (PI) in 
retention when interference is of this 
type. Extinction and spontaneous 
recovery were postulated to be the 
basic mechanisms which reduced PI 
in the situation described above where 
interlist stimulus similarity was high. 
More particularly, it was assumed 
that in learning A-E the associations 
previously acquired (B, C, and D) to 
A were extinguished. Moreover, 
when A-E learning was by DP, it was 
assumed that the rest intervals al- 
lowed for some spontaneous recovery 
of the previously acquired associa- 
tions; hence, subsequent learning 
trials would result in the re-extinction 
of these associations. Finally, it was 
assumed that these successive extinc- 
tion-recovery cycles lead to a more 
permanent extinction of interfering 
tendencies under DP than under MP, 




thus reducing PI in retention. To 
say that interfering tendencies are 
more permanently extinguished under 
DP than under MP is to say only that 
recovery will be slower and will never 
attain as high a level under DP as 
under MP. 

The major purpose of the present 
experiments is to attempt to identify 
the associations which, according to 
this theory, are more permanently ex- 
tinguished during DP than during 
MP. In an A-B, A-C paradigm it is
possible to identify two different sets 
of associations which may be ex- 
tinguished in learning A-C. The 
most obvious set of associations which 
might be extinguished are the A-B 
associations. As S learns C to A, the 
A-B association may become extin- 
guished. These associations will here- 
after be referred to as specific S-R 
associations. The response terms 
identify a second set of associations. 
The B response terms may be learned 
without also being associated with 
their particular verbal stimuli in the 
list. That is, response learning may
occur independently of specific S-R 
learning. For this response learning 
the stimulus may be the general en- 
vironmental situation or, as it will be 
called here, the stimulus may be a 
contextual one. In learning the A-C 
list it is possible that the associations 
between context stimuli and the B 
responses become extinguished and 
are replaced by the C responses.
Thus, we may identify at least two 
kinds of associations which may be ex- 
tinguished in learning A-C following 
the learning of A-B. The experi- 
mental question asked is whether the 
extinction of both types of associa- 
tions is involved in the better reten- 
tion following DP than following MP, 
or whether the effect can be attributed 
to one or the other class of
associations. That extinction of both classes
of associations may occur seems to
have been demonstrated by Barnes 
(1960). However, the relative per- 
manence of the extinction of the two 
classes of associations and the relative 
recovery rates are unknown so that it 
is quite possible that the facilitating 
effect of DP on retention may be tied 
to the extinction of one class or the
other rather than to the joint
extinction of both classes.

In order to separate the effects of 
extinguishing the two classes of asso- 
ciations (specific S-R and contextual 
associations) a second paradigm is 
needed. This paradigm is known as 
the A-B re-paired paradigm in which 
on each successive list the same 
stimulus and response terms are used 
but the particular pairings differ from 
list to list. A study of this paradigm 
will show that in learning a second (or 
third, or fourth) list there are specific 
S-R associations which may be ex- 
tinguished. However, since the same 
responses are used in each list, no ex- 
tinction of contextual associations can 
occur, with the result that the
contextual stimuli should remain
associated for all lists. Thus, when the
A-B, A-C paradigm is posed against 
the A-B re-paired paradigm it is seen 
that both have specific S-R associa- 
tions which may be extinguished 
whereas only the A-B, A-C paradigm 
has contextual associations which may 
be extinguished. With these differ- 
ences in the two paradigms in mind, 
the experimental situation to be used 
may be considered.

The Ss will learn four lists. In one 
case these lists will be A-B, A-C, A-D, 
A-E. In the other case there will be 
four A-B re-paired lists. To minimize 
the awkwardness in referring to these 
two paradigms the first will hereafter 
be called RD (responses different in
each list) and the second RS (re- 
sponses same in all lists). For both 


EO EEO == 
= | a 


STUDIES OF DISTRIBUTED PRACTICE 


paradigms DP is introduced in the 
learning of the fourth lists and the 
effect of DP on retention measured by 
having other groups in which the 
fourth list is learned by MP. 

The results to be obtained in reten- 
tion following MP and DP of List 4 
for these two paradigms may demon- 
strate the need to limit application of 
the recovery-extinction theory to par- 
ticular kinds of associations, i.e., 
either to specific S-R associations or 
to contextual associations. This is to 
say that the empirical results may be 
used to refine the gross theory. Three 
different possible empirical effects of 
DP on retention of List 4 for the two 
paradigms may now be stated along 
with the theoretical inferences to be 
drawn given each effect. (a) Effects 
of DP may be positive and equal for 
the RD and RS paradigms. The con- 
clusion would be that extinction of 
specific S-R associations is responsible 
for the effect. (b) Effects may be 
positive for both paradigms but 
greater for RD than for RS. The 
conclusion would be that extinction 
of both specific S-R associations and 
contextual associations is responsible 
for the effect. (c) Effects may be 
positive for the RD paradigm with no 
effect for the RS paradigm. The con- 
clusion would be that extinction of 
contextual associations is responsible 
for the effect. 
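
The three outcome patterns just listed, and the inference attached to each, can be restated as a small decision rule. This is only a sketch of the logic as stated in the text: the function name and its boolean arguments are hypothetical, and in practice each argument would stand for the result of a statistical test rather than a raw comparison.

```python
def infer_extinguished_associations(rd_positive, rs_positive, rd_greater_than_rs):
    """Map the pattern of DP effects on List 4 retention to the stated inference.

    rd_positive, rs_positive: whether DP improved retention in each paradigm.
    rd_greater_than_rs: whether the DP effect was larger for RD than for RS.
    """
    if rd_positive and rs_positive and not rd_greater_than_rs:
        return "specific S-R associations"                        # outcome (a)
    if rd_positive and rs_positive and rd_greater_than_rs:
        return "both specific S-R and contextual associations"    # outcome (b)
    if rd_positive and not rs_positive:
        return "contextual associations"                          # outcome (c)
    return "pattern not covered by outcomes (a) to (c)"
```

For example, a positive effect for RD with none for RS corresponds to outcome (c).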

Experiments 1 and 2 were directed 
specifically toward testing the above 
notions. Experiment 3, which deals 
only with the RD paradigm, was 
designed to investigate the influence 
of a different level of learning than 
used in Exp. 1 and 2 and to make some 
initial determinations of differences in 
length of the DP interval. 


METHOD 


The three experiments have many
procedural details in common. Therefore, Exp.
1 will be described in detail followed by a
description of the changes introduced for 
Exp. 2 and 3. 


Experiment 1 


Design.—Two parallel sets of conditions 
were used, one set being based on lists forming 
the RD paradigm and one set based on lists 
forming the RS paradigm. For each para- 
digm four lists were learned by each S. For 
the first three lists all Ss within a paradigm 
were treated alike since the learning of these 
lists was imposed merely to build up interlist 
interference. For List 4, half the Ss under 
each paradigm learned by DP (1-min. inter- 
trial interval) and half by MP (4-sec. inter- 
trial interval). Twenty-four hours after 
learning List 4, recall and relearning measures 
were obtained on it. 

Lists.—Each list consisted of eight paired 
associates. The stimulus terms were non- 
sense syllables, the response terms two- 
syllable adjectives. For both paradigms only 
eight syllables are needed as stimuli for the 
four lists. The syllables used are those listed 
in Table 1 under List 1 of the previous experi- 
ment (Underwood & Schulz, 1961, p. 229). 
For the RS paradigm only eight different 
response terms are needed for the four lists. 
These terms were the adjectives given for 
List 1 of Table 1 in the previous study. To 
construct the four lists the stimulus and 
response terms were simply re-paired ran- 
domly with the restriction that a given pairing 
occur in only one list. For the RD paradigm, 
eight different adjectives were used for each 
list, these four sets being those in Table 1 
of the aforementioned study. List 4 was 
exactly the same for Ss under both paradigms, 
this list being the List 1 in Underwood and 
Schulz (1961) except, of course, in the present 
study the adjectives were response terms and 
the syllables were stimulus terms. The se- 
quence of Lists 1-3 was the same for all Ss 
within a paradigm, i.e., they were not counter- 
balanced. 
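
The construction of the RS lists (random re-pairing of the same eight stimuli and eight responses, with the restriction that a given pairing occur in only one list) could be sketched as below. The function name and the placeholder labels S1..S8 and R1..R8 are hypothetical; the actual syllables and adjectives are those of Underwood and Schulz (1961) and are not reproduced here. The sketch also ignores the fact that List 4 was fixed in advance so as to be identical for both paradigms.

```python
import random


def repaired_lists(stimuli, responses, n_lists=4, rng=None):
    """Build n_lists paired-associate lists by re-pairing the same terms.

    Each list is a random one-to-one pairing; a candidate list is redrawn
    if it repeats any stimulus-response pairing already used.
    """
    rng = rng or random.Random()
    used_pairs = set()
    lists = []
    while len(lists) < n_lists:
        shuffled = list(responses)
        rng.shuffle(shuffled)
        pairs = list(zip(stimuli, shuffled))
        if any(pair in used_pairs for pair in pairs):
            continue  # violates the restriction; draw again
        used_pairs.update(pairs)
        lists.append(pairs)
    return lists


# Placeholder labels standing in for the eight syllables and eight adjectives.
stimuli = [f"S{i}" for i in range(1, 9)]
responses = [f"R{i}" for i in range(1, 9)]
rs_lists = repaired_lists(stimuli, responses, rng=random.Random(1))
```

For the RD paradigm no such restriction is needed, since each list draws on a different set of eight adjectives.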

Procedure.—Lists were presented at a 2:2-
sec. rate with anticipation learning through-
out. Each list for Ss in the RD paradigm 
was presented for 12 anticipation trials with S 
instructed to give as many correct anticipa- 
tions as possible on each trial. Four different 
orders of items were used for each list with 
each order being used as a start order about 
equally often. Lists 1 and 2 were presented 
on Day 1; Lists 3 and 4 on Day 2, and 5 re- 
learning trials were given on List 4 on Day 3, 
24 hr. after Day 2. Relearning was by MP
for all groups. Approximately 30 sec. elapsed
between the learning of the two lists on a
given day. The DP intervals on List 4 were
introduced after each trial and were filled with
symbol cancellation. The procedures for Ss
in the RS paradigm were exactly the same as
those outlined above except that 15
anticipation trials were given on each list.
Since the RS paradigm produces more
interference than the RD paradigm, 15 trials
were used for the RS paradigm (as compared
with 12 for the RD) in order to make the
level of learning under the two paradigms
roughly comparable.

Fig. 1. Learning and relearning curves for Exp. 1. [Plot not reproduced. Ordinate: mean number correct responses; abscissa: learning trials and, after 24 hours, relearning trials; separate curves for massed practice and distributed practice.]


The Ss were all college students, 
and, while all were not naive to 
verbal-learning experiments, none had 
previously learned syllable-adjective 
pairs. For the RD paradigm 32 Ss
learned List 4 by DP and 32 by MP.
For the RS paradigm there were 30 Ss
in each group.


Experiment 2 


As in Exp. 1, both paradigms were 
studied. The changes from Exp. 1
were as follows: (a) Lists 1-3 were 
given on Day 1, List 4 was given on 
Day 2, and List 4 was relearned for 5
trials on Day 3, 24 hr. after Day 2.
(b) Under both paradigms 12 anticipa- 
tion trials were given on all lists. 
(c) MP was 4 sec. between trials while 
DP consisted of a 3-min. interval after 
Trials 1, 3, 5, 7, 9, and 11. (d) All 
four groups (MP and DP groups for 
each paradigm) had 32 Ss. 


Experiment 3 


In Exp. 3 only the RD paradigm 
was studied. All four lists were pre- 
sented during a single session but each 
was presented for only 5 anticipation 
trials. Three groups of 25 Ss each 
were used, the groups being differ- 
entiated only by the intertrial interval 
on List 4. One group had the usual 
MP (4-sec. intertrial interval), a 
second had 60 sec. between trials, and 
a third had 180 sec. between trials.
Again List 4 was given 5 relearning 
trials 24 hr. after original learning. 


RESULTS 


Experiment 1—The equivalence of 
the learning ability of the two groups 
of Ss within each paradigm can be 
gauged by the learning scores for Lists 
1-3 since the conditions were identical
for all Ss for these lists. For the RD 
paradigm, the mean total correct an- 
ticipations for Lists 1-3 in order for 
the MP Ss were 64.09, 64.00, and 
72.31. For the DP Ss the correspond- 
ing values were 63.38, 63.53, and 
73.03. For the RS paradigm the 
values for the MP Ss were 79.57, 
80.67, and 76.73 and for the DP Ss,
78.40, 80.53, and 75.23.

The acquisition and relearning 
curves for Ss in both paradigms for 
List 4 are shown in Fig. 1. For both 
paradigms learning is somewhat slower 
under DP than under MP; this is true 
throughout the 15 trials for RS lists 
and for the initial part of learning for 
the RD lists. This finding confirms
that of an earlier study (Underwood
& Schulz, 1961) and also conforms
to the extinction theory in that the
DP interval should allow interfering
associations to recover, thus impeding 
acquisition. 

Looking next at the recall trial it 
can be seen that for the RS paradigm 
recall is almost identical for both MP 
and DP. The slight difference in the 
relearning curves in favor of MP
probably reflects the small difference 
present in original learning. It must 
be concluded that DP does not 
facilitate retention of lists forming the 
RS paradigm. For the RD paradigm, 
however, recall is better under DP 
than under MP. The exact values 
are 2.59 for MP and 3.66 for DP 
(t = 2.28, .05 > P > .02). If loss 
scores are used as described in the pre- 
vious study (Underwood & Schulz, 
1961, pp. 231-232), in order to adjust for
any differences in original learning, the 
evaluation does not change (t = 2.32). 
It is concluded that DP in the RD 
paradigm facilitates recall after 24 hr. 
The effect, however, is very transitory 
since no difference in the subsequent 
four relearning trials is evident. 

Certain expectations concerning 
overt errors follow from the extinction 
hypothesis. First, more intrusions 
from previous lists should occur in 
learning List 4 under DP than under 
MP. In the RS paradigm, the
definition of an intrusion is somewhat
ambiguous since the same responses occur
in all lists. However, the nearest
approximation would be to consider
only responses given to a stimulus
with which that response had been
paired in previous lists.
Approximately two-thirds of all errors made
in learning List 4 were of this nature.
And, while more of these (a total of
387) occurred under DP than under
MP (347), statistically speaking, the
difference is far from significant. For
the RD paradigm, 13 responses from
Lists 1-3 occurred in learning List 4
under DP, and 7 under MP. While 
this difference is in accordance with 
the theory, the numbers are so small 
that little should be made of the 
effect. 

A second expectation from the 
theory is that fewer intrusions should 
occur in relearning following DP than 
following MP. For the RS paradigm 
132 intrusions (as per the definition 
given above) occurred in relearning 
following MP and 138 following DP. 
Clearly, there is no evidence in this 
paradigm of a more permanent ex- 
tinction occurring under DP than 
under MP. For the RD paradigm, 
44 intrusions occurred during relearn- 
ing following MP and 24 following 
DP. These data are in conformance 
with the theory. 

Experiment 2.—In this experiment, 
Lists 1-3 were learned on Day 1 with 
List 4 being learned on Day 2, and 
the DP interval for List 4 was 3 min. 
after every other trial. 

In learning Lists 1-3 the mean total 
correct responses for the MP Ss in the 
RD paradigm were 74.59, 69.53, and 
74.13. For the DP Ss the correspond- 
ing values were 69.78, 64.34, and 
73.13. The MP Ss learning the RS 
lists showed means of 68.38, 60.97, 
and 67.00, while the comparable 
values for the DP Ss were 69.13, 56.56, 
and 64.25. Thus, the two groups 
within each paradigm are fairly com- 
parable in learning ability. 

The performances in learning and 
relearning List 4 are plotted in Fig. 2. 
Again the Ss learning RD List 4 under 
DP show inferior performance to 
those learning under MP. For the 
RS paradigm, the picture is a little 
different. After nearly every rest 
interval the DP Ss show inferior 
performance to the MP Ss but on the 
immediately succeeding trial the
performance of the two groups is
essentially equivalent. Thus, while the
rest interval appears to inhibit
performance, the overall learning rate
does not appear to be seriously
impaired.

Fig. 2. Learning and relearning curves for Exp. 2. [Plot not reproduced. Ordinate: mean number correct responses; abscissa: learning trials and, after 24 hours, relearning trials; separate curves for massed practice and distributed practice.]

The recall of the DP Ss under the 
RD paradigm is again superior to the 
MP Ss. The mean values are 3.13 for 
MP Ss and 4.53 for the DP Ss 
(t = 3.11, P < .01). Clearly, DP
facilitates recall of the fourth list for 
this paradigm. The DP Ss are also 
superior to the MP Ss for the first four 
trials of relearning. However, the 
difference across the five trials in 
terms of mean total correct responses 
(1.85) is not significant (t = 1.37). 

For Ss learning List 4 of the RS 
paradigm no difference of consequence 
is apparent in recall and relearning. 
While recall favors the DP Ss, the 
difference of .11 items gives a t of only
.22. Thus, as in Exp. 1, it is seen
that recall of List 4 is facilitated in the 
RD paradigm and no appreciable 
effect is noted for the RS paradigm. 

No difference of consequence in
number of intrusions in learning under
MP and DP for the RS paradigm was
noted; the same was true for relearning.
For the RD paradigm 13 intrusions
were recorded for the MP Ss
learning List 4 and 9 for the DP Ss.
In this case the difference is in the
opposite direction from the difference
found in Exp. 1 and is not in accordance
with extinction theory. For relearning,
however, MP Ss made 55
intrusions and DP Ss, 28, which, as in
Exp. 1, is in line with expectations
from the theory.

Experiment 3.—In this experiment 
only the RD paradigm was used. 
Lists 1-4 were presented for five 
acquisition trials each, with three 
subgroups of 25 Ss each having 4, 60, 
or 180 sec. as the intertrial interval 
on List 4. The mean total correct 
responses given on Lists 1-3 by the 
4-sec. Ss were 17.76, 14.92, and 19.52, 
respectively. The comparable scores 
for the 60-sec. Ss were 17.80, 14.92, 
and 20.28, and for the 180-sec. Ss, 
17.88, 14.96, and 21.32. 

The learning and relearning curves 
for List 4 for the three groups are 
shown in Fig. 3. Again it is to be
noted that during learning the performance
of the two DP groups is
inferior to that of the MP group.
Both DP curves are consistently lower
than the MP curve throughout the
five trials. Again, however, the differences
are just short of significance
at P = .05 (F = 2.91, df = 2/72).
Nevertheless, in view of the fact that
for this paradigm DP has been consistently
inferior to MP in learning in
all three experiments, the inhibitory
effect may be taken to be reliable.

FIG. 3. Learning and relearning curves for Exp. 3. (Curves: 4-, 60-, and 180-sec. intertrial intervals.)

The mean number of responses cor- 
rectly recalled after 24 hr. were 1.20, 
2.04, and 2.12 for 4, 60, and 180 sec., 
respectively (F = 1.95, df = 2/72, 
P > .05). However, it seems clear 
that adjustments must be made to 
compensate for differences in level of 
learning originally attained by the 
three groups. When this adjustment 
is made by the method noted in the 
earlier paper (Underwood & Schulz, 
1961), the F becomes 21.40 (df =2/72, 
P <.001). Therefore, it may again 
be concluded that for the RD para- 
digm distributed practice reduces PI 
in retention. 

In the results of Exp. 1 and 2 it was 
noted that the facilitation in retention 
for the RD paradigm produced by DP 
was largely limited to the first recall 
trial. In the present data the effect 
is much less transitory; in spite of the
fact that MP Ss were appreciably 
better in learning, all groups are about 
equal in relearning. This effect of 
DP on relearning may be estimated by 
subtracting for each S the number of 
correct responses given on the five 
learning trials from the number of 
correct responses given on the five re- 
learning trials. The means for these 
differences were 2.16, 7.84, and 9.20 
for the 4-, 60-, and 180-sec. Ss, re- 
spectively. The F (12.60) is beyond 
the .01 level. The potency of the PI 
effects on the MP Ss can be further 
described by noting that of the 25 Ss, 
11 showed performance in relearning 
which was inferior to that shown 
during learning. By the same token 
the reduction in PI effects produced 
by DP can be seen by noting that
only one of the 60-sec. Ss performed
more poorly in relearning than in 
learning and none of the 180-sec. Ss 
performed so. 
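
The difference score described above (correct responses on the five relearning trials minus correct responses on the five learning trials, taken S by S) can be illustrated with a minimal sketch. The function names and the four-S data set are hypothetical and are not data from the experiment; only the arithmetic follows the text.

```python
from statistics import mean


def relearning_gain(learning_correct, relearning_correct):
    """Per-S difference: total correct in relearning minus total correct in learning."""
    if len(learning_correct) != len(relearning_correct):
        raise ValueError("each S needs both a learning and a relearning total")
    return [r - l for l, r in zip(learning_correct, relearning_correct)]


def count_poorer_in_relearning(gains):
    """Number of Ss whose relearning total fell below their learning total."""
    return sum(1 for g in gains if g < 0)


# Hypothetical totals over five trials for four Ss (illustration only).
learning = [20, 15, 18, 22]
relearning = [24, 14, 25, 30]

gains = relearning_gain(learning, relearning)
mean_gain = mean(gains)
n_poorer = count_poorer_in_relearning(gains)
```

A negative difference for an S corresponds to the case noted in the text, in which performance in relearning was inferior to that shown during learning.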

In learning List 4 the 4-sec. Ss gave 
7 intrusions, i.e., responses which were 
appropriate for an earlier list. The 
60-sec. Ss gave 32 such responses and 
the 180-sec. Ss gave 11. In the 4-sec. 
condition all 7 intrusions were given 
by a single S, whereas 16 Ss gave 
intrusions under the 60-sec. condition 
and 4 under the 180-sec. condition. 
Thus, while more intrusions were 
made under the DP conditions than 
under the MP condition, the fact that 
the 60-sec. Ss gave more intrusions 
than did the 180-sec. Ss would not be 
anticipated by the extinction hy- 
pothesis. 

The intrusions during relearning 
totaled 34, 33, and 25, for the 4-, 60-, 
and 180-sec. Ss, respectively. The 
theory predicts more intrusions for 
MP Ss than for DP Ss. While the 
values indicate this to be the case the 
differences are so small that they do 
not allow any strong support for this 
aspect of the theory. 


DISCUSSION 


The results of Exp. 1 and 2 will be 
evaluated first. The basic findings were 
as follows. When Ss learn four lists 
with the RS paradigm, DP on List 4 does 
not facilitate the 24-hr. retention of this 
list. But, when Ss learn four lists with 
the RD paradigm, the retention of List 4 
is facilitated by learning the list under 
DP conditions. In terms of the argu- 
ment advanced in the introduction, the 
implication of these facts is that reten- 
tion of a list learned by DP is facilitated 
only when the paradigm allows extinc- 
tion of contextual associations (associa- 
tions between the general environment 
and the response terms). Both para- 
digms studied allow, presumably, extinc- 
tion of specific S-R associations but only 
the RD paradigm allows for extinction
of contextual associations. Thus, the
notion that DP, by allowing for succes- 
sive recovery-extinction cycles, leads to 
a more permanent extinction of interfer- 
ing tendencies appears to be supported 
for contextual associations only. 

The question may be raised as to why 
the successive-extinction hypothesis is 
not supported in the case of specific S-R 
associations in view of the fact that the 
extinction of such associations seems to 
have been demonstrated (Barnes, 1960; 
Barnes & Underwood, 1959). One pos- 
sibility is that recovery of such associa- 
tions occurs very slowly; thus, the short 
DP intervals used here may not in fact 
allow for recovery-extinction cycles. 
And in fact there is evidence (e.g., Briggs, 
1954) that recovery of such associations 
is indeed very slow. A barrier to the 
acceptance of this position is the fact 
that in learning List 4 in the RS para- 
digm there was evidence for some process 
which was impeding performance under 
DP. It is reasonable to think that this 
could represent the recovery of error 
tendencies of some sort. Yet, it is pos- 
sible that the interference which appears 
to increase with DP of List 4 in both 
paradigms is not representative nor 
symptomatic of the process which pro- 
duces the better retention following DP 
of List 4 under the RD paradigm. The 
inconsistency from experiment to ex- 
periment of the differences in intrusion 
frequency between MP and DP on List 4 
might argue for such a position. There- 
fore, although slower learning of List 4 
by DP than by MP is consistent with an 
extinction hypothesis, the slower learning 
need not necessarily be taken to mean 
that better retention occurs as a consequence.

Empirically speaking, there is a rela- 
tively simple principle which may be 
stated which will summarize the situa- 
tions in which DP may be expected to 
facilitate retention. As noted in the 
introduction, to expect any facilitation in 
retention by DP during learning requires 
first that appreciable interlist interfer- 
ence be present. Given this situation, 
the principle is that whenever the response
terms of the previously learned
associations producing the interference
are not present in the list being learned, 
DP will facilitate the retention of the list. 

This empirical induction not only conforms
to the paradigms and findings of
the present experiments but also will
handle other findings. For example,
when there is high intralist similarity
among syllables within a list, no effect on
retention is noted if the list is learned by
DP (Underwood & Richardson, 1958).

The interference falling on a given 
association in this situation is caused by 
letter sequences which are appropriate or 
correct for other associations in the list; 
that is, the response units of the associa- 
tions producing the interference are cor- 
rect for other items in the list. The same 
situation holds for the RS paradigm in 
the present studies. To account for this 
empirical generalization—the generaliza- 
tion that DP will facilitate retention only 
when the response terms for the associa- 
tions producing the interference are not 
in the list being learned—we have used 
the notion of recovery-extinction cycles 
leading to a more permanent extinction 
of contextual associations.

Why these contextual associations
“behave” differently than other associations
(e.g., specific S-R associations) is
not known. Of course, it should be
remembered that the present experiments 
have only scratched the surface in terms 
of studying what appear to be the 
relevant variables and their interaction. 
Degree of learning of the interfering lists, 
degree of learning of the list being inter- 
fered with and to be recalled, length of 
DP interval, and length of retention 
interval, all should be pertinent variables. 
Clearly they are pertinent in terms of the 
extinction-recovery theory but they un- 
doubtedly would also be judged to be of 
importance by any careful empirical
analysis.

The final point of discussion concerns
the results of Exp. 3. In this experiment
the RD paradigm was used and, in
conformance with the results of Exp. 1
and 2 with this paradigm, facilitation in
retention of List 4 was observed following
learning by DP. The effects on retention
were rather substantial in that
relearning was enhanced following DP.
Clearly, PI was reduced by DP. It had 
been expected that the 3-min. DP in- 
terval would produce more facilitation 
than would the 1-min. DP interval. 
There was no strong evidence of differ- 
ences in the effect of these two conditions. 
It is true that relearning was a little 
better following a 3-min. DP interval
than following the 1-min. interval. 
Also, analyses of particular items showed 
that items having the greatest number 
of correct anticipations at the end of 
learning of List 4 produced higher re- 
call following 3-min. DP than following 
1-min. DP. Still, the effects were not as 
great as anticipated. In terms of the 
extinction-recovery notion, it might be 
argued that 1 min. was sufficient for the 
recovery of all interfering tendencies for 
the relatively low degree of learning of 
the interfering lists used. If this is true, 
a more direct relationship between re- 
tention and length of intertrial interval 
would be expected if the degree of learn- 
ing of the interfering associations were 
higher. A comparison of the results for 
the RD paradigm of Exp. 1 and 2 would 
tend to support this notion. In these 
experiments the degree of learning of 
the interfering associations was much
stronger than in Exp. 3. The 3-min. 
interval of Exp. 2 gave somewhat greater 
facilitation in retention than did the 60- 
sec. DP interval used in Exp. 1. How- 
ever, this is by no means a clear test 
since other factors also varied between 
the two experiments. In any event, 
some conviction is held for the notion that 
in most situations a direct relationship 
between length of intertrial interval and 
facilitation in retention will be found. 


SUMMARY 


Three experiments were performed to 
study retention following massed (MP) and 
distributed practice (DP) when interlist 
interference was high. In Exp. 1 and 2 half 
the Ss learned four lists of paired associates 
(syllable-adjective pairs) in which the stimuli
were identical across all lists but with the
responses different in each list (RD para- 
digm). The other Ss learned four lists in 
which the stimuli and responses were identical 
across all lists but with different pairings for 
each list (RS paradigm). Distributed practice
was introduced in learning List 4, with
retention of this list measured after 24 hr. 
In Exp. 1 the DP interval was 60 sec. between 
each trial while in Exp. 2 the interval was 
3 min. after every other trial; MP consisted 
of 4 sec. between trials for both experiments. 

The results were essentially the same for 
Exp. 1 and 2; DP facilitated recall of List 4 
only for the RD paradigm. These findings 
indicate that DP will facilitate retention only 
when the response terms of previously ac- 
quired associations producing the interference 
are not present in the list being learned by 
DP. Theoretically, the results imply that 
DP allows for a more permanent extinction of 
contextual associations but does not influence 
specific S-R associations. 

Experiment 3 used only the RD paradigm 
with a low degree of learning of all four lists 
and with intertrial intervals of 4, 60, and 180 
sec. on List 4. Distributed practice markedly 
facilitated both recall and relearning of
List 4. However, no appreciable difference
was noted in the results for the 60- and 180-sec.
intervals.


REFERENCES 


BARNES, J. M. “Fate” revisited. Unpublished
doctoral dissertation, Northwestern
University, 1960.

BARNES, J. M., & UNDERWOOD, B. J. “Fate”
of first-list associations in transfer theory.
J. exp. Psychol., 1959, 58, 97-105.

BRIGGS, G. E. Acquisition, extinction, and
recovery functions in retroactive inhibition.
J. exp. Psychol., 1954, 47, 285-293.

UNDERWOOD, B. J., & RICHARDSON, J.
Studies of distributed practice: XVIII. The
influence of meaningfulness and intralist
similarity of serial nonsense lists. J. exp.
Psychol., 1958, 56, 213-219.

UNDERWOOD, B. J., & SCHULZ, R. W. Studies
of distributed practice: XX. Sources of
interference associated with differences in
learning and retention. J. exp. Psychol.,
1961, 61, 228-235.


(Received August 25, 1961) 


Journal of Experimental Psychology 
1962, Vol. 64, No. 4, 364-372 


THE ROLE OF RESPONSE SIMILARITY IN PROACTIVE INHIBITION ¹


KENT M. DALLETT ²


University of California


It has generally been supposed that
interlist response similarity is an important
determinant of interlist interference.
However, while variations
in response similarity have several
times been shown to affect retroactive
inhibition (RI) (Slamecka & Ceraso,
1960), effects of interlist similarity on
proactive inhibition in retention (PI)
have not yet been demonstrated.
Young (1955) failed to find significant
differences in PI as a function of
response similarity, although he did
find the expected differences in RI.
Similar findings with respect to PI
were reported by Morgan and Underwood
(1950), the only significant PI
in their study resulting from the use of
identical stimuli and dissimilar responses.
In each of the studies
mentioned, however, there was an
insignificant trend suggesting an effect
similar to that generally found in RI;
i.e., recall improved as response
similarity increased. This trend was
directly opposed to the differences
predicted by Young (1955), who supposed
that as response similarity
increased, greater generalized strength
would be added to List 1, increasing
its potential for interfering with List 2
recall.


1 This paper is based upon a dissertation
submitted in partial fulfillment of the requirements
for the PhD degree at the University
of California. The author is grateful
to Leo Postman for his enthusiastic advice and
encouragement at all stages of the research.
At the time, the author was a National Institute
of Mental Health Predoctoral Research
Fellow, on Fellowship MF 12.379.

2 Now at University of California, Los
Angeles.

Recently, Barnes and Underwood 
(1959) have proposed separate mech- 
anisms for the learning of similar and 
dissimilar responses in List 2. When 
the responses in List 2 are dissimilar 
to those on List 1 (with identical 
stimuli: A-B, A-C), List 1 responses 
are unlearned during List 2 learning; 
while with similar responses, it is 
suggested that S makes use of the 
List 1 response as a mediator. Thus, 
in learning A-B’ after A-B, S is as- 
sumed to be learning A-(B)-B’, al- 
though Ss report the dropping-out of 
the mediator as learning progresses. 
Accepting Barnes and Underwood's 
analysis, and assuming that PI results 
from the response competition pro- 
duced when unlearned List 1 responses 
spontaneously recover in strength, one 
might be led to expect considerable 
PI in the A-B, A-C paradigm, but 
little or no PI in the case of A-B, A-B’
(Postman, 1961). This analysis can 
readily be extended to intermediate 
degrees of response similarity by as- 
suming that as response similarity 
increases from dissimilarity, unlearn- 
ing gradually gives way to mediation. 
In this way, the data of Morgan and 
Underwood (1950) and of Young 
(1955) can be explained. 

However, we do not know that the 
unlearning-and-recovery sequence is
the only mechanism involved in PI.
It is possible, for example, that
similar responses may lead to loss
of differentiation (Underwood, 1945)
with the passage of time. Thus, Ss
may recall both responses, but may
not remember which response belongs
in List 2. One might, on the other
hand, expect proactive facilitation if
the List 1 mediator protects List 2 
responses from the extraexperimental 
interference to which a control group 
may be subject. These possibilities 
hold for delayed recall only, little PI 
being expected at short intervals in 
any case. Indeed, since PI appears 
to increase with time, it would seem 
advisable to investigate the effects of 
response similarity on PI using reten- 
tion intervals considerably longer than 
the 20-min. retention intervals em- 
ployed by both Morgan and Under- 
wood (1950), and Young (1955). 

In the experiment to be reported, 
independent groups of Ss learned two 
lists with identical stimuli (SI) and
highly similar (SIR1), less similar
(SIR2) or dissimilar (SIRN) responses,
different Ss being tested for recall of
List 2 either 30 sec. or 48 hr. after
learning. In addition, 30-sec. and
48-hr. groups were tested under each
of two control conditions: (a) the
standard PI control group, having no
first list (No PL), and (b) a group
having dissimilar stimuli and responses
on the two lists (SNRN).
This second control condition served 
as a control for warm-up and learning- 
to-learn effects in the learning of List 
2, and also was intended to sample the 
low-similarity end of the continuum 
of stimulus similarity. 


METHOD 


Materials and apparatus.—The words used
as responses were chosen from Haagen’s
(1949) norms. Subjects in all conditions
learned the same List 2. Corresponding to a
given word on List 2, words were chosen for
three different first lists so as to be highly
similar (R1), less highly similar (R2), or dissimilar
(RN) to the List 2 word. The R1 and
R2 lists had mean Haagen similarities of 1.40
and 3.36, respectively. The RN list was made
up of Haagen words not in the same category
as the corresponding List 2 response. In
addition, an attempt was made to match
corresponding words for familiarity using
Haagen’s norms, and for frequency of usage
using the Lorge magazine count (Thorndike
& Lorge, 1944). Thus, the three sets of
List 1 responses, as well as the List 2, or
“standard” set, were matched both in mean
frequency and familiarity and in frequency
and familiarity of corresponding words. The
responses are presented in Table 1.


TABLE 1
RESPONSES USED IN THE EXPERIMENT

                            List 1
List 2        R1            R2            RN
AGILE         NIMBLE        ALERT         UNKIND
BELOVED       CHERISHED     VALUED        PETTY
COMPLETE      ENTIRE        PERFECT       HEAVY
CRAFTY        CUNNING       STEALTHY      PRIOR
DISTANT       REMOTE        FURTHER       SPOKEN
DECEASED      LIFELESS      EXTINCT       OBSCENE
FRUITFUL      FERTILE       PREGNANT      IMPURE
HAUGHTY       SNOBBISH      SCORNFUL      SHAKY
LIQUID        FLUID         JUICY         FOREMOST
SACRED        HOLY          TABOO         IDLE
SHINING       GLEAMING      SPARKLING     CONCEALED
WICKED        EVIL          VICIOUS       DAINTY

The stimuli were CVC trigrams of 93- 
100% Glaze association values (Hilgard, 
1951). Two sets of 12 trigrams were selected 
so as to minimize both interlist and intralist 
similarity. Only in two instances did syl- 
lables on the two lists have two letters in 
common, and within each list there were 16 
repetitions of letters. 

Stimuli and responses were combined in 
four different pairings, making a total of 20 
lists. Each list was presented in four differ- 
ent orders. The lists were presented on a 
Phipps and Bird memory drum at a 2:3 sec. 
rate, with 6 sec. between trials. 

Procedure.—Subjects were seated before 
the memory drum and read standard paired- 
associate learning instructions. On the first 
presentation of each list they were not re- 
quired to respond, but on subsequent pres- 
entations they were encouraged to try to 
anticipate as many of the words as possible, 
and it was made clear that there was no 
penalty for guessing. List 1 was then pre- 
sented nine times. The list was changed, 
and S was told that he was to learn List 2 “in 
the same way.” The change of lists required 
30-40 sec. Nine trials were given on List 2.
Following List 2, 30-sec. Ss were encouraged
to stretch their legs for a few seconds; after
approximately 15 sec. instructions for recall 
were given them. Those Ss assigned to the 
48-hr. recall groups were told to return in 2 
days for the “second hour of the experiment.” 

The recall instructions differed slightly in
wording for the 30-sec. and 48-hr. groups.
For each condition, however, it was empha- 
sized that they were to recall the second list, 
and that they should begin with the very first 
syllable they saw. The differences in wording 
were minor, consisting mainly of greater 
emphasis on the fact that List 2 was in 
question for the 48-hr. Ss, and presentation of 
the task as involving possible effects of a 
“brief pause” for the 30-sec. Ss. 
Subjects.—The Ss were students from the
introductory course in psychology at the
University of California. They participated
in order to fulfill a course requirement, and
were naive with respect to verbal learning.
They were assigned to conditions at random
prior to their appearance, except that 30-sec.
and 48-hr. Ss were tested in alternation.
Since the experiment was concerned with
the effects of List 1, it was felt advisable to
discard Ss who did not meet a minimal
criterion of List 1 learning within the nine
trials allowed. On the basis of the pilot Ss,
it was decided to discard Ss who had not 
achieved at least five correct anticipations on 
any trial. Seven Ss were dropped for not 
reaching this criterion. An additional 9 Ss 
were dropped for failure to follow instructions; 
6 of these admitted 


TABLE 2
CORRECT RESPONSES AND ERRORS IN LIST 2 LEARNING

                           Correct        Intralist   Interlist
Condition   Interval     Mean     SD      Errors %    Errors %
No PL       [illegible]
SIR1        30 sec.     77.08    8.28       16.9        18.0
            48 hr.      78.67    9.52       21.0        15.6


FIG. 1. List 2 learning curves. (Ordinate: mean correct anticipations; abscissa: trials.)



usually do quite well when the instructions 
were explained to them. An additional 3 Ss 
were dropped for E error or apparatus failure. 
The number of Ss discarded for any one of the 
above reasons in any one similarity condition 
(pooling 30-sec. and 48-hr. groups) did not
exceed 3. A total of 120 Ss, 12 in each of the
10 groups, was retained.


RESULTS


List 1 learning.—An analysis of 
variance of total correct anticipations 
for the eight anticipation trials of
List 1 revealed no differences significant
at the .05 level. A similar
analysis of intralist errors also revealed
no differences. The mean of
total correct anticipations for all
groups combined was 49.95, with SDs
ranging from 11.37 to 18.21. For
intralist errors, the mean was 8.22,
with SDs from 2.47 to 7.44. The
groups with high response similarity
(SIR1) made a total of seven interlist
errors (words scheduled to appear in 
List 2) before ever having seen List 2. 

List 2 learning.—An analysis of
variance of total correct anticipations
similar to that employed for List 1
learning was carried out. Groups 
scheduled for 30-sec. and 48-hr. recall, 
not yet differentially treated, did not 
differ significantly from one another. 
There were significant differences 
among the groups as a function of 
interlist similarity (F = 23.738, 
df = 4/110, P < .005), and these 
differences, while small, were in the 
expected direction (Table 2). All 
groups showed net positive transfer 
with respect to the group with no PL. 
The SIRN groups, while showing positive
transfer with respect to No PL,
were slightly inferior to the SNRN
groups. This is in agreement with
several recent studies which have
shown that SIRN may produce negative
transfer only with respect to a
group with equivalent warm-up and 
learning-to-learn experience (e.g., 
Besch & Reynolds, 1958; Spiker & 
Holton, 1958). Examination of the 
learning curves (Fig. 1) indicates that 
after the nine trials, all groups with 
prior learning are within one correct 
anticipation of one another, while the 
No PL Ss reached a criterion com- 
parable to that reached by the other 
groups on List 1. 

Since differences were found among 
groups in correct anticipations, all 
error measures were corrected for 
opportunity. The total number of 
intralist errors for each S was ex- 
pressed as a proportion of that S’s 
opportunities for error: each stimulus 
presentation not resulting in a correct 
anticipation was considered one such 
opportunity. The resulting propor- 
tions were submitted to an arc-sine 
transformation, and an analysis of 
variance was carried out. No sig- 
nificant differences in intralist errors 
were found. In the case of interlist 
errors, which were extremely infre- 
quent in all groups except those with 
similar responses (SIR1, SIR2), the
total number of such errors was
expressed as a proportion of total 
opportunities for each group. These 
data are presented as percentages in 
Table 2: the percentages, when plot- 
ted against decreasing interlist simi- 
larity, yield decreasing regular func- 
tions, with the 30-sec. groups closely 
paralleling the 48-hr. groups. A 
similar gradient results from plotting 
the number of Ss making at least one 
interlist error. In the R1 groups,
21/24 Ss made at least one interlist
error, while in the R2 groups 12/24 Ss
made such errors, and in the SIRN
groups only 5/24 Ss made interlist
errors. 
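
The correction for opportunity and the arc-sine transformation applied to intralist errors can be sketched as follows. This is an illustration under stated assumptions, not the author's computation: the function names are hypothetical, the figure of 96 stimulus presentations assumes 12 items on each of eight anticipation trials, and the transformation is taken in the simple form arcsin(sqrt(p)), whereas some treatments use twice that angle.

```python
import math


def error_proportion(errors, presentations, correct):
    """Errors divided by opportunities, where an opportunity is any stimulus
    presentation that did not result in a correct anticipation."""
    opportunities = presentations - correct
    if opportunities <= 0:
        return 0.0  # assumption: an S with no opportunities contributes zero
    return errors / opportunities


def arcsine_transform(p):
    """Arc-sine (angular) transformation of a proportion, in radians."""
    if not 0.0 <= p <= 1.0:
        raise ValueError("proportion must lie between 0 and 1")
    return math.asin(math.sqrt(p))


# Hypothetical S: 96 presentations, 50 correct anticipations, 8 intralist errors.
p = error_proportion(errors=8, presentations=96, correct=50)
angle = arcsine_transform(p)
```

The transformed values, one per S, would then be the scores entered into the analysis of variance.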

Recall.—In the evaluation of List 2 
recall, two methods of estimating 
differences attributable to differences 
in List 2 learning have been used. In 
one, the 30-sec. recall test is used to 
estimate the strength of List 2 at the 
end of learning: any difference be- 
tween 30-sec. and 48-hr. recall scores 
is attributable to the differential time 
of testing. A similar estimate results 
from the other method used, the 
successive probability analysis (Underwood,
1954, 1956 unpublished ³).
These methods do not allow one 
to say whether groups which have 
attained different criteria forget differ- 
ent amounts because of the experi- 
mental treatment, or simply because 
differential amounts of loss are char- 
acteristic of different criteria. The 
ambiguity resulting from different 
groups having different performance 
levels at the end of learning arises 
primarily with respect to comparisons 
of the group having no PL with the 
other groups, the No PL Ss having 
ended List 2 learning at a lower level 
of performance. However, there is
no pressing reason to suppose that Ss
who have reached a lower criterion
would ordinarily forget less than Ss
who have reached a higher criterion,
which appears to have been the case
in these data.


3 Unpublished manuscript by B. J. Underwood
entitled, “Strength of association and
forgetting.”


TABLE 3
CORRECT RESPONSES AND ERRORS IN RECALL

                           Correct        Intralist   Interlist
Condition   Interval     Mean     SD      Errors %    Errors %
No PL       30 sec.      8.66    2.53       16.2         —
            48 hr.       ?.67    1.88        6.3         —
SNRN        30 sec.     11.00    1.71        8.3        0.0
            48 hr.       7.42    2.27        6.4        0.0
SIRN        30 sec.     11.17    1.19       10.0        0.0
            48 hr.       5.33    3.05        2.5       10.0
SIR2        30 sec.      9.83    2.04       13.5        9.6
            48 hr.       7.50    2.07        5.5       11.1
SIR1        30 sec.     10.92    1.00       19.2       19.2
            48 hr.       7.25    2.09        3.5       18.4


Turning first to the comparison of
30-sec. and 48-hr. recall, an analysis
of variance revealed that the interaction
of retention interval and conditions
was significant at the .05 level
(F = 2.822, df = 4/110). The main
effects of Conditions (F = 3.053,
df = 4/110) and Retention Interval
(F = 93.04, df = 1/110) were also 
significant, indicating merely that 
different conditions performed differ- 
ently in retention (as in learning), and 
that forgetting occurred. The mean 
recall scores are presented in Table 3, 
They suggest that a good part of the
significant interaction may be due to
the fact that the loss in SIRN is larger
than the loss in any other group.
When the interaction is partitioned,
SIRN accounts for most of the variance,
the difference among the other
groups in amount lost being in- 
significant. 
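
The comparison of losses can be checked by simple arithmetic on the legible mean recall scores of Table 3 (30-sec. mean minus 48-hr. mean). The sketch below is only that arithmetic; the assignment of condition labels to rows is inferred from the order of conditions in the text and should be read as an assumption, and the No PL group is omitted because its 48-hr. mean is not fully legible in the source.

```python
# Mean recall (30 sec., 48 hr.) as read from Table 3; row labels are an assumption.
recall_means = {
    "SNRN": (11.00, 7.42),
    "SIRN": (11.17, 5.33),
    "SIR2": (9.83, 7.50),
    "SIR1": (10.92, 7.25),
}

# Loss over the retention interval, rounded to the precision of the table.
losses = {
    condition: round(short - long, 2)
    for condition, (short, long) in recall_means.items()
}

# Condition showing the greatest drop from 30-sec. to 48-hr. recall.
largest_loss = max(losses, key=losses.get)
```

On this reading the drop for the condition with identical stimuli and dissimilar responses is the largest of the four, which agrees with the statement that this condition accounts for most of the interaction variance.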

For the successive probability anal- 


KENT M. DALLETT 


ysis, the probability of a correct 
response on Trial 9 following 1,2, ... | 
7 correct anticipations on Trials 2-8 
was obtained for each condition, using 
the combined learning data of the 30- 
sec. and 48-hr. groups. Since the 
probability of a correct anticipation 
following seven correct anticipations — 
was .95 or better in all groups, the 
probability of a correct response fol- _ 
lowing eight correct antici pations was 
assigned an arbitrary value of 1.00. 
Loss scores were obtained for each 
48-hr. S individually, by computing 
an expected recall score on the basis 
of the probability analysis, and sub- 
tracting from this value the obtained 
recall. These loss scores (Table 4 
were subjected to an analysis of- 
variance which confirmed the results 
of the first analysis of raw recall — 
scores. The conditions differ in 
amount lost (F = 4.53, df = 4/55, 
P <.01), and the differential loss | 
appears to result mainly from the 1 
greater loss in Cond. S;Ry, with no 
difference among the other groups. 
The logic of the experiment, however, | 
justifies one selected comparison be- | 
tween the No PL control group and 
the other groups (with SiRy ex- 
cluded). This difference is significant 
at the .05 level (F = 4.868, df = 1/55), 
but since it is a selected comparison, 
one would probably insist upon sig- 
nificance at the .01 level or better. 
Finally, a test described by Snedecor 
(1957, p. 251) which allows for the 


TABLE 4
MEANS AND SDs OF EXPECTED RECALL,
BASED ON PROBABILITY ANALYSIS,
MINUS OBTAINED RECALL

        |                    Condition
Measure | No PL | S_N R_N | S_I R_N | S_I R_2 | S_I R_1
Mean    | 2.61  | 4.06    | 6.05    | 3.88    | [illegible]
SD      | 1.31  | 1.97    | 2.98    | 1.39    | [illegible]




change in probabilities attendant upon
repeated testing of selected groups
also indicated that the only significant
difference in the loss scores is between
S_I R_N and the other groups.
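
The loss-score computation described above can be sketched as follows. This is a minimal illustration, not the author's procedure in detail: it assumes that an S's expected recall is the sum, over his 12 items, of the probability of a correct response given each item's number of correct anticipations. The probability values, the entry for zero correct anticipations, and the item counts below are hypothetical and are not taken from the data.

```python
def expected_recall(correct_counts, p_correct_given_k):
    """Sum, over items, the probability of a correct response given the
    item's number of correct anticipations during learning."""
    return sum(p_correct_given_k[k] for k in correct_counts)


def loss_score(correct_counts, p_correct_given_k, obtained_recall):
    """Expected recall (from the probability analysis) minus obtained recall."""
    return expected_recall(correct_counts, p_correct_given_k) - obtained_recall


# Hypothetical probabilities indexed by k = 0..8 correct anticipations.
# Only the final value (1.00 for k = 8) follows the text; the rest are
# made-up numbers for illustration.
P_HYPOTHETICAL = [0.10, 0.25, 0.40, 0.55, 0.70, 0.80, 0.90, 0.95, 1.00]

# Hypothetical S: number of correct anticipations on each of 12 items.
counts = [8, 8, 7, 7, 6, 5, 5, 4, 3, 2, 1, 0]

expected = expected_recall(counts, P_HYPOTHETICAL)
loss = loss_score(counts, P_HYPOTHETICAL, obtained_recall=5)
print(round(expected, 2), round(loss, 2))
```

Because the expected score is built item by item, two Ss with the same total number of correct anticipations can still receive different expected recall scores.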

Thus, only in Cond. S_I R_N has PI
been demonstrated. This finding is
the same as that reported by Morgan
and Underwood (1950) with 20-min.
recall.

Intralist and interlist errors.—Since 
errors were not considered to be fre- 
quent enough to justify correcting 
each S’s errors, total errors of each 
kind for each group of Ss were ex- 
pressed as a proportion of total 
opportunities for that group. As in 
learning, there does not seem to be any 
obvious systematic relationship be- 
tween intralist errors and recall. 

Interlist errors as a percentage of opportunity increase from 30 sec.
to 48 hr. for S_I R_N (Table 3) while the two groups with similar
responses manifest the same percentage of interlist errors in the two
recall tests. The S_N R_N groups made no interlist errors in recall.
Since 48-hr. recall scores were lower than 30-sec. recall scores,
S_I R_N, S_I R_2, and S_I R_1 all showed an increase in the absolute
frequencies of interlist errors, but only in S_I R_N was this increase
out of proportion to the increase in opportunities for error.
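
The correction for opportunity can be illustrated with a short sketch. It assumes that an "opportunity" is any item on which the correct response was not given; the text implies this but does not define it formally. All counts below are hypothetical.

```python
def errors_per_opportunity(n_interlist_errors, n_items_tested, n_correct):
    """Interlist errors as a proportion of opportunities, where an
    opportunity is assumed to be any item not answered correctly."""
    opportunities = n_items_tested - n_correct
    if opportunities <= 0:
        return 0.0
    return n_interlist_errors / opportunities


# Hypothetical group totals (12 Ss x 12 items = 144 item tests per group).
rate_30_sec = errors_per_opportunity(3, 144, 120)  # 3 errors, 24 opportunities
rate_48_hr = errors_per_opportunity(9, 144, 72)    # 9 errors, 72 opportunities
print(rate_30_sec, rate_48_hr)
```

In this made-up case the absolute number of errors triples while the proportion of opportunities stays the same, which is the pattern the text attributes to the similar-response groups.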

Transfer and recall as a function of 
item strength—In the analyses just 
presented, the measures employed 
represented an average of each S’s 
performance on 12 items. It is as- 
sumed that such scores indicate what 
is going on in the learning and recall 
of individual items. However, few 
analyses in terms of individual items 
have been reported (Runquist, 1957). 
The procedure followed in this experi- 
ment, of giving a fixed number of 
trials on each list, is particularly 
suited to the analysis of item strength, 
since an item with a relatively large
number of correct anticipations ("reinforcements")
is likely to have high
strength regardless of whether this 
strength is the result of its being an 
easy item or the result of its being 
learned by a fast learner. This state 
of affairs should be contrasted with 
the situation encountered when Ss 
learn to criterion. Here, a high 
number of reinforcements may indi- 
cate an easy item or a slow learner. 
Since a slow learner may be assumed 
to gain less associative strength 
per reinforcement (Underwood, 1954) 
there is a confounding of two factors 
(slow learners and easy items) with 
opposed effects. Runquist (1957) at- 
tempted to reduce this confounding 
by ranking items within Ss. Here, no 
such procedure was felt to be neces- 
sary. Not only do the ability of the 
learner and the ease of the item work 
in the same way, but an examination 
of the data revealed that each S 
covered a considerable range of item 
strengths, with no overwhelming tend- 
ency for any one S to contribute only 
weak, or only strong, items. 

In preparing Fig. 2 and 3, the num- 
ber of correct anticipations of an item 
in List 1 learning was taken as a 
direct measure of that item’s List 1 
strength, and the successive prob- 
ability analysis was used to adjust for 
List 2 strength. All curves were 
smoothed by the use of three-point 
moving averages: the 0 and 8 data are 
based on averages of two points. 
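
The smoothing rule just described can be sketched as below. The function name and the sample values are illustrative only; the rule itself (three-point averages, with the two end points averaged over two values) is the one stated in the text.

```python
def smooth_three_point(values):
    """Three-point moving average; the first and last points use the
    two available values, as described for the 0 and 8 data."""
    n = len(values)
    if n < 2:
        return list(values)
    smoothed = []
    for i in range(n):
        window = values[max(0, i - 1):min(n, i + 2)]
        smoothed.append(sum(window) / len(window))
    return smoothed


# Hypothetical raw curve, one value per level of List 1 strength.
print(smooth_three_point([0, 3, 6, 3, 0]))
```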

Each of the figures presents recall
data as a function of the strength of
corresponding items on List 1. A
corresponding item, for most groups,
was the item with the same stimulus.
For Cond. S_N R_N, the correspondence
is formal, and reflects (a) minor differences
in word frequency, minimized
for corresponding words, and (b) common
relative serial positions in the
four presentation orders.


Fig. 2. Recall losses after 48 hr. as a function of List 1 strength.
(Ordinate: items lost, List 2; abscissa: reinforcements on corresponding
List 1 item.)


Figure 2 shows the losses (obtained 
recall subtracted from the prediction 
of the probability analysis) in terms 
of the number of items lost. Figure 3 
shows percentages of recall, obtained 
by dividing observed by predicted 
recall and multiplying by 100. There
are no a priori reasons for supposing
one way of presenting the data to be
better than the other. However,
Underwood² has shown that in at
least one instance, percentage of recall 
was monotonically related to strength, 
as expected, while the relationship be- 
tween loss and strength was non- 
monotonic. 
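
The two ways of presenting the data can be compared in a small sketch. The numbers are hypothetical and serve only to show that the two measures need not order items in the same way.

```python
def items_lost(predicted, observed):
    """Loss measure: predicted recall minus obtained recall."""
    return predicted - observed


def percent_recall(predicted, observed):
    """Percentage measure: obtained recall as a percentage of predicted."""
    return 100.0 * observed / predicted


# Hypothetical (predicted, observed) recall for weak and strong items.
weak = (2.0, 1.0)
strong = (8.0, 6.0)

print(items_lost(*weak), percent_recall(*weak))      # smaller loss, lower percentage
print(items_lost(*strong), percent_recall(*strong))  # larger loss, higher percentage
```

With these made-up values the strong items lose more in absolute terms yet are recalled at a higher percentage, so the choice of measure can change the apparent relation to strength.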

Examination of Fig. 2 and 3 reveals 
that, for some or all degrees of List 1 
strength, there is an indication of PI 
in every group. In Cond. S_I R_N,
there appears to be a relative maxi- 
mum of interference in the vicinity of 
six reinforcements on List 1. This is 
in agreement with Underwood’s (1945) 
finding that maximal interference 
seemed to result when items were of 
roughly equal strength on the two 
lists: assuming a normal distribution 
of item strengths, it would not be
unreasonable to expect that most item
pairs of roughly equal strength would 
also be of intermediate strength. The 
finding is also similar to the familiar 
fact that, in RI, increased degrees of 
IL first increase, and then decrease 
the amount of RI. Particularly in- 
teresting is the high degree of for- 
getting in the groups with similar 
responses, when the List 1 response is 
weak. This may help to explain why 
relatively little PI has been obtained 
with similar responses: it may be 
necessary to keep List 1 strength 
low to obtain maximal interference. 
There is no reason to expect that the 
finding of maximal interference with 
approximate equality of the two lists 
should be generalizable beyond the 
S_I R_N case.


Discussion 


It will be recalled that Barnes and Underwood (1959) suggested that the
unlearning-and-recovery sequence of events applied only to the S_I R_N
condition, and that if one assumed no other mechanisms of interference,
only S_I R_N would produce PI. However, it was also proposed that with
similar responses, loss of differentiation of list membership might lead
to forgetting in a delayed test of recall. The data on interlist errors
in the present experiment, while not providing a direct test of either
mechanism, have an important bearing on this question. All groups with
identical stimuli show an increase in interlist errors from 30 sec. to
48 hr. Such errors are rare in Cond. S_N R_N, and are not made at all in
recall. An increase in the number of interlist errors may or may not
reflect loss of differentiation, depending on whether one grants primary
importance to the error, or to the failure to respond correctly which
made an error possible. If one assumes that S first loses the correct
response, and then may or may not make an error, then (using the
correction for opportunity) the data indicate increased interlist
generalization only for S_I R_N, the other groups continuing to
contribute interlist errors in the same proportion to opportunities at
48 hr. as at 30 sec. If, on the other hand, one assumes that the error
displaces a correct response which would otherwise be given, then the
appropriate comparison is between the absolute numbers of errors, which
increase in all groups. This would leave unexplained the fact that with
similar responses, the increase is proportional to the forgetting
obtained, while in S_I R_N it is not. The author's preference is to
assume that loss of differentiation is of relatively minor importance,
as suggested by the overall equality of total forgetting in all groups
except S_I R_N, and that the disproportionate increase of errors in the
S_I R_N condition is an indication of recovery of List 1 responses. Such
a conclusion is in agreement with the implications of Barnes and
Underwood's data, and should suffice as a conservative explanation of
the present experiment.

Fig. 3. Percentage recalled as a function of List 1 strength.
(Ordinate: % recall, List 2; abscissa: reinforcements on corresponding
List 1 item.)

Two things remain unexplained: the indication of some PI in the S_N R_N
item analysis, and the apparent interaction of PI and List 1 strength in
the similar-response conditions. In the case of S_N R_N, it is clear that
the total stimulus contexts of the two lists are not completely
dissimilar; in fact, letters common to the two sets of stimuli may have
been sufficient to generate a marginal amount of interlist interference.
Other common stimuli involve the room, E, the memory drum, etc.

In the similar-response groups, the 
mediation hypothesis leads one to suspect 
that the items readily forgotten were 
those with weak mediators, while the 
items well recalled were those with strong 
mediators. Weak mediators might be 
eliminated by extraexperimental interfer- 
ence, and may provide ideal conditions 
for the confusion of the two responses. 
Thus, while loss of differentiation seems 
not to explain the data as a whole, it may 
interact with the mediation mechanism, 
being effective only with low List 1 
strength. The explanation of List 2 for- 
getting as a result of the loss of the List 1 
mediator runs into several difficulties. 
First, the transfer data suggest that when 
a List 1 item is weak, the dependence of 
the List 2 response upon it should be 
minimal—and the loss of an ineffective 
mediator would be of little consequence. 
Furthermore, the mediation involved 
may not be a simple response-chaining 
process in which the List 1 response 
serves as a cue for the List 2 response. If 
a weak List 1 response is strengthened 
during List 2 learning (as RI experiments 
seem to suggest), then an interdepend- 
ence of List 1 and List 2 responses could 
be built up which would be more complex 
than the simple response chain assumed 
to occur when a strong mediator is 
available. It is obvious that further 
study of the similar response conditions 
is necessary before any conclusions can 
be reached regarding these possibilities. 

The data of the present experiment 
once more indicate the usefulness of con- 
sidering the unlearning-and-recovery se- 
quence as a basic process in RI and PI, 
and suggest that while interlist similarity 
per se is not a highly potent variable, it 
may yet be found to produce large effects 
in interaction with degree of learning. 


SUMMARY 


Failures to demonstrate an effect of interlist response similarity on PI
have previously been reported in experiments in which short retention
intervals have been used, with relatively small amounts of PI. In the
experiment reported, an attempt was made to maximize the chances of
obtaining PI by using a 48-hr. retention interval.

The design involved 10 groups of Ss, each of which learned a common
List 2 after learning first lists which differed in their similarity to
List 2. The first lists had identical stimuli and similar, less similar,
or dissimilar responses in three of the basic conditions. Two further
conditions involved groups which learned either no List 1, or a List 1
in which both stimuli and responses were dissimilar to those on List 2.
In each condition, retention of List 2 was tested after 30 sec. and
48 hr., with independent groups tested at each time interval.

The results showed significant PI only for the condition in which the
two lists had identical stimuli and dissimilar responses. However, the
degree of List 1 strength associated with maximal interference was
different for each condition, suggesting that significant PI might be
obtained in each condition by appropriate manipulations of List 1
strength.


REFERENCES 


Barnes, J. M., & Underwood, B. J. "Fate" of first-list associations in
transfer theory. J. exp. Psychol., 1959, 58, 97-105.

Besch, N. F., & Reynolds, W. F. Associative interference in verbal
paired-associate learning. J. exp. Psychol., 1958, 55, 554-

Haagen, C. H. Synonymity, vividness, familiarity, and association-value
ratings of 400 pairs of common adjectives. J. Psychol., 1949, 30,
185-200.

Hilgard, E. R. Methods and procedures in the study of learning. In
S. S. Stevens (Ed.), Handbook of experimental psychology. New York:
Wiley, 1951.

Morgan, R. L., & Underwood, B. J. Proactive inhibition as a function of
response similarity. J. exp. Psychol., 1950, 40, 592-603.

Postman, L. The present status of interference theory. In C. N. Cofer
(Ed.), Verbal learning and verbal behavior. New York: McGraw-Hill, 1961.

Runquist, W. N. Retention of verbal associates as a function of
strength. J. exp. Psychol., 1957, 54, 369-374.

Slamecka, N. J., & Ceraso, J. Retroactive and proactive inhibition of
verbal learning. Psychol. Bull., 1960, 57, 449-475.

Snedecor, G. W. Statistical methods. Ames: Iowa State Coll. Press, 1957.

Spiker, C. C., & Holton, R. B. Associative transfer in motor
paired-associate learning as a function of amount of first-task
practice. J. exp. Psychol., 1958, 56, 123-132.

Thorndike, E. L., & Lorge, I. The teacher's word book of 30,000 words.
New York: Teacher's College, Columbia University, 1944.

Underwood, B. J. The effect of successive interpolations on retroactive
and proactive inhibition. Psychol. Monogr., 1945, 59(3, Whole No. 273).

Underwood, B. J. Speed of learning and amount retained: A consideration
of methodology. Psychol. Bull., 1954, 51, 276-282.

Young, R. K. Retroactive and proactive effects under varying conditions
of response similarity. J. exp. Psychol., 1955, 50, 113-119.


(Received August 29, 1961)


Journal of Experimental Psychology
1962, Vol. 64, No. 4, 373-379


EFFECTS OF NONREINFORCED TRIALS IN TWO-CHOICE 
LEARNING WITH NONCONTINGENT REINFORCEMENT¹


JAMES G. GREENO²


University of Minnesota 


The purpose of this paper is to 
examine the role of nonreinforced 
trials in a simple prediction situation. 
Stimulus sampling theories have suc- 
cessfully accounted for many of the 
results from choice experiments in 
which some one of the alternative re- 
sponses is reinforced on each trial (see 
Estes, 1959). However, these theories 
have not yet been extended so as to
provide an adequate account of results
from experiments which include
trials on which no reinforcement is
presented.


In the present studies, S was instructed to predict which of two lights
would flash on each of a series of trials. Following S's choice on each
trial, one of three events occurred. On some trials, one of the lights
flashed (E_1 or E_2), constituting a reinforcement for the response of
predicting the light that flashed (A_1 or A_2). On other trials, neither
light flashed (E_0), constituting a nonreinforced trial. The proportions
of trials on which E_1, E_2, and E_0 occurred are denoted π_1, π_2, and
π_0, respectively. In this paper, p_1 will designate the observed
proportion of A_1 choices.


¹ These results were included in the
author's dissertation, presented to the faculty 
of the Graduate School, University of Minne- 
sota, in partial fulfillment of the requirements 
for the PhD. Thanks are due to D. L. La- 
Berge, who served as major advisor for the 
dissertation and provided a critical reading of 
this paper. Marianne Larson and Muriel 
Dieteman assisted with data analysis. While 
conducting these studies, the author received 
financial support from the Ford Foundation 
Behavioral Science Training Program and 
from the Danforth Foundation. 

² Now at Indiana University.


Two hypotheses have been offered frequently to account for the effects
of E_0 trials. One of these is the identity hypothesis, which asserts
that an E_0 trial leaves choice probabilities unchanged. A second
suggestion is the correction hypothesis, which implies that an E_0 trial
reduces the probability of the response chosen by S on that trial. Estes
(1959) summarized the results of several experiments involving E_0
trials and suggested that the correction and identity hypotheses
describe processes which occur in different degrees in different
situations, depending upon such variables as instructions and
preliminary training.
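
The contrast between the two hypotheses can be made concrete with a small sketch. It assumes a linear-operator form of trial-by-trial change with a single learning-rate parameter `theta`; that form and the parameter value are assumptions made here for illustration and are not specified in the text. Only the treatment of E_0 trials differs between the two rules.

```python
def update_p1(p1, event, choice, theta, e0_rule):
    """Return the probability of an A1 choice after one trial.

    Assumed linear-operator updates (illustrative, not from the paper):
      E1 moves p1 toward 1, E2 moves p1 toward 0.
    On E0 trials:
      'identity'   leaves p1 unchanged;
      'correction' moves p1 away from the response S just made.
    """
    if event == "E1":
        return (1 - theta) * p1 + theta
    if event == "E2":
        return (1 - theta) * p1
    if event != "E0":
        raise ValueError("event must be 'E1', 'E2', or 'E0'")
    if e0_rule == "identity":
        return p1
    if e0_rule == "correction":
        if choice == "A1":
            return (1 - theta) * p1       # chosen A1 becomes less likely
        return (1 - theta) * p1 + theta   # chosen A2 becomes less likely
    raise ValueError("e0_rule must be 'identity' or 'correction'")


# One E0 trial following an A1 choice, with a hypothetical theta of .10.
print(update_p1(0.8, "E0", "A1", 0.1, "identity"))
print(update_p1(0.8, "E0", "A1", 0.1, "correction"))
```

Under the correction rule the average effect of an E_0 trial depends on how often each response is chosen, which is the property the ordinal tests in the experiments are designed to exploit.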

In particular, results obtained by Atkinson (1956), Neimark (1953), and
Millward (1960) are consistent with identity-hypothesis predictions.
However, Anderson and Grant (1957, 1958) and LaBerge, Greeno, and
Peterson (1962) have obtained results indicating that with π_1 > π_2,
E_0 trials reduce p_1; and sequential statistics reported in these
studies suggest that the obtained changes in p_1 may not have been
produced by a correction effect.

In most studies of the effects of E_0 trials, investigators have tested
for quantitative agreement between data and predictions from specific
theories. On the other hand, the present study is intended to provide
evidence regarding the qualitative properties of the effect of E_0
trials. Therefore, E_0 trials were presented in situations for which
ordinal predictions could be derived so as to differentiate between
hypotheses. In terms of empirical variables, the present experiments
were not designed primarily to test the quantitative effects of
experimental operations. Rather, these data were obtained in order to
determine whether certain variables are relevant in relation to the
effects of E_0 trials.


EXPERIMENT I 


This study was designed to investigate further the finding that E_0
trials reduce p_1. Evidence was sought regarding two questions:

1. Is the effect of an E_0 trial invariant with respect to the number of
trials on which S has received E_0? This question arises from the
possibility that the effect of E_0 trials might decrease over a series
of partially reinforced trials. This might occur if E_0 events acquired
secondary reinforcing properties through association with reinforced
trials in the sequence (Bush, 1960), or if the effect of E_0 trials
depended upon disrupting S's behavior and disruption effects diminished
as E_0 events continued to occur (Neimark, 1953).

2. Do E_0 trials reduce the asymptotic value of p_1? LaBerge, Greeno,
and Peterson's (1962) results included differences among mean choice
frequencies due to E_0 trials during acquisition. This second question,
then, simply asks whether such a difference also occurs during
near-asymptotic performance.


Method 
The 72 Ss were students in introductory . . .

. . . E emphasized that S "should make a choice on each trial, no matter
what happens," and then asked for questions. If S asked or remarked
about the E_0 events, E said, "That may happen on some trials. In any
case you should make a choice on each trial." No other instructions were
given regarding E_0 trials. Following the instructions, trials were
presented without interruption until all trials had been presented.

Each experimental trial consisted of the following events: buzzer
signal, 2 sec.; off, 1 sec.; reinforcement event, 1 sec.; off, 2 sec.
This temporal sequence was automatically controlled. On each trial, S's
choice and the reinforcement event were automatically recorded.

Reinforcement schedules were constructed by randomly ordering 20-trial
blocks of E_1 and E_2 trials, and then randomly adding the number of E_0
trials needed to satisfy the specified value of π_0. Each block, then,
included 20 reinforced trials, and a total of 20/(1 − π_0) trials.
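
The schedule construction just described can be sketched as follows. This is an illustration under stated assumptions, not the procedure actually used: the function name, the seeded generator, and the insertion method are hypothetical, and π_0 = .67 is treated as exactly 2/3 so that a block comes to 60 trials.

```python
import random


def build_block(pi1_share, pi0, rng, n_reinforced=20):
    """Build one block: 20 reinforced trials (E1/E2) in random order,
    with E0 trials added at random positions so that the block holds
    20 / (1 - pi0) trials in all."""
    n_e1 = round(n_reinforced * pi1_share)
    events = ["E1"] * n_e1 + ["E2"] * (n_reinforced - n_e1)
    rng.shuffle(events)
    total = round(n_reinforced / (1 - pi0))
    for _ in range(total - n_reinforced):
        # randint is inclusive, so an E0 may also land at the very end.
        events.insert(rng.randint(0, len(events)), "E0")
    return events


rng = random.Random(0)                      # seed chosen arbitrarily
block_90_10 = build_block(0.9, 0.5, rng)    # as for Group 50, Blocks 1-10
block_pre = build_block(0.5, 2 / 3, rng)    # as for Group 50P pretraining
print(len(block_90_10), len(block_pre))
```

Adding the E_0 trials after the reinforced trials are ordered keeps the relative order of E_1 and E_2 events fixed, which is consistent with matching those orders across groups.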

The three experimental groups received
reinforcement sequences as follows:


Group 0:
Blocks 1-10: π_1:π_2 = 90:10; π_0 = 0.
Group 50:
Blocks 1-10: π_1:π_2 = 90:10; π_0 = .50.
Group 50P:
Blocks P_1-P_3: π_1:π_2 = 50:50; π_0 = .67.
Blocks 1-5: π_1:π_2 = 90:10; π_0 = .50.


A comparison between Groups 50 and 50P is relevant to the question of
invariance of the effects of E_0 trials. The pretraining schedule
received by Group 50P was selected because LaBerge, Greeno, and Peterson
(1962) found that this sequence produced no change in p_1 for a 90:10
sequence with π_0 = 0. Thus, if Groups 50 and 50P were to differ over
Blocks 1-5 of the present study, then this difference could be
attributed to a change in the reinforcing effect of E_0 trials due to
their presence in the pretraining sequence.

A comparison of Groups 0 and 50 over Blocks 6-10 is relevant to the
question regarding near-asymptotic properties of E_0 effects.

Six different random schedules were used, with orders of E_1 and E_2
events matched across groups for comparable blocks. The right- and
left-hand response, respectively, was designated A_1 for one-half of the
Ss in each condition.


Results and Discussion 


The mean proportions of A_1 choices (p_1) for each group are graphed by
blocks in Fig. 1. Over the first five blocks of 90:10 reinforcement,
p_1 was obtained for each S. An estimate of σ² = .0106 (df = 36) was
obtained as the residual mean square of the 3 × 2 × 6 factorial analysis
of variance involving experimental groups, right or left sides, and
schedules as the factors. Then 90% confidence intervals for orthogonal
contrasts between group means were estimated as follows:


p_1(0) − ½[p_1(50) + p_1(50P)] = .091 ± .052;


p_1(50P) − p_1(50) = −.018 ± .060.


In qualitative terms, p_1 was less with π_0 = .50 than with π_0 = 0; and
p_1 with π_0 = .50 did not differ significantly as a result of the 50:50
pretraining trials. The difference due to E_0 trials over these blocks
replicates earlier findings and thus provides additional
contraindication for the identity hypothesis. Since there was not a
significant increase in p_1 due to the 50:50 pretraining, there is no
evidence that the effect of E_0 trials decreased over the partially
reinforced sequence of 50:50 trials.
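
The structure of the two contrasts can be checked with a short sketch. It assumes contrast weights of (1, −½, −½) and (0, −1, 1) over Groups 0, 50, and 50P, and 24 Ss per group (72 Ss divided equally among three groups); both are readings of the text rather than values stated in it. The sketch checks orthogonality and the relative size of the two standard errors only; it does not attempt to reproduce the reported interval widths.

```python
import math


def contrast_se(weights, sigma2, n_per_group):
    """Standard error of a contrast among independent group means
    with equal group sizes."""
    return math.sqrt(sigma2 * sum(w * w for w in weights) / n_per_group)


def are_orthogonal(w1, w2):
    """With equal n, two contrasts are orthogonal when the products of
    corresponding weights sum to zero."""
    return abs(sum(a * b for a, b in zip(w1, w2))) < 1e-12


# Group order: (0, 50, 50P). Weights are assumed from the contrasts above.
W_E0_EFFECT = (1.0, -0.5, -0.5)    # Group 0 vs. mean of Groups 50 and 50P
W_PRETRAINING = (0.0, -1.0, 1.0)   # Group 50P vs. Group 50

SIGMA2 = 0.0106      # residual mean square reported in the text
N_PER_GROUP = 24     # assumption: 72 Ss split equally among 3 groups

se_ratio = (contrast_se(W_E0_EFFECT, SIGMA2, N_PER_GROUP)
            / contrast_se(W_PRETRAINING, SIGMA2, N_PER_GROUP))
print(are_orthogonal(W_E0_EFFECT, W_PRETRAINING), round(se_ratio, 3))
```

Under these assumptions the ratio of the two standard errors is the square root of .75, which is close to the ratio of the two reported half-widths (.052/.060).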

In order to obtain evidence as to whether E_0 trials influenced
asymptotic values of p_1, the value of p_1 was obtained for each S in
Groups 0 and 50 for each block during Blocks 6-10. These scores were
subjected to analysis of trends (Grant, 1956). The F for overall linear
trend (1.36; df = 4/184; P > .25) and the F for difference between group
linear trends (1.16; df = 1/46; P > .75) were not significant.
Therefore, there is no statistical evidence that Groups 0 and 50 were
not at asymptote during Blocks 6-10. The F for difference between group
means was significant (9.28; df = 1/46; P < .0005), indicating that the
asymptotic value of p_1 was changed by the E_0 trials received by
Group 50.

Fig. 1. Mean proportions of A_1 choices across blocks of trials in
Exp. I. (In Blocks P_1-P_3, π_1:π_2 = 50:50. In Blocks 1-10,
π_1:π_2 = 90:10.)

In Table 1 are presented estimated means and variances of p_1 scores,
and proportions of events (p_{i,jk}) such that A_j and E_i occurred on
Trial n, and A_k occurred on Trial n + 1. Lines 1 and 2 of Table 1 are
relevant to the present discussion. Anderson and Grant (1958) used a
statistic based on similar data to estimate changes in p_1 across E_0
trials. Let y = p_{0,21}/π_0 − p_{0,12}/π_0. For Group 50 of the present
study, y = −.034, indicating that p_1 decreased across E_0 trials. This
result is consistent with those obtained in Anderson and Grant's
analysis, and suggests that the change in p_1 reported here was due to
effects of E_0 trials on choice probabilities, rather than to changes in
the effects of E_1 and E_2 events.

Taken together, the obtained difference in p_1 and the negative value of
y provide evidence against the identity hypothesis which is particularly
compelling, since it is based on near-asymptotic performance.
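
The y statistic can be worked through for Group 50 as a check. The sketch assumes that y is the proportion of A_2-to-A_1 shifts across E_0 trials minus the proportion of A_1-to-A_2 shifts, each divided by π_0, and that the two proportions for Group 50 in Table 1 are .051 and .068; both are readings of the printed formula and table rather than certainties.

```python
def y_statistic(p0_21, p0_12, pi0):
    """Estimated change in p1 across E0 trials: shifts from A2 to A1
    minus shifts from A1 to A2, per E0 trial."""
    return (p0_21 - p0_12) / pi0


# Values as read from the Group 50 row of Table 1 (Exp. I, Blocks 6-10).
y_group_50 = y_statistic(p0_21=0.051, p0_12=0.068, pi0=0.50)
print(round(y_group_50, 3))
```

A negative value means that, across E_0 trials, shifts away from A_1 outnumbered shifts toward it.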


EXPERIMENT II


This experiment was designed to provide evidence relevant to the
correction hypothesis, which implies that the effect of an E_0 trial
depends upon S's choice on the trial. Therefore, if the correction
hypothesis were correct, then the average effect of an E_0 trial would
depend upon response frequencies. An alternative possibility is that the
average effect of an E_0 trial might depend upon reinforcement
frequencies.

In the standard prediction situation with noncontingent reinforcement,
it is impossible to discriminate between the effect of these variables
on the basis of ordinal hypotheses, since response and reinforcement
frequencies turn out to be equal at asymptote. In order to remove this
equality, unequal incentives were introduced for the two response
alternatives. Under these conditions, S should choose the response
associated with the higher incentive more frequently than that response
is reinforced. Thus, it was hoped that the contributions of response
frequencies and reinforcement frequencies to the effect of E_0 trials
could be separated.

Optimal conditions for this purpose would include a condition for which
p_1 = .50 when π_1 > π_2 and π_0 = 0, and another condition for which
p_1 < .50 with π_1 = π_2 and π_0 = 0. The simplest form of the
correction hypothesis would predict that E_0 trials should increase p_1
in the second condition, and that E_0 trials would have no effect in the
first condition. On the other hand, if the effect of E_0 trials depended
upon reinforcement frequencies, there should be no effect due to E_0
trials in the second condition, and p_1 should be decreased by E_0
trials in the first condition.

The incentive operation used was to instruct S that he would receive
more points for a correct prediction of one light than the other.
Preliminary studies indicated that the optimal conditions described
above would be best approximated with a point ratio of 1:6.


Method 


The 250 Ss were students in introductory psychology classes. Procedures
and apparatus were the same as those used in Exp. I with the following
exceptions: Instead of facing a box with red lights and levers, S faced
a slanting panel which held two spring-release buttons with which he
indicated his choices. The signal light above the left button was red,
and the light above the right button was white. The numerals 1 and 6
were displayed beside the left and right buttons, respectively.

Instructions indicated that the experiment was a test of S's skill at
making choices. The task for S was to get as many points as he could by
predicting which light would flash on each trial; and S was told that he
would receive one point each time he correctly predicted the red
left-hand light and six points each time he correctly predicted the
white right-hand light. Practice trials and remarks about E_0 trials
were as in Exp. I.

This experiment consisted of three sections which were run separately.
In Section 75, π_1:π_2 = 75:25; in Section 50, π_1:π_2 = 50:50; in
Section 25, π_1:π_2 = 25:75. In each section, a group with π_0 = 0
(Groups 75/0, 50/0, 25/0) was compared with a group with π_0 = .50
(Groups 75/50, 50/50, 25/50).

In all cases, six blocks of trials were presented. Eight different
random schedules were used. Schedules were constructed in the manner
described for Exp. I. Schedules were matched within sections with
respect to the order of E_1 and E_2 events. Schedules were matched
across sections with respect to the trial numbers on which E_0 occurred
for the groups with π_0 = .50. The left-hand response was designated A_1
for all Ss.

For Sections 75 and 50, a double sampling technique was used to
determine the number of Ss to be used (Cox, 1958). In each case, n was
set so that the difference between mean values of p_1 for π_0 conditions
could be estimated with a 90% confidence interval of length less than
.10. The number of Ss in the six experimental groups were as follows:


Group 75/0, 64; Group 75/50, 32; Groups 50/0 and 50/50, 48; Groups 25/0
and 25/50, 32.


Results and Discussion 


Mean values of p_1 are graphed by blocks in Fig. 2. The value of p_1
over Blocks 3-6 was obtained for each S and these scores were analyzed.
Estimates of σ² were obtained as residual mean square terms of factorial
analyses of variance (see Table 1). Using these estimates, 90%
confidence intervals were estimated for the differences between π_0 = 0
and π_0 = .50 conditions as follows:


p_1(75/0) − p_1(75/50) = .079 ± .046;
p_1(50/0) − p_1(50/50) = .053 ± .048;
p_1(25/0) − p_1(25/50) = −.100 ± .050.


Lines 3-6 of Table 1 provide more detailed information regarding
performance of these groups. The values of the y statistic described
above were estimated as follows: Group 75/50, −.039; Group 50/50, +.006;
Group 25/50, +.030. These estimates suggest the following: p_1 decreased
across E_0 trials with π_1:π_2 = 75:25; p_1 increased across E_0 trials
with π_1:π_2 = 25:75; and p_1 was virtually unchanged by E_0 trials with
π_1:π_2 = 50:50.

Fig. 2. Proportion of A_1 choices across trial blocks in Exp. II.
(Ordinate: proportion of A_1 choices; abscissa: blocks of trials; one
curve each for Groups 75/0, 75/50, 50/0, 50/50, 25/0, and 25/50.)

Two formulations of the correction hypothesis could be applied to these
data, although neither of them seems


TABLE 1


ESTIMATES OF GROUP MEANS AND VARIANCES AND PROPORTIONS
OF RESPONSE-REINFORCEMENT COMBINATIONS


Group | p_1 | σ²(p_1) | p_{1,11} | p_{1,12} | p_{2,11} | p_{2,12} | p_{0,11} | p_{0,12} | p_{1,21} | p_{1,22} | p_{2,21} | p_{2,22} | p_{0,21} | p_{0,22}

Experiment I, Blocks 6-10

0 | .917 | [illegible] | .772 | .052 | .084 | .008 | -- | -- | .052 | .022 | .008 | .001 | -- | --
50 | .814 | [illegible] | .330 | .038 | .035 | .007 | .334 | .068 | .059 | .023 | .006 | .003 | .051 | .046

Experiment II, Blocks 3-6

75/0 | .584 | .0242 | .310 | .135 | .085 | .050 | -- | -- | [illegible] | [illegible] | [illegible] | [illegible] | -- | --
75/50 | .505 | [illegible] | .138 | .056 | .030 | [illegible] | .126 | .130 | .079 | .099 | .027 | .037 | .111 | [illegible]
50/0 | [illegible] | .0129 | [illegible] | [illegible] | .082 | .103 | -- | -- | [illegible] | [illegible] | .094 | .221 | -- | --
50/50 | [illegible] | [illegible] | [illegible] | [illegible] | .036 | .047 | .082 | .083 | [illegible] | [illegible] | .039 | .128 | .086 | .246
25/0 | [illegible] | .0139 | .015 | .029 | .024 | .103 | -- | -- | .045 | .160 | .082 | .543 | -- | --
25/50 | .269 | [illegible] | .013 | .018 | .028 | .076 | .051 | .085 | .026 | .070 | .052 | .220 | .100 | .261


Note. See text for explanations of entries.


adequate to account for the results that were obtained. First, we could
expect that E_0 trials might change p_1 in the direction of .50 by an
amount proportional to the difference between p_1(π_0 = 0) and .50. If
this had been the case, however, then the difference between p_1(75/0)
and p_1(75/50) would seem to have been too large. Instead of decreasing
p_1 toward .50, the E_0 trials reduced p_1 to .50. More critically, in
Section 50, p_1(π_0 = .50) was farther from .50 than p_1(π_0 = 0).

A second formulation would allow the probability of a correction
response to be influenced by the incentive variable. Such a formulation
would be consistent with results indicating that the incentive variable
had a greater effect in groups receiving E_0 trials than with π_0 = 0.
The results from Sections 75 and 50 of the present experiment are,
therefore, consistent with such a formulation. However, had the
incentive operation been more effective in Group 25/50 than Group 25/0,
there should not have been a significant difference between these groups
in the obtained direction.

In Sections 75 and 25, then, E_0 trials reduced the frequency with which
Ss chose that response which was reinforced more frequently, although
the more frequently reinforced response was associated with different
incentives in the two cases. In Section 50, although p_1 was changed by
the presence of E_0 trials, it appears that p_1 did not change across
E_0 trials despite the fact that p_1 < .50. These results, then,
indicate that S's choice frequency is not a relevant variable in
relation to the effect of E_0 trials; and the present data therefore
contraindicate the correction hypothesis. On the other hand, the present
findings indicate that the effect of E_0 trials is related to the
relative frequencies with which the alternative choices are reinforced.


SUMMARY 


Two experiments were conducted in order
to obtain evidence regarding the effect of nonreinforced
trials in a simple prediction situation.
Evidence was obtained regarding two
hypotheses: (a) the identity hypothesis,
which asserts that an E0 trial does not change
choice probabilities; and (b) the correction
hypothesis, which implies that an E0 trial
reduces the probability of the response chosen
by S on that trial.

In Exp. I, two groups receiving sequences
of trials including 200 reinforcements with
π1:π2 = 90:10 were compared over trial
blocks during which p1 was near its asymptote.
The group receiving E0 trials showed a lower
value of p1 than did the group with π0 = 0.
This result contraindicates the identity
hypothesis. A second comparison from Exp.
I examined the effect of a pretraining sequence
including E0 trials with π1:π2 = 50:50. The
pretraining sequence did not significantly
change the effect of E0 trials in the 90:10
sequence which followed. There was, then,
no evidence that the effect of E0 trials decreases
over sequences of partially reinforced
trials.

In Exp. II, unequal incentives were offered
S for the two choice responses in an attempt
to separate the contributions of choice frequencies
and reinforcement frequencies to the
effect of E0 trials. It was found that with
π1:π2 = 75:25, E0 trials reduced p1; with
π1:π2 = 25:75, E0 trials increased p1. With
π1:π2 = 50:50, p1(π0 = .50) was less than
p1(π0 = 0), although a sequential statistic
indicated that p1 did not change across E0
trials. In each case, p1(π0 = 0) was less than
[expression illegible in source]. Therefore,
it was concluded that
the effect of E0 trials was related to the frequencies
with which the choices were reinforced,
rather than to the frequencies with
which Ss chose the responses. Since the
correction hypothesis implies that choice
frequencies determine the average effect of E0
trials, the results of Exp. II were interpreted
as a disconfirmation of the correction hypothesis.


REFERENCES 


ANDERSON, N. H., & GRANT, D. A. A test of 
a statistical learning theory model for two- 
choice behavior with double stimulus 
events. J. exp. Psychol., 1957, 54, 305-317. 

ANDERSON, N. H., & GRANT, D. A. Correction
and reanalysis. J. exp. Psychol., 1958,
56, 453-454.

ATKINSON, R. C. An analysis of the effect of
nonreinforced trials in terms of statistical
learning theory. J. exp. Psychol., 1956,
52, 28-32.

BUSH, R. R. A survey of mathematical
learning theory. In R. D. Luce (Ed.),
Developments in mathematical psychology.
Glencoe, Ill.: Free Press, 1960. Pp. 120-
165.

COX, D. R. Planning of experiments. New
York: Wiley, 1958.

ESTES, W. K. The statistical approach to
learning theory. In S. Koch (Ed.), Psychology:
A study of a science, Vol. 2.
General systematic formulations, learning,
and special processes. New York:
McGraw-Hill, 1959. Pp. 380-491.

GRANT, D. A. Analysis-of-variance tests in
the analysis and comparison of curves.
Psychol. Bull., 1956, 53, 141-154.

LABERGE, D., GREENO, J. G., & PETERSON,
O. F. Nonreinforcement and neutralization
of stimuli. J. exp. Psychol., 1962, 63,
207-213.

MILLWARD, R. B. A comparison of two
learning models for two-choice conditioning
experiments involving nonreinforced trials.
Unpublished doctoral dissertation, Indiana
University, 1960.

NEIMARK, E. D. Effects of type of nonreinforcement
and number of alternative responses
in two verbal conditioning situations.
Unpublished doctoral dissertation,
Indiana University, 1953.


(Received September 1, 1961) 


Journal of Experimental Psychology
1962, Vol. 64, No. 4, 380-387 


RETENTION OF FIRST-LIST ASSOCIATIONS AS A 
FUNCTION OF THE CONDITIONS OF TRANSFER ¹


LEO POSTMAN 


University of California 


A basic question for theories of
transfer and retroactive inhibition
(RI) is whether the strength of first-list
associations changes systematically
during the learning of the second
list. A recent study by Barnes and
Underwood (1959) has presented clear
evidence that the “fate” of first-list
associations depends on the conditions
of intertask transfer. When the
paradigm for negative transfer (identical
stimuli and dissimilar responses:
A-B, A-C) is used, List 1 associations
are unlearned or extinguished
during the acquisition of List 2.
When the successive tasks conform to
the paradigm for positive transfer
(identical stimuli and highly similar
responses: A-B, A-B’), List 1 associations
are maintained at high strength
and appear to mediate the reproduction
of List 2 responses. The present
paper presents additional findings in
support of these conclusions.


Experimental evidence consistent with the
unlearning hypothesis has accumulated
steadily but has remained short of decisive.
The hypothesis was
advanced by Melton and Irwin (1940) as part
of a two-factor theory of RI. According to
this theory, List 1 associations which have
been unlearned are not available at the time
of recall; others which remain potentially
available are displaced by competing associations
from List 2. The lack of correlation between
total amount of RI and the number of
overt interlist intrusions is in accord with this
interpretation, as is the finding that RI is
greater than proactive inhibition (PI) at short
retention intervals (Melton & Von Lackum,
1941; Underwood, 1948a). The assumption


¹ This research was supported by a grant
from the National Science Foundation.


that some List 1 associations are not available
to S at the time of the retention test received
further support from studies using the method
of modified free recall or MFR (Briggs, 1954;
Briggs, Thompson, & Brogden, 1954; Underwood,
1948b). After learning two successive
lists (A-B, A-C), S is presented with the
common stimulus term (A) and is required to
give either the List 1 response (B) or the List
2 response (C). The relative frequency of
List 1 responses declines steadily as a function
of the degree of List 2 learning. The fact that
the proportion of List 1 responses increases as
a function of time supports the interpretation
of unlearning as a process akin to extinction
followed by spontaneous recovery. The results
of MFR tests do not, however, provide
crucial evidence for reduced availability of
List 1 associations. Since S is instructed to
give either B or C, a progressive rise in the
proportion of C responses may merely signal
increasing dominance of List 2 over List 1
associations and does not compel the conclusion
that the latter are not available to S.
A critical test of the unlearning hypothesis
requires that the availability of List 1 associations
be assessed under conditions in which the
effects of response competition and of losses in
list differentiation are eliminated. This requirement
was met for the first time in the
study of Barnes and Underwood (1959). In
the experiment using the A-B, A-C paradigm
with nonsense syllables as stimuli and adjectives
as responses, Ss were presented with the
common stimulus terms at the end of List 2
learning and were required to write down both
the List 1 and List 2 responses to each of the
stimuli. Following a usage adopted by Melton
(1961) this modification of MFR will be
referred to as MMFR. With List 1 learned
to a criterion of one perfect recitation, the
MMFR test was administered to different
groups after 1, 5, 10, or 20 anticipation trials
on List 2. As degree of List 2 learning increased,
there was a steady decline in the
number of List 1 responses and an equally
regular rise in the number of List 2 responses.
These trends were obtained regardless of
whether credit was given for all responses
recalled or only those which were reproduced
to the appropriate stimulus and identified



correctly as to list membership. With the 
exception of the lowest degree of List 2 learn- 
ing, List 1 responses were reproduced before 
List 2 responses with only chance or less than 
chance frequency. These findings give 
unequivocal support to the unlearning hy- 
pothesis and are inconsistent with the “in- 
dependence hypothesis,” according to which 
the two systems of responses remain independ- 
ent and intact and RI is due to reproductive 
inhibition. 

The inverse relationship between interlist 
response similarity and RI suggests that List 1 
responses gain in strength during the acquisi- 
tion of List 2 when intertask transfer is 
positive. One theoretical account of this 
relationship assumes that there is generaliza- 
tion of reinforcement between List 1 and List 
2 (Osgood, 1946, 1948; Underwood, 1951; 
Young, 1955). While this hypothesis ac- 
counts for the relationship between positive 
transfer and RI, a second experiment by 
Barnes and Underwood supports a different 
conception of the “fate” of List 1 associations 
under the A-B, A-B’ paradigm. In a replica- 
tion of the procedure described above with 
lists conforming to the A-B, A-B’ paradigm, 
the MMFR test showed little decline in the 
recall of List 1 associations as a function of 
the degree of List 2 learning. Recall of List 2 
associations was nearly perfect after only one 
trial. At the lower degrees of List 2 learning, 
List 1 responses tended to be recalled before 
List 2 responses. The total pattern of results, 
and especially the almost instantaneous ac- 
quisition of List 2, points to direct mediation 
of List 2 responses by List 1 responses 
(A-B-B’) rather than generalization of rein- 
forcement as the mechanism responsible for 
the positive transfer effects. Since similarity 
and associative connection between items are 
correlated, rehearsal of A-B-B’ strengthens 
List 1 responses and at the same time leads 
to a high level of recall for List 2 responses. 
Most Ss in the experiment of Barnes and 
Underwood reported using mediation during 
List 2 learning. 

The study of Barnes and Underwood 
(1959) represents an important advance in the 
analysis of the mechanisms of transfer which 
are fundamental to an interference theory of 
forgetting. The conclusions concerning the 
“fate” of List 1 associations will be strengthened
if it is possible to show that the differences
obtained with the two paradigms are
not a function of S's set at the time of recall.
Results obtained by the method of anticipa- 
tion suggest that RI at recall may be enhanced 
by S's tendency to continue responding from 
the list which he practiced last. Such 




“generalized competition” leads to a large 
proportion of failures to respond on the test of 
recall even when the two lists do not share 
common stimulus terms and List 2 has not 
been learned to a higher degree than List 1 
(Newton & Wickens, 1956; Postman, 1961; 
Postman & Riley, 1959). It is reasonable to 
suppose that generalized competition is 
greater when intertask transfer is negative 
than when it is positive, especially if there is 
direct mediation of responses under the latter 
condition. When recall for homogeneous lists 
is tested by MMFR, differential effects of set 
on the reproduction of List 1 responses cannot 
be ruled out. A mixed-list design was used in 
the present study in order to assess the 
differences in the availability of List 1 re- 
sponses for the A-B, A-C and A-B, A-B’ 
paradigms with the effects of response set 
equalized. The need for control of response 
set in MMFR by means of mixed-list designs 
has been pointed out by Melton (1961). 


Performance in MMFR is relatively 
free from the effects of response com- 
petition and list differentiation. A 
comparison of the amounts of RI as 
measured by MMFR and the conven- 
tional anticipation method will permit 
an estimate of the extent to which 
conventional measures of RI reflect 
reduction in the availability of List 1 
responses on the one hand, and re- 
sponse competition and loss of list 
differentiation on the other. A conventional
test of anticipation and
MMFR were, therefore, used with
different groups to measure RI under
the conditions of the present experiment.


METHOD


Experimental design.—Four groups of Ss— 
two Work groups and two Rest groups— 
learned a list of eight paired associates (A-B) 
to a criterion of one perfect recitation. The 
Work groups were then given 20 trials on a 
second list of eight paired associates. For half 
of the pairs in List 2 the relationship to the 
pairs in List 1 conformed to the A-B, A-C 
paradigm, and for the other half of the pairs 
to the A-B, A-B’ paradigm. The Rest groups 
rated a series of pictures on several evaluative 
dimensions for a period equal to that spent in 
List 2 learning by the Work groups. At the 
end of the retention interval one Work group 




and one Rest group were given an MMFR 
test in which the common stimulus terms (A) 
were presented and Ss were required to give 
both List 1 and List 2 responses. Following 
the MMFR test List 1 was relearned for 10
trials. The other Work group and Rest group 
relearned List 1 for 10 trials without an inter- 
vening MMFR test. The four conditions 
included in the design will be designated as 
MMFR Work, MMFR Rest, Conventional 
Work, and Conventional Rest. 

Lists—The stimulus terms were eight 
nonsense syllables of 47-53% association 
value (Glaze, 1928). The intrastimulus
similarity was low. None of the consonants 
were duplicated, and each of four vowels was 
repeated once. The pool of response terms 
consisted of two-syllable adjectives from 
Haagen’s (1949) tables. There were eight 
sets of three adjectives each. Two of the 
three adjectives in each set were those used 
by Barnes and Underwood (1959) as responses 
in A-B, A-B’ pairs and had similarity ratings 
from .9 to 1.4 on Haagen’s scale. The third 
adjective in each set had no apparent relationship
to the other two. Pairs were assigned to
lists so that (a) each nonsense syllable was the 
common stimulus for similar responses half 
the time and for dissimilar responses the 
other half of the time, and (b) similar re- 
sponses and dissimilar responses from a given 
set of adjectives were each paired with a 
common stimulus half the time. There were 
two combinations of lists; the two lists within 
each combination were learned first and 
second equally often. Intralist response
similarity was low throughout: there were no 
duplications of first letters, and no more than 
one duplication of a terminal suffix. 

Procedure.—The lists were presented on a 
Hull-type memory drum at a 2:2 rate, with 
an 8-sec. intertrial interval. There were four 
different orders of pairs each of which was 
used as a starting order equally often. The 
two lists learned by the Work groups were 
separated by 2 min. The total retention 
interval, which was filled by the picture-rating 
task for the Rest groups, was 15.3 min.

For the MMFR test the drum was operated
manually and the exposure of successive stimulus
terms was paced by S. The Ss were instructed
to call out the two responses in the
order in which they occurred to them when 
the common stimulus term appeared in the 
window. 

Subjects.—With two conditions (Work vs. 
Rest) and two tests of recall (MMFR vs.
Conventional), there were four groups of 16 

Ss each. The Ss were undergraduate stu- 
dents who were not necessarily naive to rote- 




learning experiments but had no previous 
experience with MMFR tests. For purposes 
of assignment to conditions, 64 entries were 
made so that to each S in a Work group there 
corresponded an S in the appropriate Rest 
group who learned the same test list with the 
same starting order. The Ss were run in 
blocks of 4, with 1 S per block drawn at 
random from each of the four conditions. 
The running order within blocks was deter- 
mined by a table of random numbers. No Ss 
were lost because of failure to learn. 
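
The blocked assignment described above (blocks of 4, one S per block from each of the four conditions, random running order within each block) can be sketched as follows. This is only an illustration under stated assumptions: the function name, the seed, and the use of Python's `random` module in place of a table of random numbers are hypothetical and are not details of the original procedure.

```python
import random

CONDITIONS = ["MMFR Work", "MMFR Rest", "Conventional Work", "Conventional Rest"]

def blocked_running_order(n_blocks, seed=None):
    """Return a list of blocks; each block holds every condition exactly once."""
    rng = random.Random(seed)  # stand-in for a table of random numbers (assumption)
    order = []
    for _ in range(n_blocks):
        block = CONDITIONS[:]   # one S from each of the four conditions
        rng.shuffle(block)      # random running order within the block
        order.append(block)
    return order

# 16 blocks of 4 give the 64 Ss (16 per condition) reported in the text.
schedule = blocked_running_order(n_blocks=16, seed=1962)
```

Blocking in this way keeps the four conditions balanced over the course of the experiment, so that any drift in procedure or subject pool is spread evenly across groups.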


RESULTS 


List 1 learning.—The mean number 
of trials to criterion on List 1 for the 
combined groups was 13.26, with an 
SD of 7.98. The means for individual 
groups ranged between 12.81 and 
14.00 and did not differ significantly 
(F < 1). 

The mean number of trials to a 
criterion of 4/4 for the pairs used in 
the A-B, A-B’ paradigm was 10.39, 
and 10.31 for the pairs used in the 
A-B, A-C paradigm. For purposes of 
this comparison as well as in subse- 
quent analyses of recall and relearn- 
ing, the classification of the pairs 
learned by Ss in the Work groups was 
applied to the protocols of correspond- 
ing Ss in the Rest groups. 

List 2 learning —During practice on 
List 2, the A-B’ paradigm produced 
faster learning than the A-C paradigm. 
The mean number of correct responses 
on A-B’ pairs was 65.06 (SD = 12.88), 
and 59.47 (SD = 11.15) on A-C pairs. 
The results for the two Work groups were
quite similar: 64.94 vs. 59.31, and
65.19 vs. 59.62 for the MMFR and
Conventional groups, respectively.
The difference between the two kinds
of pairs is significant (t = 2.45,
df = 31, .01 < P < .02) and is in
accord with the differential transfer
effects normally predicted for the two
paradigms.

The number of interlist intrusions
given to the appropriate stimulus
terms was greater for the A-B, A-B’




TABLE 1
NUMBER OF LIST 1 RESPONSES CORRECTLY REPRODUCED IN RECALL AND RELEARNING

                                              Paradigm
                                 A-B, A-B’ (a)            A-B, A-C (a)
Measure               Cond.      Mean      SD             Mean           SD

MMFR Groups
MMFR                  Work        3.06    1.20        [illegible]        .86
                      Rest        3.81     .39        [illegible]        .39
Conventional recall   Work        2.31    1.53        [illegible]    [illegible]
                      Rest        3.43     .66        [illegible]        .66
10 trials of RL       Work       33.12    5.85        [illegible]       6.32
                      Rest       37.88    2.15        [illegible]       2.72

Conventional Groups
Conventional recall   Work        2.50    1.05           1.56           1.00
                      Rest        2.94     .50           3.12            .93
10 trials of RL       Work       35.19    3.75          32.94           3.69
                      Rest       36.31    2.81          36.75           4.07

(a) Pairs learned by Ss in the Rest groups were classified according to treatment for corresponding Ss in the
Work groups.


than for the A-B, A-C paradigm. 
Twelve Ss contributed 35 intrusions 
on A-B’ pairs whereas 7 Ss made 11 
such errors on A-C pairs. This differ- 
ence agrees with that found in other 
investigations (Barnes & Underwood, 
1959; Underwood, 1951; Young, 1955).

Transfer from List 1 to List 2.—To 
estimate the net amount of transfer 
from List 1 to List 2, the total num- 
bers of correct responses on the first 
five trials of acquisition of the two 
lists were used. Since the fastest 
learner on List 1 required five trials to 
reach criterion, all Ss could be in- 
cluded in this analysis. For the A-B, 
A-B’ paradigm the mean number of 
correct responses rose from 7.50 to 
12.22; for A-B, A-C there was only a
small increase from 6.75 to 8.12. 
While the net transfer effect is positive 
in both cases, the gain is substantially 
larger for A-B’ than for A-C. The 


interaction, Pairs × Lists, is significant
beyond the .001 level (F = 14.73,
df = 1/31). In the absence of appro- 
priate controls the effects of learning- 
to-learn and warm up cannot be 
separated from the specific transfer 
effects falling on the two classes of 
pairs. The large difference in the net 
amount of gain indicates, however, 
that the specific transfer effects were 
positive for A-B’ and negative for 
A-C. 

MMFR test—The mean numbers 
of correct responses on the MMFR 
test are shown in Table 1. The scores 
are based on responses given to the 
appropriate stimulus terms. For the 
Rest group there is only a negligible 
decline in the recall of List 1. The 
Work group shows losses for both 
kinds of List 1 pairs but the amount 
forgotten is clearly greater after inter- 
polation of A-C than of A-B’. Recall 




of List 2 was nearly perfect—3.94 for 
A-B’ and 3.88 for A-C. 
In the statistical analysis of the
results, the scores of corresponding Ss
in the Work and Rest groups were
treated as paired replicates. Since the
direction of the differences was predicted,
one-tailed tests of significance
were used. The overall amount recalled
is significantly greater under
Rest than Work (t = 4.67, df = 15,
P < .001). For purposes of evaluating
the differential effects of the conditions
of transfer on the amount of RI,
the difference between the recall
scores for the two sets of pairs was
determined for each S. The mean
difference score is significantly higher
for the Work than the Rest group
(t = 2.56, .01 < P < .02). Thus, the
reduction in the availability of List
1 responses is reliably greater for
the A-B, A-C than the A-B, A-B’
paradigm.
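
The paired-replicates analysis described above amounts to a t test on the differences between corresponding Work and Rest Ss. A minimal sketch follows, assuming the usual paired t statistic (mean difference divided by its standard error); the function name and the four pairs of scores are hypothetical and are not data from the experiment.

```python
import math
import statistics

def paired_t(work, rest):
    """Return (t, df) for paired replicates, using Rest minus Work differences."""
    if len(work) != len(rest):
        raise ValueError("paired scores must have equal length")
    diffs = [r - w for w, r in zip(work, rest)]
    n = len(diffs)
    mean_d = statistics.mean(diffs)
    se_d = statistics.stdev(diffs) / math.sqrt(n)  # standard error of the mean difference
    return mean_d / se_d, n - 1

# Hypothetical recall scores for four matched pairs of Ss (illustration only).
work_scores = [3, 2, 3, 1]
rest_scores = [4, 4, 3, 4]
t, df = paired_t(work_scores, rest_scores)
```

With 16 matched pairs, as in each comparison reported here, the same computation yields df = 15. Because the direction of the difference was predicted, the resulting t would be referred to a one-tailed criterion.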

There were only a few scattered
instances of List 1 responses given to
an incorrect stimulus on the MMFR
test. Inclusion of these responses
raises the total number of responses
recalled by 2 and 3 for the A-B, A-B’
and A-B, A-C conditions, respectively.
For the Rest group, there is an increase
of 2 on each type of pair. Thus the
picture remains virtually unchanged
whether or not response to the appropriate
stimulus is used as a criterion
of correct recall.


For the A-B, A-B’ and A-B, A-C pairs, the
percentages of cases
in which List 1 responses were recalled
first were 45.8 and 45.4, respectively,
i.e., the frequencies are close to chance.
The corresponding
percentages obtained by Barnes and
Underwood after 20 trials of interpolated
learning were 53 and 43.
Since all percentages are near chance,
the differences between the two experiments
may be considered minor.

Conventional recall of List 1— 
Table 1 shows the mean numbers of 
items recalled by the Conventional 
groups on the paced test of anticipation.
The total recall scores of the
Work group are significantly lower
than those of the Rest group (t = 3.51,
P < .01). The amount of interference
is substantially greater for the
A-B, A-C than the A-B, A-B’ paradigm,
and the difference between the
two sets of pairs is again significantly
larger for the Work than the Rest
group (t = 2.38, .01 < P < .02).
There was only one interlist intrusion 
to an appropriate stimulus term, 
which was a substitution of B’ for B. 

Comparison of MMFR and conven- 
tional recall.—The overall level of 
recall is significantly higher in MMFR 
than in conventional anticipation 
(F = 10.36, df = 1/15, P < .01, after 
a Freeman-Tukey square-root trans- 
formation). However, the amounts 
of RI are quite similar under the two 
conditions of testing. The percent- 
ages of RI in MMFR are 19.7 for A-B, 
A-B' and 43.5 for A-B, A-C. On the 
conventional test the corresponding 
percentages are 14.9 and 50.0. The 
Rest-Work differences interact with 
the method of testing neither for total 
recall scores nor for the differences 
between types of pairs. 
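
The percentages of RI quoted above are consistent with the usual relative-loss measure, 100 × (Rest − Work) / Rest. The sketch below assumes that formula (the text does not state it) and applies it to the legible means in Table 1; the function and variable names are illustrative.

```python
def percent_ri(rest_mean, work_mean):
    """Percentage of RI: loss in the Work group relative to the Rest control."""
    return 100.0 * (rest_mean - work_mean) / rest_mean

# Mean List 1 recall taken from Table 1 (values rounded to two decimals there).
mmfr_ab_prime = percent_ri(rest_mean=3.81, work_mean=3.06)  # reported as 19.7
conv_ab_prime = percent_ri(rest_mean=2.94, work_mean=2.50)  # reported as 14.9
conv_ac = percent_ri(rest_mean=3.12, work_mean=1.56)        # reported as 50.0
```

The computed values agree with the reported percentages to within about a tenth of a point; small discrepancies of that size would be expected if the published figures were computed from unrounded means.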

Effect of MMFR on conventional 
recall—Since all groups relearned 
List 1 by the anticipation method, it 
is possible to evaluate the effects of an 
MMFR test on subsequent conventional
recall. It must be recognized
that the time between the last trial of
original learning and the first trial of
relearning was slightly longer for the
MMFR groups than the Conventional
groups because of the interpolated
MMFR test. However, this difference
is minor relative to the total
length of the retention interval. As
Table 1 shows, the performance of the
MMFR Rest group exceeds that of
the Conventional Rest group on the
first trial of relearning whereas the
differences between the two Work
groups are small and not consistent.
Whatever opportunity for rehearsal is
provided by the MMFR test appears
to have a beneficial effect only for the
Rest group. However, the resulting
increase in RI is not significant as
evaluated by the interaction of the
Work-Rest differences with the conditions
of testing.

The number of appropriate interlist
intrusions also shows some increase 
after the MMFR test. Whereas there 
was only one intrusion of a B’ item for 
the Conventional Work group, the 
MMFR Work group gave four B’ 
responses and two C responses on the 
first relearning trial. The interpola- 
tion of an MMFR test appears to 
reduce the differentiation between 
lists. 

Relearning.—The mean numbers of 
correct responses in 10 trials of re- 
learning are shown in Table 1. The 
Conventional groups will be consid- 
ered first. The difference between the 
Work group and the Rest group is 
significant (t = 2.38, .01 < P < .02), 
i.e., there is reliable RI when the total
performance in relearning is considered.
However, the difference between
the amounts of RI under the
two conditions of interpolation falls
short of significance (t = 1.64). As
Fig. 1 shows, the level of RI for A-B,
A-C declines steeply and converges 
on that for A-B, A-B’. 

The overall difference between the 
Work group and the Rest group is 
also significant for relearning after 
MMFR (t = 3.30, P < .01). The
amount of RI does not differ significantly
for the two paradigms (t = .54),


[Figure 1 plot: mean amount of RI (ordinate) against trials of relearning in blocks 1-2, 3-4, 5-6, 7-8, and 9-10 (abscissa), with separate curves for the A-B, A-B’ and A-B, A-C paradigms under MMFR and Conventional testing.]

FIG. 1. Amount of RI in relearning as a
function of transfer paradigm and condition
of testing. (All Ss relearned by the method
of anticipation. An MMFR test preceded
relearning for the MMFR groups but not for
the Conventional groups.)


and the loss scores again converge 
during relearning (Fig. 1). 

Comparison of the temporal trends
in RI shows that interpolation of an
MMFR test produces a pronounced
increase in RI for A-B, A-B’ in the
early stages of relearning, and a
smaller increase for A-B, A-C. As a
result, the initial advantage of A-B,
A-B’ is considerably greater under the
conventional treatment than after
MMFR; in fact, the relationship between
the paradigms is temporarily
reversed in the latter case. A trend
analysis of the differences between the
two sets of pairs yields a significant
interaction of Conditions (Work vs.
Rest) with the Method of Testing
(F = 4.77, df = 1/15, .02 < P < .05).
Thus, the difference between the
slopes of the two RI functions is reduced
by the interpolation of an
MMFR test.

The effects of MMFR on relearning 
are reflected in the frequencies of 
interlist intrusions given to the ap- 
propriate stimulus terms. These fre- 




quencies and the numbers of Ss (N) 
contributing them were as follows: 
Conventional Work group—8 intru- 
sions of B’ (N = 5) and 4 of C 
(N = 2); MMFR Work group—21 
intrusions of B’ (N = 12) and 5 of C 
(N = 4). There are more intrusions 
of B’ than C responses in both cases, 
but this difference is substantially increased
following MMFR.


DISCUSSION


The results obtained with mixed lists 
fully confirm those reported by Barnes 
and Underwood (1959) for homogeneous 
lists. The reduction in availability of 
List 1 associations as measured by 
MMFR is clearly and significantly 
greater for the A-B, A-C than the A-B, 
A-B’ paradigm. The relative amounts of 
interference observed in the two experi- 
ments are in substantial agreement. A 
direct comparison can be made of the 
percentages that List 1 responses were of 
all the responses given in MMFR under 
each condition of interpolation. For 
A-B, A-B’ this percentage was 47.0 in 
the study of Barnes and Underwood, and 
43.8 in the present experiment; for A-B, 
A-C the corresponding percentages are
35.8 and 35.4.
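
The percentage compared here is the share of List 1 responses among all responses given in MMFR. A minimal sketch, assuming it is computed as List 1 recall divided by the sum of List 1 and List 2 recall for the Work group; the function name is hypothetical, the List 1 mean is taken from Table 1, and the List 2 mean from the text of the MMFR results.

```python
def list1_share(list1_mean, list2_mean):
    """Percentage that List 1 responses were of all responses given in MMFR."""
    return 100.0 * list1_mean / (list1_mean + list2_mean)

# A-B, A-B' pairs, MMFR Work group: List 1 mean 3.06, List 2 mean 3.94.
share_ab_prime = list1_share(list1_mean=3.06, list2_mean=3.94)
```

From these rounded means the share comes to about 43.7, against the 43.8 reported; a difference of that size would be expected if the published figure was based on unrounded totals.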

The consistency of the results obtained
with homogeneous and with mixed lists
indicates that the differences obtained in
MMFR reflect primarily the strength of
first-list associations rather than the
influence of response sets at the time of
recall. This finding parallels that of
Twedt and Underwood (1959) that
homogeneous lists and mixed lists yield
equivalent measures of differential transfer
effects.

With response set eliminated as a
major determinant of performance in
MMFR, the present results give further
strong support to the hypothesis that
first-list associations are extinguished or
unlearned when intertask transfer is
negative. The data are also consistent
with the hypothesis of response mediation
under conditions of positive transfer.
When response similarity is high, the
acquisition of List 2 associations is rapid,
and List 1 associations are maintained at
relatively high strength. However, there
is clearly some decline in the strength of
A-B after interpolation of A-B’. As
Barnes and Underwood have pointed out,
the possibility that there is some extinction
of A-B during acquisition of A-B’
must remain open.

The MMFR procedure is designed to
measure associative strength independently
of the effects of list differentiation
and response competition. Nevertheless
the MMFR and Conventional groups
yield comparable measures of RI and are
equally sensitive to the differences between
the paradigms of transfer. It
appears that in conventional recall the
amounts of RI observed immediately
after the end of interpolated learning are
largely a function of associative strength
rather than degree of list differentiation
and response competition. The relationship
between the measures of RI obtained
in MMFR and conventional recall
is likely to change, however, with the
progressive decline of list differentiation
as a function of time.

The rise in intrusions following MMFR 
indicates a systematic decrease in list 
differentiation after a test in which Ss 
may give the two responses to a common 
stimulus term in either order. The fact
that the increase in intrusions is greater 
for A-B, A-B’ than for A-B, A-C lends 
some indirect support to the view that 
mediation occurs in the former case. 

The associative connection between
highly similar responses may be assumed
to be bidirectional, i.e., the pre-experimental
probability of B’-B should be
equal to that of B-B’. Both sequences
occur with approximately equal frequency
in MMFR. Reversal of the order
of responses in MMFR appears to be
conducive to competition between the
two alternative sequences and results in
persistent interlist intrusions and RI in
relearning. There are presumably no
such competing sequences in the case of
the A-B, A-C paradigm. These effects
of MMFR on relearning add to the
evidence for the lack of independence
between successive tests of retention.




SUMMARY 


This study is concerned with the changes
in the strength of first-list associations during 
the acquisition of a second list. A mixed-list 
design was used so that for half the syllable- 
adjective pairs in the two lists the paradigm 
of transfer was A-B, A-B’, and for the other 
half, A-B, A-C. List 1 was learned to one 
perfect recitation and List 2 was practiced 
for 20 trials. To determine the availability of 
List 1 and List 2 associations at the end of 
interpolated learning, a procedure devised by 
Barnes and Underwood (1959) was followed 
and Ss were required to give both responses 
to each stimulus in an unpaced test of recall 
(MMFR). A control group learned and 
recalled a single list. In a parallel experiment 
retention for List 1 was tested by the conven- 
tional method of anticipation. 

In agreement with the results obtained by 
Barnes and Underwood for homogeneous 
lists, the MMFR test shows a significantly 
greater reduction in the availability of List 1 
responses for the A-B, A-C than the A-B, 
A-B’ paradigm. Recall of List 2 was nearly 
perfect. These findings support the hy- 
pothesis of unlearning and make it unlikely 
that the differences in the availability of 
List 1 associations are a function of response 
sets characteristic of the two paradigms. 
The conventional anticipation method yields 
measures of RI which closely parallel those 
obtained in MMFR. It is concluded that 
under both conditions of measurement the 
amount of RI immediately after interpolated 
learning is determined largely if not entirely 
by the availability of List 1 associations. 


REFERENCES 


Barnes, J. M., & UNDERWOOD, B. J. “Fate” 
of first-list associations in transfer theory. 
J. exp. Psychol., 1959, 58, 97-105. 

Briggs, G. E. Acquisition, extinction, and
recovery functions in retroactive inhibition. 
J. exp. Psychol., 1954, 47, 285-293. 

Briggs, G. E., Thompson, R. F., & Brogden,
W. J. Retention functions in reproductive
inhibition. J. exp. Psychol., 1954, 48, 419-

Glaze, J. A. The association value of nonsense
syllables. J. genet. Psychol., 1928,
35, 255-269.




Haagen, C. H. Synonymity, vividness, familiarity,
and association-value ratings for
400 pairs of common adjectives. J. Psychol.,
1949, 30, 185-200.

MELTON, A. W. Comments on Professor 
Postman’s paper. In C. N. Cofer (Ed.), 
Verbal learning and verbal behavior. New 
York: McGraw-Hill, 1961. 

MELTON, A. W., & Irwin, J. McQ. The 
influence of degree of interpolated learning 
on retroactive inhibition and the overt 
transfer of specific responses. Amer. J. 
Psychol., 1940, 53, 175-203. 

Melton, A. W., & Von Lackum, W. J.
Retroactive and proactive inhibition in
retention: Evidence for a two-factor theory
of retroactive inhibition. Amer. J. Psychol.,
1941, 54, 157-173.

Newton, J. M., & Wickens, D. D. Retroactive
inhibition as a function of the temporal
position of interpolated learning.
J. exp. Psychol., 1956, 51, 149-154.

Osgood, C. E. Meaningful similarity and
interference in learning. J. exp. Psychol., 
1946, 36, 277-301. 

Osgood, C. E. An investigation into the
causes of retroactive inhibition. J. exp. 
Psychol., 1948, 38, 132-154. 

Postman, L. The present status of interfer- 
ence theory. In C. N. Cofer (Ed.), 
Verbal learning and verbal behavior. New 
York: McGraw-Hill, 1961. 

Postman, L., & Riley, D. A. Degree of
learning and interserial interference in
retention. U. Calif. Publ. Psychol., 1959,
8, 271-396.

Twedt, H. M., & Underwood, B. J. Mixed
vs. unmixed lists in transfer studies. J.
exp. Psychol., 1959, 58, 111-116.

Underwood, B. J. Proactive and retroactive
inhibition after five and forty-eight
hours. J. exp. Psychol., 1948, 38, 29-38.
(a)

Underwood, B. J. “Spontaneous” recovery
of verbal associations. J. exp. Psychol.,
1948, 38, 429-439. (b)

Underwood, B. J. Associative transfer in
verbal learning as a function of response
similarity and degree of first-list learning.
J. exp. Psychol., 1951, 42, 44-53.

Young, R. K. Retroactive and proactive
effects under varying conditions of response
similarity. J. exp. Psychol., 1955, 50, 113-119.

(Received September 7, 1961)


Journal of Experimental Psychology 
1962, Vol. 64, No. 4, 388-392 


EFFECTS OF NONREINFORCEMENT ON SUBSEQUENT 
REINFORCED RUNNING BEHAVIOR 1


SHELBY J. HARRIS,2 M. GLENN SMITH,3 AND SOLOMON WEINSTOCK 4


Lehigh University 


Bush and Mosteller (1955, p. 89) 
present a descriptive model for re- 
sponse acquisition under partial rein- 
forcement which is formally identical 
with models arising from statistical 
learning theory. They assume two
linear operators on probability which 
are applied independently, one on 
reinforced and the other on non- 
reinforced trials. The model yields 
the result that asymptotic probability 
of responding is directly related to the 
proportion of reinforced trials. This
result may also be shown to hold for 
the asymptotic speed of running in an 
alleyway. The latter was directly 
contradicted by the finding by Notter- 
man (1951), Weinstock (1958), and 
others, that continuously reinforced 
Ss reached lower terminal speeds of 
running than partially reinforced Ss. 
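
The relation between the proportion of reinforced trials and the asymptotic probability of responding can be illustrated with a minimal Python sketch. It assumes the common linear-operator form Q(p) = alpha * p + (1 - alpha) * lam and a schedule in which reinforcement is delivered with probability pi independently of the response. The function names and parameter values are hypothetical and are not taken from Bush and Mosteller.

```python
def apply_operator(p, alpha, lam):
    """Linear operator: move p a fixed fraction of the way toward the limit point lam."""
    return alpha * p + (1.0 - alpha) * lam


def asymptotic_mean(pi, alpha_r, lam_r, alpha_n, lam_n):
    """Fixed point of the expected-value recursion when the reinforced operator
    is applied with probability pi and the nonreinforced one with 1 - pi."""
    w_r = pi * (1.0 - alpha_r)
    w_n = (1.0 - pi) * (1.0 - alpha_n)
    return (w_r * lam_r + w_n * lam_n) / (w_r + w_n)


def iterate_mean(pi, alpha_r, lam_r, alpha_n, lam_n, p0=0.0, n_trials=2000):
    """Iterate the expected-value recursion trial by trial (a numerical check)."""
    p = p0
    for _ in range(n_trials):
        p = (pi * apply_operator(p, alpha_r, lam_r)
             + (1.0 - pi) * apply_operator(p, alpha_n, lam_n))
    return p


if __name__ == "__main__":
    # Illustrative parameter values only.
    for pi in (0.5, 1.0):
        print(pi, asymptotic_mean(pi, 0.9, 1.0, 0.9, 0.0))
```

Whenever the limit point of the reinforced operator exceeds that of the nonreinforced operator, the asymptote in this sketch rises with pi, which is the prediction contradicted by the running-speed findings cited above.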

Weinstock (1953, 1958) hypothe- 
sized that the effect of nonreinforced 
trials was to provide an opportunity
for nonfunctional response compon- 
ents in the response chain to be 
eliminated. Elimination of nonfunc- 
tional movements should result in an 
increase in the value of the limit-point 
of the reinforced trial operator, Wein- 
stock (1953) proposed a modification 
of the original model in which the 
limit-point of the reinforced trial 
operator is assumed to be an increasing
function of the number of nonreinforced
trials.
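
One way to picture the proposed modification is sketched below in Python. The text specifies only that the limit point of the reinforced trial operator increases with the number of nonreinforced trials; the exponential form used here, the parameter values, and all names are assumptions made purely for illustration.

```python
def limit_point(n_nonreinforced, lam_start=0.5, lam_max=1.0, gamma=0.95):
    """Hypothetical limit point of the reinforced-trial operator.
    It starts at lam_start and rises toward lam_max as nonreinforced trials accumulate."""
    return lam_max - (lam_max - lam_start) * gamma ** n_nonreinforced


def run_schedule(schedule, alpha_r=0.9, alpha_n=0.9, lam_n=0.0, level=0.0):
    """Apply the two operators over a sequence of trials.
    schedule is a list of booleans, True meaning a reinforced trial."""
    n_nonreinforced = 0
    for reinforced in schedule:
        if reinforced:
            lam = limit_point(n_nonreinforced)
            level = alpha_r * level + (1.0 - alpha_r) * lam
        else:
            n_nonreinforced += 1
            level = alpha_n * level + (1.0 - alpha_n) * lam_n
    return level


if __name__ == "__main__":
    continuous = [True] * 150
    partial_then_continuous = [True, False] * 48 + [True] * 54
    print(run_schedule(continuous), run_schedule(partial_then_continuous))
```

Under these assumed values, a history of nonreinforced trials leaves the response at a higher terminal level once only the reinforced operator is applied, which is the qualitative pattern the modification was meant to capture.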

1 This study was supported in part by
Research Grant M-2286 from the National
Institute of Mental Health, United States
Public Health Service.

2 Now at Wells College.

3 Now at Tufts University.

4 Now at Brooklyn College.


In the present experiment two 
groups of rats were trained to run in 
an alleyway under 50% and 100% 
reinforcement, respectively. Half of 
each group then received extinction.
Finally, all Ss were given 100% rein- 
forcement. In the last phase of the 
experiment only the reinforced trial 
operator is applicable and the changes 
induced in the limit-point should be 
clearly exhibited.

It was expected that both the 
partial reinforcement and extinction 
variables would lead to faster running 
and that an interaction between the 
two variables would be found. It was 
also expected that the 50% group 
which did not get extinction would 
run faster when switched to 100% 
reinforcement since the nonreinforced 
trial operator would no longer be 
applied. Finally, it was hoped that 
the differences induced in speed of 
running would correspond to dif- 
ferences in within-trial variability. 
Specifically, it was thought that 
faster running was due to conditioning
of a response chain that contained 
fewer nonfunctional movements. As 
S traversed the alleyway, the succes- 
sive placements of its left rear paw 
were recorded. More general in- 
formation about the effect of rein- 
forcement schedules on the variability 
of running behavior was also sought. 


METHOD 


Subjects.—The Ss were 40 experimentally
naive female albino rats, 105 to 135 days old
at the beginning of the experiment.

Apparatus.—The apparatus was a straight
alley, 10 in. wide and 36 in. long, with a 
10 X 18 in. goal box and a start box 5 in. wide 




and 10 in. long. The height of all sections was 
ĝin An entranceway, 5 in. wide and 4 in. 
long, separated the alley from the goal box. 
Guillotine doors were used in the start box 
and in the goal box. A 5 X 7 in. vestibule in 
the far right corner of the goal box contained 
a drinking tube. On nonreinforced trials a 
sliding door, which was not visible before S 
interrupted the last photocell beam, prevented 
access to the drinking tube. Latency and 
total running time were measured using 
Standard Electric .01-sec. timers operated by 
photocells located 2 in. from the start box and 
9 in. inside the goal box, respectively. 

Records of the path taken by S were 
obtained by means of a 20-channel “contact” 
recorder in which S's left rear paw served to 
key a thyratron circuit when the paw made 
contact with the floor. This was accomplished 
by constructing the floor of tinned copper bus 
bars embedded in insulated wood at 4-in. 
intervals. When the paw, which was coated 
with silver paint, made contact with adjacent 
bars a thyratron circuit was completed. Re- 
sistance in the other paws was too high to 
complete the circuit. 

The runway floor was divided into a 
checkerboard of ¾ X 2 in. sections. Contact
within one of these sections activated two 
electrodes of a 20-electrode paper-burn 
recorder. One of the electrodes recorded the 
position of the paw across the runway and the
other the position down the length of the
runway. Eleven ¾-in. and one 1¾-in. segments,
adjacent to the right hand wall, made
up the width of the alleyway. Six 2-in.
sections made up the first 12 in. of the length 
of the runway. Two repetitions of this 
matrix were used for the remaining portion 
of the runway proper and a third repetition 
permitted recording in the goal box. Two 
photocells, one located 18 in. from the start
box and the other 9 in. inside the goal box, 
operated two of the remaining electrodes of 
the recorder to aid in determining when S 
moved from one section to the next. 

Procedure.—Prior to the experiment, Ss 
were handled and habituated to 22 hr. water 
deprivation for 10 days. Throughout the 
experiment two trials per day with an inter- 
trial interval of about 45 min. were run under 
22 hr. water deprivation. Reinforcement 
consisted of 10 sec. of drinking. On nonreinforced
trials S was confined to the goal
box for 30 sec. 

Before each trial S's left rear paw was 
coated with silver paint. After a 30-sec. 
interval to allow the paint to dry, S was
placed in the start box facing the door. 
Closing the cover depressed a microswitch 


389 


which activated a 3-sec. thermo-delay relay. 
When the relay closed a solenoid was energized 
which served to release a lever which was held 
by stretched rubber bands and both timers 
were started. Movement of the lever raised 
the start-box door. 

Two Es (SJH and SW) ran half of the Ss 
each. The third E coated S's paw with paint 
and recorded the data. 

During the first four trials of the experi- 
ment, S was removed from the apparatus if 
it failed to leave the start box within 20 min.,
or spent more than 20 min. in the runway or 
in the goal box without being reinforced. 
After the fourth trial the removal criteria 
were reduced to 5 min. Failure to run on 
three successive trials was used as a criterion 
for eliminating S from the experiment. 

All Ss were reinforced on each of the first 
12 trials. On the basis of running speed data 
from the early trials Ss were assigned to four 
groups in such a way that the group means 
were equal.5 Group CE remained on 100%
reinforcement during the acquisition period 
and was then given 24 extinction trials. 
Group CC remained on 100% reinforcement, 
both during acquisition and during the 
subsequent 24 trials. Group PE received 50% 
reinforcement during acquisition and then 
received 24 extinction trials, while Group PP 
received 50% reinforcement, both during 
acquisition and the following 24 trials. For 
Ss receiving 50% reinforcement 6 trials on 
which reinforcement was to be given were 
selected randomly from each block of 12 
trials. 
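
The selection of reinforced trials for the 50% groups can be sketched as follows in Python. This is a minimal illustration of drawing 6 reinforced trials at random from each block of 12; the function name, the seed argument, and the use of a pseudo-random generator are assumptions, since the original selection procedure is not described further.

```python
import random


def partial_schedule(n_trials, block_size=12, n_reinforced=6, seed=None):
    """Return a list of booleans (True = reinforced trial) in which exactly
    n_reinforced trials are reinforced within each successive block."""
    if n_trials % block_size != 0:
        raise ValueError("n_trials must be a whole number of blocks")
    rng = random.Random(seed)
    schedule = []
    for _ in range(n_trials // block_size):
        chosen = set(rng.sample(range(block_size), n_reinforced))
        schedule.extend(position in chosen for position in range(block_size))
    return schedule


if __name__ == "__main__":
    print(partial_schedule(24, seed=1))
```

Constraining the draw within blocks keeps the proportion of reinforcement at exactly 50% over every 12 trials while leaving the order within a block unpredictable.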

Mean reciprocal running times were com- 
puted for each block of six trials during the 
course of acquisition. The decision to ter- 
minate acquisition was based on inspection 
of these data. When it appeared Ss’ running 
times were stable, an additional 12 trials were 
run and the acquisition period terminated. 
As a result of this procedure a total of 96 
acquisition trials were run. Prior to the 
extinction period all Ss received 2 reinforced 
trials. A reacquisition period of 30 trials 
followed the 24 extinction trials. All Ss 
received 100% reinforcement during
reacquisition.

RESULTS 


Latency and running time were 
converted to reciprocals throughout. 
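
The reciprocal transformation and the averaging over blocks of six trials can be sketched in Python as below. The function name and the sample times are hypothetical; the sketch assumes running times in seconds and drops any incomplete final block.

```python
def mean_reciprocals_by_block(times_sec, block_size=6):
    """Convert running times to reciprocals (speed scores) and
    average them within successive blocks of block_size trials."""
    reciprocals = [1.0 / t for t in times_sec]
    last_start = len(reciprocals) - block_size + 1
    return [
        sum(reciprocals[start:start + block_size]) / block_size
        for start in range(0, last_start, block_size)
    ]


if __name__ == "__main__":
    # Made-up times: six slow trials followed by six fast trials.
    print(mean_reciprocals_by_block([2.0] * 6 + [1.0] * 6))
```

Because the reciprocal is taken before averaging, a single very long running time pulls the block mean toward zero only slightly, instead of dominating it as it would in a mean of raw times.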


5 One S was discarded after Trial 9 in
accordance with the criteria previously 
described. Hence, for Group PE N =9, 
while for the other groups N = 10. 


Fig. 1. Mean reciprocal running time plotted as a function of trials in blocks of six, shown for the acquisition, extinction, and reacquisition phases (all groups on continuous reinforcement during reacquisition). Groups: CC (N = 10), CE (N = 10), PP (N = 10), PE (N = 9).


A record of successive placements of 
S's left rear paw was obtained on each 
trial. The paw placements were
plotted on a scale drawing of the run- 
way and connected by straight lines 
yielding an approximate path for each 
trial. The points of intersection of the 
path with lines perpendicular to the 
side walls at distances of 7, 14, 21, and 
28 in., respectively, were recorded.
Thus, the lateral position of S (i.e.,
across the width of the runway) at
each of the four points down the
runway could be estimated. From
these data a number of measures involving
spatial location could be computed
for each S for any block of
trials: (a) the variability of lateral 
position across a block of trials at each 
of the four points, (b) the mean lateral 
position across all four points and 
across trials, (c) the variability of this 
mean position across trials, (d) the 
mean across trials of the variance 


within a trial computed about the 
mean position, and (e) the sum within 
each trial of the absolute deviations 
from a straight line connecting the 
start box and goal box and the average
of these sums over trials.
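
The five spatial measures listed above can be sketched in Python as follows. This is a minimal illustration, not the authors' computation: it assumes that each trial is represented by the lateral positions at the four checkpoints, that "variability" is the sample variance, that the within-trial variance is taken about the trial mean with divisor n, and that the straight line connecting the start and goal boxes lies at a constant lateral coordinate (the hypothetical argument line_position).

```python
from statistics import mean, pvariance, variance


def path_measures(trials, line_position):
    """trials: list of trials, each a list of lateral positions (in.)
    at the four checkpoints down the runway."""
    per_point = list(zip(*trials))          # positions grouped by checkpoint
    trial_means = [mean(t) for t in trials]
    within_trial_var = [pvariance(t) for t in trials]
    abs_dev_sums = [sum(abs(x - line_position) for x in t) for t in trials]
    return {
        "a_variance_at_each_point": [variance(p) for p in per_point],
        "b_mean_position": mean(trial_means),
        "c_variance_of_trial_means": variance(trial_means),
        "d_mean_within_trial_variance": mean(within_trial_var),
        "e_mean_abs_deviation_sum": mean(abs_dev_sums),
    }


if __name__ == "__main__":
    # Made-up data: one straight run and one zigzag run.
    example = [[5.0, 5.0, 5.0, 5.0], [4.0, 6.0, 4.0, 6.0]]
    print(path_measures(example, line_position=5.0))
```

In the made-up example the two runs have the same mean position, so measures (b) and (c) do not distinguish them, while the within-trial measures (d) and (e) do.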
In Fig. 1 the reciprocal running 
time measure is plotted against blocks 
of six trials for all three phases of the 
experiment.6 Reacquisition occurred 
much more rapidly than original 
acquisition. Group CE went from a 
level of responding below that of the 
first block of acquisition trials to a
level about that of Group CC within
the first block of reacquisition trials. 
The major portion of the change in 
responding for Group CE took place 
in the first three reacquisition trials. 
An analysis of variance for the last
18 trials of acquisition yielded a nonsignificant
F (F = 2.60, df = 1/35,
p > .05) for the difference between
the continuous and partial reinforcement
groups. The difference between
Groups CC and PP continued to be
nonsignificant for the 24 trials of the
extinction phase, while the difference
between Groups CE and PE yielded
an F of 20.32 with 1/15 df, which is
significant at the 5% level.

6 The two trials at the end of acquisition
on which all Ss received reinforcement are
omitted from Fig. 1.

The results of an analysis of variance
based on the last 12 trials of the
reacquisition phase are summarized in
Table 1. Since 7 Fs were computed,
1 y's F maximum test was used 
to adjust the experiment-wise error 
rate to 5%. Pearson and Hartley
(1954, Table 19) give the 5% value as
slightly less than 8.21. Thus, with the
experiment-wise error rate controlled
at 5% the effects of the reinforcement
and extinction variables would be
declared significant while the interaction
between Es and extinction
would be declared nonsignificant. It
should be noted that the size of the
interaction was due to a larger difference
between extinction and no-extinction
groups for one E (SJH)
than for the other (SW). The direction
of the difference was the same
for both Es.

Group PP showed an increase
in speed of running after being
shifted to continuous reinforcement.


TABLE 1 


ANALYSIS OF VARIANCE FOR THE 
REACQUISITION PHASE 


Source


Es
Continuous vs. Partial
Ext. vs. No-Ext.
E-NE X C-P
E-NE X C-P X Es
Within (error)


* p < .05.




The group's mean level during the ex- 
tinction and the reacquisition phases 
was 0.54 and 0.63, respectively. A t
for paired measures, based on the last
18 trials of each of the two phases, 
yielded a value of 3.04 with 9 df, which 
is significant at the 5% level. 
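
The t for paired measures can be sketched in Python as below. The function name and the sample scores are hypothetical; the sketch assumes one mean score per S in each phase and uses the usual formula, the mean difference divided by its standard error, with n - 1 degrees of freedom.

```python
from math import sqrt
from statistics import mean, stdev


def paired_t(first_phase, second_phase):
    """Return (t, df) for paired scores from the same Ss in two phases."""
    diffs = [b - a for a, b in zip(first_phase, second_phase)]
    n = len(diffs)
    t = mean(diffs) / (stdev(diffs) / sqrt(n))
    return t, n - 1


if __name__ == "__main__":
    # Made-up scores for four Ss.
    print(paired_t([1.0, 2.0, 3.0, 4.0], [2.0, 4.0, 4.0, 6.0]))
```

With 10 Ss the test has 9 df, for which the two-tailed 5% critical value of t is 2.262, so the reported value of 3.04 exceeds it.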

Sixteen analyses employing the 
measures based on S's path were per- 
formed, 8 at the end of acquisition 
and 8 at the end of reacquisition.
With one exception they proved to be 
nonsignificant. At the end of acquisi- 
tion the partial reinforcement group 
showed “significantly” more varia- 
bility in lateral position at a distance 
of 14 in. from the start box. While no
method of adjusting these results to a 
5% experiment-wise error rate exists, 
common sense suggests that the result 
not be declared significant. 

Mean reciprocal running times for 
the last 12 trials of initial acquisition 
were correlated with four measures of 
Ss’ paths on the same trials. The 
correlations with running speed were: 
(a) mean position of S in the lateral 
dimension (i.e. width) of the runway, 
—.09; (b) average within-trial variance
of lateral position computed
about the mean position, —.24;
(c) variance of the mean lateral position
across trials, .03; and (d) average
sum of absolute deviations from a
straight line connecting the start and
goal boxes, —.20.


DISCUSSION


The major findings involving the time 
data of this experiment were that non- 
reinforced trials, administered either in 
an extinction or partial reinforcement 
procedure, led to a change in the terminal
level of responding under subsequent 
continuous reinforcement. There was no 
sign of a decrease in the higher level of 
responding induced by nonreinforcement, 
which suggests that a nonreversible 
change in the reinforced trial operator 
has occurred. Although appropriate 




controls were lacking, the results also 
suggest that the larger the number of 
nonreinforced trials, the greater the effect 
induced on the reinforced trial operator. 
To this extent the qualitative predictions 
stemming from a modification of the 
Bush and Mosteller model received sup- 
port. It is difficult to assess the failure 
to find the predicted interaction between 
the extinction and proportion of rein- 
forcement variables. Ideally a power 
analysis should be performed. However, 
this is impossible without a good estimate 
of the expected magnitude of the inter- 
action, which depends critically on the 
unknown parameters of the various 
learning operators. 

The problem of providing a theoretical 
underpinning for the induced changes in
the reinforced trial operator remains.
One attempt in this direction (Wein- 
stock, 1953, 1958) leads to the prediction 
that the changes in speed of running 
should be reflected in a reduction of 
within-trial variability. This prediction 
received no support from the present 
results.

In general, the spatial location meas- 
ures were not related either to the in- 
dependent variables investigated or to 
speed of running. The question of the 
reliability of the measures arises. To 
obtain a rough answer, values for each 
of the four major measures for Trials 86 
to 91 were correlated with those for 
Trials 92 to 97. Values of .77, .66, .86, 
and —.19 were obtained for the absolute 
deviations from the midline, the variance 
within a trial, the mean position, and the 
variance of the mean position, respect- 
ively. For purposes of comparison the 
correlation coefficient was also computed 
for the running time measure for the 
same blocks of trials and a value of .83 
was obtained. With the exception of the 
variance of the mean position, it seems 
that the measures taken were sufficiently 
reliable.
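
The reliability check described above amounts to correlating, across Ss, each measure from one block of trials with the same measure from the next block. A minimal Python sketch of the product-moment correlation follows; the function name and the sample values are hypothetical.

```python
from math import sqrt


def pearson_r(x, y):
    """Product-moment correlation between two equally long lists of scores."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sum_xy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sum_xx = sum((a - mean_x) ** 2 for a in x)
    sum_yy = sum((b - mean_y) ** 2 for b in y)
    return sum_xy / sqrt(sum_xx * sum_yy)


if __name__ == "__main__":
    # Made-up scores: one value per S for each of two adjacent trial blocks.
    first_block = [0.40, 0.55, 0.62, 0.71, 0.48]
    second_block = [0.42, 0.50, 0.66, 0.69, 0.51]
    print(pearson_r(first_block, second_block))
```

A measure whose block-to-block correlation is near zero, as reported for the variance of the mean position, cannot be expected to correlate with any other variable.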




SUMMARY 


Forty albino rats were given two trials per 
day, separated by 45 min., in a straight alley
under 22 hr. of water deprivation. Half of the
Ss received 100% and half 50% reinforcement 
for 96 trials of acquisition. The two groups 
were further halved with half of each receiving 
24 extinction trials and the other half con- 
tinuing on their reinforcement schedules. 
All Ss then received 30 trials under 100% 
reinforcement. In addition to latency and 
running time measures, detailed recording was 
made, by use of a special “contact” recorder, 
of the path taken by S. 

The following results were obtained: At 
the end of acquisition 50% Ss were running 
more rapidly than 100% Ss but the difference 
was not statistically significant. During the 
last 30 trials of the experiment Ss who had 
had 50% reinforcement were running more 
rapidly than Ss who had had 100% reinforce- 
ment, and Ss who had had extinction were 
running more rapidly than those who had not. 
The 50% reinforcement Ss ran more rapidly 
after being shifted to 100%. These differences 
were statistically significant. The interaction 
between the extinction and proportion of 
reinforcement variables was not significant.

In general the various measures of spatial
location and variability which were taken 
showed no relation to the independent vari- 
ables and quite low correlations with speed of 
running. 

REFERENCES 


Bush, R. R., & Mosteller, F. Stochastic
models for learning. N. Y.: Wiley, 1955. 
NOTTERMAN, J. M. A study of some relations 
among aperiodic reinforcement, discrimina- 
tion training, and secondary reinforcement. 
J. exp. Psychol., 1951, 42, 273-291. 

Pearson, E. S., & Hartley, H. O. Biometrika
tables for statisticians. Cambridge:
Cambridge Univer. Press, 1954. 

Weinstock, S. Acquisition and extinction of
a partially reinforced running response at
a 24-hour intertrial interval. Unpublished
doctoral dissertation, Indiana University,
1953.

Weinstock, S. Acquisition and extinction of
a partially reinforced running response at a
24-hour intertrial interval. J. exp. Psychol.,
1958, 56, 151-158.


(Received September 8, 1961) 


Journal of Experimental Psychology
1962, Vol. 64, No. 4


EFFECTS OF ORIENTING TASK, PRACTICE, AND
INCENTIVE ON SIMULTANEOUS INCIDENTAL
AND INTENTIONAL LEARNING 1


ARNOLD MECHANIC 2


Northwestern University


In the Type I incidental learning
situation, S is given no instructions to
learn but is later tested for the materials
to which he was exposed. In
the Type II situation, S is exposed to
two sets of materials, instructed to
learn only one of the sets, and is later
tested for the materials which he was
not instructed to learn. The finding
that there is less incidental learning in
the Type II situation than in Type I
has been attributed to competition
from the intentional task for the
limited amount of exposure time which
is available to S (Mechanic, 1962). It
could therefore be expected that
certain intentional learning variables
would adversely affect Type II incidental
learning. Any experimental
variation which focuses S on the
intentional items should serve to
reduce the proportion of the total
exposure time available for responding
to the incidental items. Examples of
such variables would be strength of
the incentive motivating S to learn,
and difficulty of the intentional items.

1 This research was conducted during the
writer's tenure as a National Science Foundation
Postdoctoral Fellow. The writer is
indebted to B. J. Underwood for his valuable
suggestions and critical reading of the
manuscript.

2 Now at Alameda State College, Hayward,
California.

Bahrick (1954) and Bahrick, Fitts,
and Rankin (1952), using nonverbal
materials, found that the amount of
learning irrelevant to a set is inversely
related to the strength of the incentive
determining that set. Mechanic


(1962), however, found that incidental 
learning of verbal items did not vary 
as a function of the difficulty of the 
concurrently learned intentional items. 
The divergence between these find- 
ings, with regard to the task-competi- 
tion hypothesis, becomes explicable 
when we consider the different orient- 
ing tasks which Ss were required to 
perform. Mechanic’s Ss were re- 
quired to pronounce the incidental 
items in order to rate phonetic simi- 
larity. Thus, competition from the 
intentional items (for the available 
exposure time) could not prevent Ss 
from responding to the incidental 
items. In Bahrick’s study, on the 
other hand, there was no specific 
orienting task which required S to 
respond to the incidental stimuli. 
The S could, for example, learn 
geometric forms intentionally without 
necessarily responding to their inci- 
dental colors, and large amounts of 
competition from the intentional task 
could result in a failure to respond to 
the incidental stimuli. 

If the above explanation is appro- 
priate, it would be expected that 
the incentive for intentional learning 
would have little or no effect on con- 
current incidental learning when the 
orienting task required that S respond 
to the incidental stimuli. This experi- 
ment is an attempt to clarify the joint 
effects of orienting task and inten- 
tional learning variables on Type II 
incidental learning. By varying the 
degree of responding to the incidental 
items required by the orienting task, 




it should be possible to get a clearer 
picture of how intentional learning 
variables influence Type II incidental 
learning. Because the intentional 
learning variable that has been chosen 
is incentive, it will, at the same time, 
be possible to assess the effects of in- 
centive upon the intentional rote 
learning of verbal materials.


METHOD 


General design.—Three orienting tasks were
selected so as to differ in the degree of respond- 


All Ss were given a list made up of 12 pairs 
of trigrams. They were instructed to learn 


other language, 
task instructions, Ss were told that we also


bers of “pronouncing” responses to the incidental
items. By a pronouncing response,
reference is made to a hypothetical response
by S so that he reacts to the trigram as a
single pronunciable unit. The operations
involved in the three orienting tasks should
serve to further clarify this definition. The
choice of the pronouncing response as an
important response for rote learning is con- 
sistent with the recent conclusion of Under- 
wood and Schulz (1960) that, “emitted 
frequency is of fundamental importance in the 
integration of response elements [italics 
mine]” (p. 291). The orienting task devised 
to produce maximal responding to the inci- 
dental items will be called Phonetic Similarity 
ratings (PS). To PS Ss, the experiment was 
introduced as a test of the notion that words 
meaning the same thing in different languages 
tend to sound more alike than words with 
i The Ss were asked to 
similarity between the 
members of each pair of trigrams. The Ss 
were required to pronounce the members of 
each pair to themselves and to rate their 
ity on a five-point scale, 
instructions, learning in- 
structions for only one set of items were given 
as described above, 

It may be noted that this orienting task
required that S respond to the incidental
items as single pronunciable units. Another
orienting task was designed in order to
minimize the probability of S responding to
the incidental items in this manner. This
minimal responding situation will be referred
to as Letter Cancellation (LC). The LC Ss
were told that we were interested in the effects
of quick changes of set upon letter cancellation
performance.


All letters other 
» and U were considered 
Ss were asked to work 
not permitted to erase 
after having marked a letter. It is clear that 
this orienting task exposes S to the incidental 
stimuli without necessarily requiring that he 
respond to the items as pronunciable units. 
Having defined 
Pothetical dimension, an intermediate orient- 
Bert was selected, which will be called 


of the items from the two languages, 


These 50 cards were said to be in an envelope 
that was left on the desk in front of the room. 
The Ss were told that we wanted to check the

i investigators that 
Suess better than would be 
hance in such a situation. They 
were required to look at each pair of syllables 
and then rate the likelihood of its being 
represented in the envelope. As with the 




phonetic similarity judgments, the ratings of 
likelihood were on a five-point scale. This 
kind of orienting task was chosen because it 
seemed to offer some probability of S re- 
sponding to the incidental items as pronunci- 
able units. At the same time, such responses 
were not required as they were in the PS 
situation, and should be of lower frequency 
for the ESP group. However, the ESP group 
could clearly be expected to make more 
pronouncing responses than the LC group. 

It should be evident that the amount of
time required to perform each of the three 
orienting tasks will vary, and that effective 
exposure times for learning will vary con- 
comitantly. Although there are no quantita- 
tive data on this issue, it is apparent from 
observations made during the experiment that 
PS ratings take the most time while LC takes 
the least time. Because both intentional and 
incidental scores were obtained from each S, 
it will be possible to gauge the incidental 
scores for each orienting task by the corre- 
sponding intentional scores for that condition. 
The intentional scores may be thought of as 
an index of the time available for learning to 
take place under each respective orienting
task.

Incentive conditions.—The Ss were introductory
psychology students at Northwestern
University who met a course requirement
by serving as Ss. In addition, “incentive”
Ss were offered the possibility of one
“bonus hour” toward the required total of 
10 hr. They were told that an extra hour of 
credit would be given to all Ss who learned 
more of the intentional syllables than the 
average number learned by the group. It was
added that they would be eligible for the 
bonus only if their performance on the other 
task (orienting) met certain standards of 
acceptability. The LC and PS Ss were told 
that they would be eligible for the bonus only 
if their performances met minimal standards 
of accuracy. The ESP Ss were told that they 
would not be eligible if their guesses were in 
the bottom 10% of the group. It was made 
clear, however, that half of the Ss in the group 
would win bonus hours and would be informed 
of these awards promptly. 

The “nonincentive” Ss were not offered 
bonus hours, but were otherwise read in- 
structions identical to those of their incentive 
counterparts. Incentive Ss were questioned
about the effects of the incentive during a
postexperimental inquiry. Those who stated
that they did not need the bonus hour, and
that they did not try any harder as a result
of the incentive, were discarded and replaced.
The positive statements of many Ss would




seem to indicate that a bonus hour has 
incentive value in the same sense as do the 
small monetary bonuses paid by Bahrick 
(1954). 

Practice conditions.—The list of 12 pairs 
was exposed for either two or five presenta- 
tions. Successive presentations of the list 
involved different random orders for the set 
of 12 intentional items and the set of 12 
incidental items. As a result, the pairings of 
individual items from the two sets were 
randomly varied from presentation to pres- 
entation. This was done to prevent the 
learning of incidental items through the 
mediation of the intentional members of the 


pairs.

Design details.—The 12-pair lists were
constructed from two separate 12-unit trigram
lists. The two 12-unit lists were equated for
meaningfulness and were made up of pro- 
nunciable high-frequency items. The two 
lists and their method of construction have 
been presented elsewhere (Mechanic, 1962). 
For clarity of exposition, henceforth these 12-unit
lists will be referred to as sets and the
term list will refer to the 12-pair lists actually
presented to the Ss. Each of the 12 pairs in a
list was arranged vertically with one member 
in the top position and the other member 
directly below it. This arrangement was used 
to prevent the left to right chaining which 
might result from the reading habits of Ss. 
The top members of the pairs came from one 
of the stimulus sets while the bottom members 
came from the other set. Through presenta- 
tion of the list during training, both sets of 
items were presented concurrently to S. The 
Ss were, of course, instructed to learn only 
one of the two sets. 

Five random orders were prepared for each 
of the sets. This was done to obtain different 
orders of the items with successive presenta- 
tions of the list. Additionally this insured 
that the pairings of individual items from the 
two sets were randomly varied from trial to 
trial. For each of the five orders of presenta- 
tion, two variations were prepared. These 
reversed the top and bottom member posi- 
tions but were otherwise identical. 
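
The preparation of the presentation orders can be sketched in Python as below. This is a minimal illustration under stated assumptions: the item labels are placeholders, not the actual trigrams, the function names and seed argument are hypothetical, and each set is shuffled independently so that the pairings change from one presentation to the next.

```python
import random


def build_presentations(set_a, set_b, n_presentations=5, seed=None):
    """Return one list of (top, bottom) pairs per presentation, with each
    set placed in an independent random order on every presentation."""
    rng = random.Random(seed)
    presentations = []
    for _ in range(n_presentations):
        top = rng.sample(set_a, len(set_a))
        bottom = rng.sample(set_b, len(set_b))
        presentations.append(list(zip(top, bottom)))
    return presentations


def swap_positions(pairs):
    """Variation of a presentation with top and bottom members reversed."""
    return [(bottom, top) for top, bottom in pairs]


if __name__ == "__main__":
    set_a = [f"A{i}" for i in range(1, 13)]  # placeholder labels
    set_b = [f"B{i}" for i in range(1, 13)]
    for pairs in build_presentations(set_a, set_b, seed=0):
        print(pairs)
```

Shuffling the two sets independently is what prevents an incidental item from being learned through a constant association with a particular intentional item.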

The Ss were given a number of sheets of 
paper, each covered with a strip of cardboard 
in which a cut-out window allowed exposure 
of one pair of syllables at a time in accordance 
with E's instructions. The pairs of syllables 
on each sheet were numbered 1 through 12. 
When E called out the number 1, S moved
the window to expose the first pair of syllables 
and responded to the items in accordance with 
the requirements of the orienting task. The 
procedure was repeated for each successive 






TABLE 1

AVERAGE NUMBERS OF INTENTIONAL AND INCIDENTAL
ITEMS RECALLED BY THE VARIOUS GROUPS

                            Two Presentations                          Five Presentations
Type of Item        Task LC      Task ESP      Task PS       Task LC      Task ESP      Task PS
and Incentive      Mean   SD    Mean   SD    Mean   SD      Mean   SD    Mean   SD    Mean   SD

Incidental Items
  Normal            .70   .56   1.90  1.38   2.75  1.48     1.85  1.77   2.65  1.93   4.50  2.17
  Augmented         .65  1.11   1.30  1.05   2.75  2.16     1.75  1.58   2.35  2.24   4.45  1.66
  Total             .68         1.60         2.75           1.80         2.50         4.48

Intentional Items
  Normal           7.55  2.16   6.75  2.62   5.65  1.53    10.25  1.97   8.35  1.96   8.15  2.24
  Augmented        7.40  1.50   7.05  2.16   5.80  2.18    10.30  1.95   8.90  2.30   9.20  1.91
  Total            7.48         6.90         5.72          10.28         8.62         8.68


page had been 
presented at a 
between presenta- 
. A 5-min. test of 


an attempt was 
items, 


groups, half the Ss 
received materials with the items from one 


set in the top positions of the pairs, For 
the other half of the 
identical 


The Ss were tested in a large room in 
groups averaging 14 in number. Conditions 
were assigned to the groups in a random order 
for each replication of the basic experimental 
design. Instructions to learn top items were 
alternated with instructions to learn bottom 
items for the different groups serving in the 
same experimental conditions. The Ss within
each group were assigned the two different
sets of stimulus materials (top-bottom
variations) by rotation. The only restrictions on
these procedures were to correct for equal Ns.

Because Ss were tested in groups, extra Ss
were obtained in most of the experimental
variations. Those used in the analyses were
selected at random from the total number of
Ss in each variation. These selections were
made after discarding Ss whose postexperimental
reports indicated that they did not
understand the instructions, that they tried
to learn the incidental items, or that the bonus
hour had no value as an incentive. Less than
10% of the Ss appearing for the experiment
were discarded for one of these reasons.


RESULTS AND DISCUSSION 


Table 1 contains means and SDs 
for both the incidental and intentional 
learning scores for the 12 basic 
conditions. The incidental and in- 
tentional scores for any combination 
of experimental variables were ob- 
tained concurrently from the same Ss. 
Analyses of variance of the incidental
and intentional scores were done
separately.

Incentive.—If incentive is to be used
to study the effects of an intentional
learning variable on concurrent incidental
learning, it should be possible
to demonstrate that incentive is an
effective variable with regard to the
intentional items. The data in Table
1 indicate that incentive has no
significant effect upon intentional rote
verbal learning (F = 1.47, df = 1/228).
Therefore, it is not surprising that
incentive for the intentional items has
no effect upon concurrent incidental
learning (F = .69, df = 1/228). It
may be concluded that incentive, as
here manipulated, is not an effective
variable for either intentional or
incidental learning in the Type II
verbal learning situation. For the
purpose of evaluating other experi- 
mental variables, the normal and aug- 
mented incentive conditions may be 
regarded as replications of the experi- 
mental design. 

Degree of practice.—Both intentional
and Type II incidental learning
increase significantly as a function of
number of presentations (F = 86.83 
and F=32.23, respectively, df = 1/228, 
P < .001 in both cases). These data 
confirm earlier findings for the Type 
II situation (Mechanic, 1962). There 
are no interactions between practice 
and the other experimental variables. 
Orienting tasks—Table 1 clearly 
shows the predicted differences in in- 
cidental learning as a function of 
orienting task. With two incentive 
levels and two practice levels, there 
were four replications of the compari- 
son among orienting tasks. In every 
case, incidental learning increased 
directly as a function of the hypothesized
number of pronouncing responses.
There were no reversals of
order and the differences within each 
replication were of substantial magni- 
tude. The overall analysis indicates 
the highly significant effect of ori- 
enting task on incidental learning 
(F = 40.05, df = 2/228, P < .001). 


It could be argued that the effects of
orienting task were not due to the hypothetical
pronouncing dimension which
dictated the choice of the three tasks.
Rather, the tasks may have differed in 
time required to perform them, with 
concomitant variation in the amount of 
exposure time available for learning. If 
this is the case, intentional learning 
should show the same relation to orient- 
ing task as does incidental learning. 
Previous research indicates that amount 
of intentional learning may vary with the 
nature of the orienting task (Postman & 
Adams, 1956). Again, Table 1 shows four 
replications of the comparison among 
tasks. For three of the replications, the 
order for intentional learning was directly 
opposite to that for incidental learning. 
The LC Ss showed the greatest learning 
while the PS Ss showed the least learning. 
The fourth replication (augmented in- 
centive-five presentations) showed a re- 
versal with the PS Ss scoring slightly 
higher than the ESP Ss. In spite of this 
minor reversal, the reduction in inten- 
tional learning from LC to ESP to PS is 
highly significant (F=13.55, df=2/228, 
P<.001). This would seem to indicate 
that the predicted differences in inci- 
dental learning are not due to the differ- 
ences in time and effort required by the 
various orienting tasks. If anything, the 
predicted differences in incidental learn- 
ing may be minimized because the 
orienting tasks requiring more pronounc- 
ing responses happen (when gauged by 
the intentional scores) to be more difficult,
or time consuming to perform, than
the tasks requiring fewer pronouncing
responses.

It cannot be concluded that these data 
favor a pronouncing response hypothesis 
without considering still another alter- 
native interpretation. Perhaps the ef- 
fects of orienting task are a function of 
how much time the orienting task re- 
quires S to devote to the incidental 
items. If an orienting task requires S to
spend more time on the incidental items, 
it is reasonable to assume that less 
exposure time will be available for the 
intentional items. This “time” inter- 
pretation can explain the finding that 
from LC to ESP to PS, incidental 
learning increases while intentional learning
decreases. The data are accounted
for without utilization of the hypothetical
pronouncing responses which originally
dictated the choice of the three orienting 
tasks. However, the data in Table 1 
allow a check on this interpretation by 
means of a further analysis. It may be 
noted that at five presentations, the
intentional means for ESP (8.35 and
8.90) were approximately equal to the
corresponding means for PS (8.15 and
9.20). Whereas intentional learning did
not decrease from ESP to PS, incidental
learning increased greatly (2.65 and 2.35
vs. 4.50 and 4.45). Even where the
intentional scores were not lower, and
could not therefore be said to reflect
greater time available for incidental
learning, incidental learning increased as
a function of hypothesized number of
pronouncing responses.

At two presentations, it appears that
the “time” interpretation is consistent 
with the data. This interpretation is 
based upon the distribution of the total 
exposure time between intentional and 
incidental learning. It implies that the 


after performing 
Even if the ESP Ss, b
on the orienting task, are learning less 
incidentally, they should at least make 
up for this by learning more intentional 
items in the greater time period remain- 
ing. The time interpretation would re- 
quire that the ESP Ss learn no fewer 
items than the PS Ss. The pronouncing
response interpretation has no such
requirement.

In order to evaluate the above argu- 
ment statistically, the different incentive 
groups in Table 1 were combined to give 
40 Ss for each orienting task at each level 
of practice. For each S, the intentional 




and incidental scores were combined to 
give a total learning score. The mean 
total learning for each orienting task 
after two presentations is 8.15, 8.50, and 
8.48 for LC, ESP, and PS, respectively. 
Comparable means with five presentations
are 12.08, 11.12, and 13.15, respectively.
With two presentations, the
total learning scores for the three
orienting tasks do not differ (F = .20,
df = 2/117). As noted above, the data
for two presentations do not distinguish
between the pronouncing and time interpretations.
While the pronouncing view
predicts differences in incidental learning,
equality of total learning could still
result from differences in amounts of
intentional learning permitted by the
different orienting tasks.

The data for five presentations argue 
against a time interpretation. The total 
learning scores of the three task groups 
show a significant degree of variation 
(F = 4.38, df = 2/117, .01 < P < .05). 
The gap between Groups ESP and PS is 
significant beyond the .01 level. Thus,
the differences in incidental learning 
among the three orienting tasks are not 
compensated for by reciprocal differ- 
ences in intentional learning. Clearly, 
the differences obtained as a function of 
orienting task are not merely due to 
different distributions of S's time be- 
tween the incidental and intentional 
items. It would seem more reasonable 
to conclude that the differences in inci- 
dental learning among the tasks are a 
function of the activities required by the 
different tasks.
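The total learning means quoted above can be checked against Table 1. The following is a minimal sketch, assuming that each total is the sum of the intentional and incidental cell means averaged over the two incentive conditions (which holds when the cells have equal numbers of Ss, here 20 each). The dictionary layout and function name are illustrative choices, not part of the original analysis.

```python
# Cell means from Table 1, keyed by (presentations, task).
# Each value is (normal incentive mean, augmented incentive mean).
incidental = {
    (2, "LC"): (0.70, 0.65), (2, "ESP"): (1.90, 1.30), (2, "PS"): (2.75, 2.75),
    (5, "LC"): (1.85, 1.75), (5, "ESP"): (2.65, 2.35), (5, "PS"): (4.50, 4.45),
}
intentional = {
    (2, "LC"): (7.55, 7.40), (2, "ESP"): (6.75, 7.05), (2, "PS"): (5.65, 5.80),
    (5, "LC"): (10.25, 10.30), (5, "ESP"): (8.35, 8.90), (5, "PS"): (8.15, 9.20),
}

def total_learning(key):
    """Mean total score: incidental plus intentional, pooled over incentive."""
    inc = sum(incidental[key]) / 2
    inten = sum(intentional[key]) / 2
    return inc + inten

# Means reported in the text for comparison
reported = {
    (2, "LC"): 8.15, (2, "ESP"): 8.50, (2, "PS"): 8.48,
    (5, "LC"): 12.08, (5, "ESP"): 11.12, (5, "PS"): 13.15,
}

for key, value in reported.items():
    print(key, round(total_learning(key), 3), value)
```

The recomputed values agree with the reported ones to within rounding of the second decimal place.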

Item analysis.—Underlying our view of 
the incidental verbal-learning situation 
is the assumption that learning—both 
intentional and incidental—takes place 
as a result of pronouncing responses made 
by S to the stimulus items. If these same
kinds of responses underlie both intentional
and incidental learning, the stimuli
facilitating these responses should be the
same for both forms of learning. Those
stimulus items most frequently learned 
intentionally should also show most fre- 
quent incidental learning. There are 
data for 24 trigrams for both intentional 
and incidental learning. The frequencies 




with which each item was given as a 
correct intentional response by 120 Ss 
were correlated with the frequencies with 
which each item was given as a correct 
incidental response by the other 120 Ss. 
The product-moment correlation across 
the 24 items was .81. For purposes of 
evaluating this correlation, r’s were com- 
puted for the 24 items between: (a) in- 
tentional learning at two presentations 
and intentional learning at five presenta- 
tions; (b) incidental learning at two 
presentations and incidental learning at 
five presentations; and (c) total learning 
at two presentations and total learning 
at five presentations. The r's were .76,
.59, and .82, respectively. These three
r's were computed as measures of the 
reliability of item difficulty. Clearly, the 
item by item correlation between inten- 
tional and incidental learning matches 
the reliability indices of item difficulty. 
These results are consistent with the 
belief that the same responses are re- 
quired in both intentional and incidental 
learning. In this view, instructions to 
learn increase learning because they 
facilitate the performance of the appro- 
priate responses. 
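The item-by-item comparison above rests on a product-moment correlation between two sets of item frequencies. A minimal sketch of that computation follows. The frequencies shown are invented for illustration (six hypothetical items, not the 24 trigrams of the study), and the function name is an assumption.

```python
from math import sqrt

def pearson_r(x, y):
    """Product-moment correlation between two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))  # co-deviation sum
    sxx = sum((a - mean_x) ** 2 for a in x)                       # squared deviations of x
    syy = sum((b - mean_y) ** 2 for b in y)                       # squared deviations of y
    return sxy / sqrt(sxx * syy)

# Hypothetical recall frequencies per item (NOT the study's data)
intentional_freq = [90, 75, 60, 82, 40, 55]
incidental_freq = [30, 22, 15, 25, 8, 18]

r_items = pearson_r(intentional_freq, incidental_freq)
print(round(r_items, 2))
```

In the study the same statistic was computed across the 24 items, once between intentional and incidental frequencies and three more times as reliability checks between the two practice levels.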


SUMMARY 


Type II incidental learning refers to the 
situation in which S is exposed to two sets of 
materials, instructed to learn only one of the 
sets, and is later tested for the materials which 
he was not instructed to learn. It was
expected that such learning will vary with:
(a) the orienting task by which S is exposed
to the incidental items; (b) the incentive for
the concurrent intentional items; and (c) the
number of presentations. Twelve verbal
learning groups of 20 Ss each were run in a
standard 3 X 2 X 2 factorial design, with the 
variables being orienting task, incentive, and 
number of presentations, respectively. Sepa- 
rate scores were obtained for intentional and 
incidental learning from each S. Incentive 
was found to have no effect on either inten- 
tional or incidental verbal learning. In 
agreement with previous results, both kinds 
of learning increased reliably with number of 
presentations. Incidental learning increased 
reliably as a function of the hypothesized 
number of pronouncing responses required by 
the orienting task. After considering alternative
explanations, it was concluded that the
nature of the responses required of S by the
orienting task is of crucial importance for the
amount of incidental learning which occurs.


REFERENCES 


BAHRICK, H. P. Incidental learning under
two incentive conditions. J. exp. Psychol.,
1954, 47, 170-172.

BAHRICK, H. P., FITTS, P. M., & RANKIN,
R. E. Effect of incentives upon reactions
to peripheral stimuli. J. exp. Psychol.,
1952, 44, 400-406.

MECHANIC, A. The distribution of recalled
items in simultaneous intentional and
incidental learning. J. exp. Psychol., 1962,
63, 593-600.

POSTMAN, L., & ADAMS, P. A. Studies in
incidental learning: IV. The interaction of
orienting tasks and stimulus materials.
J. exp. Psychol., 1956, 51, 329-333.

UNDERWOOD, B. J., & SCHULZ, R. W. Meaningfulness
and verbal learning. Philadelphia:
Lippincott, 1960.


(Received September 13, 1961) 


Journal of Experimental Psychology
1962, Vol. 64, No. 4, 400-404


RETROACTIVE INHIBITION OF R-S ASSOCIATIONS 1


GEOFFREY KEPPEL AND BENTON J. UNDERWOOD
Northwestern University 


Consider the S-R association that is
formed during the learning of a paired-
associate list (A-B). If a second list,
having an A-C relationship to the
first list (i.e., stimuli same, responses
different), is interpolated between
A-B learning and subsequent A-B
recall, the A-B association appears to
go through an extinction process
(Barnes & Underwood, 1959; Briggs,
1954). This finding can lead to the
assumption that extinction will occur
whenever an A-B, A-C interference
paradigm exists. Furthermore, negative
transfer on the A-C list appears
to be associated with, and may be
caused by, extinction of the first-list
S-R associations. Finally, it seems
apparent that retroactive inhibition
(RI) in recall which occurs immediately
after interpolated learning will
be primarily determined by the
amount of extinction occurring during
the learning of the second list.


son, 


flicting backward associations may be 
responsible for negative transfer in 
certain paradigms, particularly in the 
A-B, C-B paradigm (stimuli different, 
responses identical), 
that if a backward association devel- 
ops in learning A-B, and if, similarly, 
a backward association is to develop 
in learning C-B, the backward associa- 
tion of List 1 (B-A) will interfere with 


1 This work was done under Contract
Nonr-1228 (15), Project 154.057 between
Northwestern University and the Office of
Naval Research.


the learning of the backward associa- 
tion in List 2 (B-C). In effect, the 
backward associations in the A-B, 
C-B paradigm form an A-B, A-C 
paradigm. If the R-S associations 
between two lists form an A-B, A-C 
paradigm, it seems quite possible that 
the R-S (B-A) associations of List 1 
may also be extinguished in the same 
fashion as the S-R (A-B) associations. 
Barnes (1960) has inferred this ex- 
tinction and she accounts for certain 
transfer phenomena by it. More 
particularly, Barnes inferred three 
different types or classes of associa- 
tions which are subject to extinction 
whenever they complete an A-B, A-C 
paradigm between two lists: (a) for- 
ward, (b) backward, and (c) contex- 
tual associations. The present study 
provides a more direct test of the 
effect of extinction of R-S associa- 
tions; in effect, it will be a study of 
RI of backward associations. Four
paradigms will be used; if List 1 
learning is symbolized as A-B, then 
the four paradigms may be distin- 
guished in terms of List 2 symbols and 
their relation to List 1. Expectations 
concerning the RI of List 1 R-S 
associations for the four paradigms 
may now be stated. 

A-B, A-C paradigm.—In learning the A-C
list, no A-B, A-C paradigm is set up for the
backward association, B-A; hence, no extinction
of B-A is to be expected. This prediction
implies complete independence between the 
forward and backward associations. How- 
ever, to the extent that these two associations 
are nonindependent, RI of the B-A associa- 
tion will be affected by the extinction of the 
A-B association, since an A-B, A-C relation 


C paradigm offers an emcees 
Since the only possible 




source of extinction of the B-A association 
lies in the extinction of the A-B association. 

A-B, A-Br paradigm.—In this paradigm
List 2 contains the same stimuli and responses 
as List 1, but the stimulus and response terms 
are paired differently than in List 1. In 
learning the A-Br list an extinctive relation- 
ship (A-B, A-C) is formed for the backward 
association, i.e., B-A, B-Ar, so that RI of the 
B-A association is to be expected in this 
paradigm. In addition, further RI will be 
produced to the extent that the A-B, A-C 
relationship for the forward association 
directly affects the backward association. 
Since both the A-C and A-Br paradigms 
contain an extinctive relationship for the 
forward association while a similar relation- 
ship for the backward association is present 
in the A-Br paradigm only, a comparison 
between these two paradigms will provide a 
direct test of the hypothesis that extinction 
of the backward association can occur. 

A-B, C-D paradigm.—This paradigm
specifies the learning of two unrelated lists and 
may provide for a third type of association 
which is subject to extinction. Although an 
A-B, A-C relationship exists neither for the 
backward association nor the forward associa- 
tion for item pairs, there is the possibility of an 
extinctive relationship being set up between
contextual cues and the stimuli of the two
lists. In the learning of a list of paired
associates, the responses must first be ac- 
quired before they enter into an associative 
connection (Underwood & Schulz, 1960). In 
the same manner, when the backward associa- 
tion is measured, the stimulus term must 
necessarily have been acquired as a response 
if it is evoked. Although it is not known to 
what stimulus either the response term or the 
stimulus term are attached when learned in 
this sense, it can be assumed to be some 
aspect of the experimental context, e.g., the
memory drum, the experimental room, etc. 
If the association between contextual cues 
and the stimuli of List 1 are represented as 
A-B, an A-B, A-C relationship is formed 
during the learning of List 2 since a different 
set of stimuli (C) must now be attached to the 
same contextual cues. It is in this
manner that the contextual association is
thought to provide for a potential class of 
extinction-susceptible associations. At any 
rate significant RI of the backward association 
for this paradigm can be taken as support for 
the existence of this third class of associations. 

A-B, C-B paradigm.—In this paradigm
two types of associations which can form an 
A-B, A-C association may be identified and 
predicted to contribute to the RI of the B-A
association, namely, the backward association
and the contextual association. The potential 
A-B, A-C relationships for these two classes 
of association already have been elaborated 
above. As with the first two paradigms 
listed, the C-D and C-B paradigms allow for 
a direct test of the extinction of the backward 
association; that is, the contextual association 
as a source of extinction is held in common 
for the two paradigms so that greater RI in 
the C-B paradigm will indicate direct ex- 
tinction of the R-S association. 
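The logic of the two critical comparisons can be laid out as a small lookup table. This is a sketch of one reading of the expectations stated above, not a formalization given by the authors: the mapping of paradigms to sources of extinction, the label strings, and the helper function are all illustrative assumptions.

```python
# Classes of association said to be open to extinction (labels are illustrative)
FORWARD = "forward (S-R)"
BACKWARD = "backward (R-S)"
CONTEXTUAL = "contextual"

# Sources of extinction expected under each List 2 paradigm, as read from the text
EXTINCTION_SOURCES = {
    "A-C": {FORWARD},
    "A-Br": {FORWARD, BACKWARD},
    "C-D": {CONTEXTUAL},
    "C-B": {BACKWARD, CONTEXTUAL},
}

def isolates_backward(with_backward, without_backward):
    """True if the two paradigms differ only in the backward source."""
    a = EXTINCTION_SOURCES[with_backward]
    b = EXTINCTION_SOURCES[without_backward]
    return b <= a and a - b == {BACKWARD}

print(isolates_backward("A-Br", "A-C"))  # comparison of the first two paradigms
print(isolates_backward("C-B", "C-D"))   # comparison of the last two paradigms
```

On this reading, each pair holds one other source in common, so greater RI in the first member of a pair points to extinction of the R-S association itself.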


METHOD 


Design.—The design was essentially iden- 
tical to that employed by Barnes (1960). 
Since an RI paradigm was employed, all Ss 
received the same List 1; the groups were 
differentiated only by the construction of 
List 2. The four transfer paradigms of A-B, 
C-D; A-B, C-B; A-B, A-C; and A-B, A-Br 
were represented. A control group (Group C) 
received the single A-B list and was used to 
measure the amount of stimulus recall possible 
following an interval equal to the time that 
List 2 was presented to the transfer groups. 

Lists—The same materials constructed by 
Barnes were used in the present study. Each 
list consisted of eight nonsense syllable- 
adjective pairs. The stimuli for all lists 
consisted of nonsense syllables of 53% to 73%
Glaze association value with intralist simi- 
larity and, where applicable, interlist simi- 
larity at a minimum. Responses were 
common adjectives having minimal intralist 
and interlist meaningful similarity. Four 
orders of presentation were constructed for 
each list to minimize serial learning of the 
responses; each order was used as a starting 
order an equal number of times for each 
condition. 

Subjects.—Twenty Ss were placed in each
of the five groups. To assure random assign- 
ment of Ss, the five experimental conditions 
were randomized 20 times so that in each of 
the 20 blocks of 5 Ss, each condition occurred 
once. The Ss were assigned to the experi- 
mental conditions in the order of appearance 
in the laboratory. The Ss were students of 
undergraduate psychology courses for whom 
serving in experiments was a class require- 
ment. Most Ss had previously served in 
other paired-associate experiments; however, 
none had experienced an RI paradigm nor had 
been given a procedure involving stimulus 
recall. 
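The assignment scheme just described is a form of blocked randomization: conditions are shuffled within each block of five, and Ss take the next slot in order of arrival. A minimal sketch follows, assuming the five condition labels used in this report. The function name and the fixed seed are hypothetical and serve only to make the example repeatable.

```python
import random

CONDITIONS = ["C-D", "C-B", "A-C", "A-Br", "C"]

def blocked_schedule(n_blocks=20, seed=0):
    """Return a condition for each successive S, randomized within blocks of five."""
    rng = random.Random(seed)  # seed is an assumption, for repeatability only
    schedule = []
    for _ in range(n_blocks):
        block = CONDITIONS[:]   # every condition occurs once per block
        rng.shuffle(block)      # order within the block is random
        schedule.extend(block)
    return schedule

schedule = blocked_schedule()
# The k-th S to appear in the laboratory receives schedule[k]
print(schedule[:5])
```

Blocking in this way guarantees 20 Ss per condition while keeping the conditions spread evenly over the period of testing.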

TABLE 1

MEAN R-S RECALL

Groups    Mean    σm
A-Br      3.40    .34
A-C       5.05    .43

Procedure.—The paired associates were
presented on a memory drum by the anticipation
method at a 2:2 sec. rate. A 4-sec.
intertrial interval was employed.
learning was taken to a criterion of one perfect
recitation. For all groups except the control, 


terion, was given a corresponding 1-min. rest
and then instructions for the interpolated 


List 1 stimuli opposite the appropriate List 1 
responses listed on a dittoed sheet. 


commencement of the test, 
reach List 1 criterion, while 1 S was dropped
for failure to follow anticipation instructions, 
and 1 S failed to follow recall-test instructions, 


RESULTS


The mean number of trials to reach 
criterion on List 1 for the five groups 
ranged from 10.55 to 15.10. An 
analysis of these means, however, 
failed to reach significance (F = 1.68, 
df = 4/95, P > .05). Although not
of primary interest in the present
experiment, the performance of the
transfer groups on List 2 revealed 
Group C-D to be superior to all groups 
throughout the 15 acquisition trials of 
List 2, and Group A-Br to be generally 
inferior. These results are quite in 
line with those of previous investigators
(Barnes, 1960; Twedt & Underwood,
1959).

In measuring for the presence of 
backward or R-S associations, S was
given the response terms of the A-B
list and was asked to supply the stimulus
terms paired with them during
original learning. Only if a stimulus
term was placed with its appropriate
response term was a backward association
said to have been demonstrated.
The mean number of R-S associations
for each group is shown in Table 1.
The score for Group C, which is approximately
72.5% of the maximum
possible (8.00), represents R-S recall
without any apparent possibility of
extinction of R-S associations. Inspection
of Table 1 shows all transfer
groups to have suffered some degree
of RI when compared to Group C.
An analysis of variance of the recall
scores produced highly significant
variation (F = 13.70, df = 4/95,
P < .001). Comparisons of means
were made using the within mean sum
of squares to estimate the standard
error (s_diff = .59); only the differences
between Group A-C and Groups C-D
and C did not prove to be reliable.

A problem of method arises in the 
interpretation of certain of the differences
between means. An unavoidable
procedural difference between
Groups A-Br and A-C and the other 
three groups is the fact that the former 
groups received 15 more exposures to 
the A stimulus terms than did the 
latter groups. Increased exposure to 
the A terms during List 2 learning 
may make these terms more available 
to S at recall. This does not mean 
that the B-A associations will necessarily
be stronger, but only that the
A terms might be more available as
responses at recall. This, in turn,
might increase the number of "hits"
if S guessed. There is, thus, the
distinct possibility that, relative to
Groups C-B and C-D, the amount of
RI for Groups A-Br and A-C is under- 
estimated. 

The results for the four transfer 
paradigms may now be assessed in 
terms of the three classes of associa- 
tions thought to be susceptible to ex- 
tinction whenever an A-B, A-C asso- 
ciation is formed. The fact that 
significant RI was not obtained in 
Group A-C (t = 1.27, df = 95, P > .05)
suggests that extinction of the A-B 
association does not result in the ex- 
tinction of the corresponding B-A 
association as indexed by subsequent 
R-S recall. It is possible, however, 
that the methodological problem dis- 
cussed above would result in an over- 
estimation of the amount of recall for 
Group A-C through an increase in the 
number of stimulus terms available 
to S. Without any data concerning 
the extent to which the recall scores 
are inflated by guessing, it can only be 
tentatively concluded that extinction 
of the S-R and R-S associations
represent two independent processes. 

A comparison of the recall for 
Groups A-Br and A-C will not be 
biased due to differential stimulus- 
term exposure because both groups 
had the same number of exposures. 
Since both groups contain the S-R 
extinction factor in common, the 
significant difference in R-S recall for 
these two groups (t = 2.80, df = 95, 
P < .01) may be taken as one indica- 
tion of an extinction of R-S associa- 
tions when these associations form an 
A-B, A-C paradigm. 
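The t values reported for these comparisons follow directly from the group means and the standard error of the difference (.59) given earlier. The sketch below reproduces two of them. It assumes that each t is simply the difference between two means divided by that standard error, and it takes the Group C mean from the statement that it is approximately 72.5% of the maximum of 8.00. The function and constant names are illustrative.

```python
def t_from_means(mean_a, mean_b, se_diff):
    """t as the difference between two means over the standard error of the difference."""
    return (mean_a - mean_b) / se_diff

SE_DIFF = 0.59            # standard error reported in the text
MEAN_A_C = 5.05           # Table 1
MEAN_A_BR = 3.40          # Table 1
MEAN_C = 0.725 * 8.00     # assumption: "approximately 72.5%" of 8.00 taken as exact

t_ac_vs_abr = t_from_means(MEAN_A_C, MEAN_A_BR, SE_DIFF)
t_c_vs_ac = t_from_means(MEAN_C, MEAN_A_C, SE_DIFF)

print(round(t_ac_vs_abr, 2))  # compare with the reported t = 2.80
print(round(t_c_vs_ac, 2))    # compare with the reported t = 1.27
```

Both recomputed values match the reported statistics, which suggests that the same pooled standard error was used for every pairwise comparison.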

The final critical comparisons are 
between Groups C-D and C-B. Since 
Group C-D contains only the con- 
textual extinction factor and Group 
C-B both the contextual and R-S 
extinction factors, the greater RI for 
Group C-B (t = 4.66, df = 95, P < .01)
indicates again that the R-S association
can be extinguished. The fact
that the contextual associations are
involved in the recall of Groups C-B
and C-D is shown by the comparison
between Groups C-D and C (t = 2.03,
df = 95, .01 < P < .05).


DISCUSSION


This study was designed to study RI 
of backward (R-S) associations. The 
evidence presented makes it clear that 
when R-S associations between two 
successively learned lists form an A-B, 
A-C paradigm, RI of List 1 R-S associa- 
tions may occur. Indeed, in the A-B, 
C-B paradigm, the magnitude of the RI 
was great; a difference of about four 
items was apparent between the control 
condition and recall of List 1 R-S associa- 
tions with this paradigm. In line with 
certain previous interpretations of RI of 
S-R associations (Barnes & Underwood, 
1959), the RI of R-S associations in the 
present experiment has been interpreted 
as being due to an extinction or un- 
learning of the R-S associations of the 
first list during the learning of the 
second. Certain issues related to such an 
interpretation require further discussion. 

The fact that the A-Br paradigm 
resulted in a significant decrement in 
recall of R-S associations strongly implies 
an extinction process of specific R-S 
associations in the first list. Indeed, the 
amount of decrement observed is prob- 
ably underestimated in the present 
experiment. As noted earlier, stimulus 
terms should be more available in this 
paradigm than in the control condition 
because of the greater number of trials 
they were presented. This could result 
in more correct R-S pairings based on 
guessing alone. That guessing might 
have occurred is suggested by the fact 
that in 22 instances Ss gave a correct 
stimulus term but paired it with the 
wrong response term, and in only 1 of 
these 22 cases was the pairing appropriate 
to the association learned in List 2. Of 
course, if guessing did occur it may have 
been heightened because R-S associations 
of the first list had indeed been extinguished;
Ss knew the stimulus terms but
extinction of the specific R-S associations
left them with no appropriate pairing
tendencies. Such an interpretation 
might be supported by the fact that in 
the A-C paradigm only 9 stimulus terms 
were paired incorrectly and in this para- 
digm frequency of stimulus-term ex- 
posure was equal to that of the A-Br 
paradigm, but no extinction of R-S 
associations should have occurred.

In addition to extinction of R-S 
associations between pairs of items in the 
two lists, it has been suggested that
another source of extinction may con- 
tribute to the observed recall decrement, 
namely, extinction of contextual associa- 
tions. The extinction of such associa- 
tions is said to be the sole source of 


not been extinguished. It is clear, therefore,
that further assumptions are needed
in order to mediate the findings if extinction
of contextual associations is to be
retained as a basic notion.


several alternative assumptions which 
may be added and which would satis- 


However, 
it seems most apparent that the critical 


experimental simulation of contextual 
cues (hence associations) in order to 
bring them under laboratory control and
thus determine whether or not the basic 
notion (extinction of contextual associa- 
tions) is sound. 

Finally, it should be pointed out that
just as in the case of presumed extinction
of S-R associations (Barnes & Underwood,
1959), the reason why all R-S
associations are not extinguished is not 




SUMMARY 


This experiment investigated RI of back- 
ward or R-S associations. Various interlist 
relationships were studied on the premise that 
first-list R-S associations, like S-R associa- 
tions, would be subject to extinction if the 
R-S associations for the two lists formed an 


paradigm formed, these being: C-D, C-B,
A-C, and A-Br. Following acquisition of the
second list, S was allowed 2 min. to write
down the stimulus terms from the first list
opposite the appropriate first-list response
terms. A control group was given no second
list.

All groups showed significant RI in R-S 
recall except the group learning the lists 
forming the A-B, A-C paradigm. No source 
of extinction of R-S associations for this 
paradigm is apparent and extinction of the 
S-R association does not affect the R-S 
association, 
A-B, C-B; this paradigm may involve extinc- 
tion of specific R-S associations as well as 
contextual associations. The A-Br paradigm,
involving only extinction of specific R-S 
associations, produced more RI than did the 
A-B, C-D paradigm; this latter paradigm is 
assumed to involve a decrement resulting 
only from extinction of contextual as- 
Sociations, 

REFERENCES 


Barnes, J. M. “Fate” revisited. Unpub-
lished doctoral dissertation, Northwestern
University, 1960.

Barnes, J. M., & Underwood, B. J. “Fate”
of first-list associations in transfer theory.
J. exp. Psychol., 1959, 58, 97-105.

Briggs, G. E. Acquisition, extinction, and
recovery functions in retroactive inhibition.
J. exp. Psychol., 1954, 47, 285-293.

Comparison of S-R and R-S 

ning of paired-associates, Psychol. Rep., 

8. 


Twedt, H. M., & Underwood, B. J. Mixed
vs. unmixed lists in transfer studies. J. exp.
Psychol., 1959, 58, 111-116.

Underwood, B. J., & Schulz, R. W. Mean-
ingfulness and verbal learning. Chicago:
Lippincott, 1960.


Journal of Experimental Psychology 
1962, Vol. 64, No. 4, 405-409 


CUE SELECTION IN PAIRED-ASSOCIATE LEARNING 


BENTON J. UNDERWOOD, MARGARET HAM, AND BRUCE EKSTRAND


Northwestern University 


Consider a paired-associate list in 
which the stimulus term for each re- 
sponse consists of two distinct com- 
ponents, A and B. Both components 
are consistently present on each 
learning trial. Assuming that learn- 
ing occurs in this situation, there are 
many possible interpretations which 
may be given as to what the effective 
stimulus or cue is for each response. 
It might be said that the effective cue 
is a configuration formed by A and B. 
Or, it might be said that each com- 
ponent is independently a cue for the 


response; that one or the other com-
ponent is the effective cue, but not
both, and so on. 

The present study is predicated on 
the notion that when a complex stim- 
ulus is presented to S a selection 
process may occur so that the effective 
cue for the response is some com- 
ponent of the complex stimulus that 
is actually presented. Thus, the 
assumption is that there may be
a discrepancy between the nominal 
stimulus (the stimulus actually pre- 
sented S) and the functional stimulus 
(the component of the nominal stim- 
ulus which becomes the effective cue 
for response elicitation). That such 
discrepancies may exist is suggested
by the reports of Ss that they have 
used only a single letter of a three-
letter stimulus as the effective cue
(Underwood & Schulz, 1960). Such 
discrepancies might also be inferred 
from the so-called context experi- 
ments (e.g., Weiss & Margolius, 1954) 
in which the removal of a component 
of a compound nominal stimulus pro- 
duces a decrement in recall, although 
such studies offer other interpretative 


possibilities, e.g., the functional stim- 
ulus is a configuration and the removal 
of any component reduces the effective 
associative strength. 

If cue selection occurs—if only a 
part of a compound stimulus becomes 
the functional stimulus—certain vari- 
ables should influence the selection. 
The hypothesis tested in the present 
experiment is that given two com-
ponents of different classes as the
nominal stimulus, the more meaning- 
ful component will become the func- 
tional stimulus. This hypothesis 
seems very close to the notion of dif- 
ferences in discriminability as a vari- 
able determining stimulus selection, 
a notion suggested by Sundland and 
Wickens (1962), and for which some 
experimental support was obtained. 

The particular predictions for the 
present study may now be specified. 
Two lists for original learning were 
constructed. The stimulus compound 
for one list consisted of colors and low- 
meaningful trigrams; for the other 
list the compound consisted of colors 
and common three-letter words. For 
the first list it was assumed that the 
colors were more meaningful than the 
trigrams; therefore, the functional 
stimuli should be the colors. In the 
case of the word-color compound it 
was assumed that the words were 
more meaningful than the colors, 
hence, the functional stimuli should be 
the words. (It would have been more 
precise to have used two sets of verbal 
units of known meaningfulness for the 
compounds, but the use of colors was 
recommended by the desire to keep 
the experiment continuous with the 
context experiments.) Given the 




above assumptions, it was predicted 
that following the learning of the list 
with the trigram-color compounds, 
very little decrement would be ob-
served if the trigrams were removed
on a transfer test, but that a great 
loss would appear if the colors were 
removed from the compound. Con- 
trariwise, in the case of the word-color 
compounds, removal of the words on a
transfer test would result in a large 
loss but removal of the colors would 


have little effect on transfer per- 
formance. 
METHOD 
The general procedure required that half 


the Ss learn an original list with trigram- 
color compound stimuli, and half learn a list 
with word-color stimuli. To test for each 


trigrams, and C for colors, and the symbols 
before a hyphen designate the stimulus during 
original learning, those after the hyphen the 
stimulus on the transfer test. The six groups 
are, therefore: WC-WC and TC-TC (the two 
control groups); WC-C and TC-C (only color 
stimuli on transfer test); WC-W and TC-T 
(the verbal units appear alone as the stimuli 
on the transfer test).
Lists.—The materials for the lists are
shown in Table 1.


list consisted of 


TABLE 1
STIMULUS COMPONENTS USED IN THE LISTS

Words    Trigrams    Colors
GAS      GWS         Red
DAY      DWK         Brown
NEW      NXQ         Yellow
DIE      DHX         Blue
BAD      BWD         Orange
GOT      GVS         Black
BED      BXD         Green


B. J. UNDERWOOD, M. HAM, AND B. EKSTRAND


TABLE 2
MEAN NUMBER OF TRIALS TO CRITERION ON
ORIGINAL LEARNING AND MEAN NUMBER
OF ITEMS LOST ON FIRST
TRANSFER TRIAL

         Original Learning    Transfer
Cond.    Mean     σm          Mean    σm
WC-WC     8.80     .97        -       -
WC-W      8.00    1.18        1.20    .44
WC-C      9.20    1.14        2.50    .47
TC-TC    11.55     .95        -       -
TC-T     10.00    1.20        2.85    .36
TC-C     10.00    1.21         .05    .37


nections between letters as based on the
Underwood-Schulz (1960) tables. It should
also be noted that the trigrams have relatively
high formal similarity as indexed by repeated
letters. The purpose of this was to minimize
the possibility that a single letter (such as the
first letter) might become the functional
stimulus. The frequency of repeated letters in
the word list is about the same as for the
trigrams and the repetitions are in the same
positions. Both lists have the same initial
letters.

The color components were made of con-
struction paper and pasted on the vellum
tape. Rectangular frames of color com-
pletely surrounded the verbal unit, the width
of the frame being approximately } in. When
the color was the only component on the
transfer test the frame appeared exactly as it


paired with particular verbal units appear in
The response terms 


original learning was
S achieved one perfect
recitation of the list. The transfer tests were
carried for 10 trials with S instructed to give
as many correct responses as possible on the
The usual paired-associate 
given prior to original 
In addition, S was told that both 


intent of these instructions was to inform S 
of the nature of the stimulus compound with- 




[Figure 1: mean correct responses plotted against transfer trials 1 to 10; left-hand section TC groups, right-hand section WC groups.]

FIG. 1. Acquisition curves on the 10 transfer trials.


out, at the same time, biasing him toward 
using one or the other component. Prior
to the transfer trials S again was fully in- 
formed as to the nature of the stimulus which 
would be present on these trials. Approxi- 
mately 45 sec. elapsed between original 
learning and the first transfer trial. 

Each of the six groups contained 20 Ss.
Twenty blocks were made up such that each 
condition occurred once within each block, 
with the order of the six conditions within a 
block being randomly determined. The Ss 
were then assigned to the schedule in terms 
of their appearance at the laboratory. No S 
was lost for failure to learn. 
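
The block-randomized assignment described above can be sketched in a few lines. This is only an illustration under stated assumptions: the function name `build_schedule`, the `seed` argument, and the use of Python's `random` module are hypothetical, since the report does not say how the random orders were generated.

```python
import random

# The six conditions named in the Method section.
CONDITIONS = ["WC-WC", "WC-W", "WC-C", "TC-TC", "TC-T", "TC-C"]


def build_schedule(n_blocks=20, seed=None):
    """Return a flat assignment schedule.

    Each block contains every condition exactly once, in a random order.
    `seed` is an illustrative convenience for reproducibility only.
    """
    rng = random.Random(seed)
    schedule = []
    for _ in range(n_blocks):
        block = CONDITIONS[:]      # copy so the master list is untouched
        rng.shuffle(block)         # random order of the six conditions
        schedule.extend(block)
    return schedule


schedule = build_schedule(n_blocks=20, seed=0)
# The k-th S to appear at the laboratory would receive schedule[k].
print(len(schedule), schedule[:6])
```

With 20 blocks of six conditions the schedule holds 120 slots, 20 per condition, which matches the group size reported in the text.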


RESULTS 


The mean numbers of trials to 
attain the criterion on original learn- 
ing are shown in the left portion of 
Table 2. Differences among the three 
WC groups and among the three TC
groups represent random variation,
the F being less than 1 in each case.
For the 60 WC Ss the mean is
8.67 ± .63, and for the 60 TC Ss,
10.52 ± .65. The difference (1.85
± .90) gives a t of 2.06, which is just
past the 5% significance level.
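
The arithmetic behind this comparison can be checked with a short sketch. It assumes the two sets of 60 Ss are independent, so that the standard error of the difference is the root of the summed squared standard errors; the function name `t_from_means` is hypothetical. Working from the rounded values printed in the text gives a standard error of about .905 and a t of about 2.04, while dividing the difference by the reported standard error of .90 reproduces the reported 2.06. The small gap is plausibly due to rounding of the published figures.

```python
import math


def t_from_means(mean_a, se_a, mean_b, se_b):
    """Difference between two independent means, its standard error, and t."""
    diff = mean_a - mean_b
    se_diff = math.sqrt(se_a ** 2 + se_b ** 2)  # assumes independent groups
    return diff, se_diff, diff / se_diff


# Trials to criterion: TC Ss 10.52 (SE .65), WC Ss 8.67 (SE .63).
diff, se_diff, t_pooled = t_from_means(10.52, 0.65, 8.67, 0.63)

# Using the standard error of the difference as printed in the text (.90).
t_reported = diff / 0.90

print(round(diff, 2), round(se_diff, 3), round(t_pooled, 2), round(t_reported, 2))
```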

The mean performance on each
transfer trial is shown in Fig. 1. The 
left-hand section refers to the TC 


groups, the right-hand section to the 
WC groups. For the TC groups it can 
be seen that when the colors alone 
were used as stimuli, transfer was 
virtually complete; the performance 
of this group is only slightly below the 
control (TC-TC), thus indicating that 
the colors alone were completely 
effective functional stimuli. If only 
the color component was the func- 
tional stimulus, performance of Group 
TC-T should start at zero. It does 
not. For at least some Ss the trigrams
were also functional stimuli for at 
least a few responses. 

The right-hand section of Fig. 1 
shows that for the WC groups neither 
the words nor the colors developed 
complete effectiveness. The colors 
are less effective than the words, but 
the words alone show some loss as 
compared with the control. 

The clearest inferences concerning 
the functional stimuli in original 
learning can be made from the per- 
formance on the first transfer trial. 
Since Group TC-TC showed a larger 
criterion drop than did Group WC- 
WC, loss scores have been calculated. 
To do this, each S’s score on the first 




trial was subtracted from the mean 
score of the appropriate control group. 
These loss scores are shown in the 
right section of Table 2. A 2 × 2
analysis of variance was performed
on these scores, using as one classifica- 
tion variable TC and WC as identified 
in original learning, and as the other, 
colors and verbal units on transfer. 
Only the interaction F was significant, 
being 25.31; with 1 and 76 df, the F 
needed for the 1% significance level 
is approximately 7.00. Thus, the
predictions that for the TC lists the 
colors would become the functional 
stimuli and that for the WC lists the 
words would become the functional 
stimuli, are given some support. 


DISCUSSION


The transfer tests for the TC com-
pounds showed that color alone was a
completely effective functional stimulus.
This fact precludes an interpretation of
the functional stimulus as being a con-
figuration. However, it was noted that
the trigram stimuli were not completely
ineffective on the first transfer trial.
There are at least two possible inter-
pretations of this finding. First, it may
mean that some trigrams, quite in-
dependently of the color component,
become associated directly with the re-


The transfer data following learning 
of the WC lists raise three interpretative 
problems. First, it was noted that 
transfer was greater when the words 
became stimuli than when the colors 
became stimuli. Two circumstances
could lead to this finding. (a) Most Ss 
in original learning used words as stimuli 




for all associations but a few Ss used
colors as stimuli for all associations.
(b) All Ss used words as stimuli for most 
associations during original learning but 
used color as stimuli for a few associa- 
tions. Given a large number of Ss a 
choice between these two alternatives 
could be made by examining the distribu- 
tions of scores on the first transfer trial. 
If the first of the two possible explana- 
tions is appropriate, the distributions 
should be bimodal when words alone or 
when colors alone are stimuli on the first 
transfer trial. If the second alternative 
is appropriate, each distribution should 
be continuous. Actually, bimodality is
suggested in the present distributions but
with only 20 cases in each this may be
quite fortuitous.

The second interpretative problem is
the same as that posed for the TC com-
pounds where the data show that for
some associations for some Ss both the
trigram and the color elicited the re-
sponse. Such dual functionality may
also be deduced from the data for the
WC compounds. On the first transfer
trial a mean of approximately 5.0 correct
responses occurred when the words were
presented alone and 3.8 when the colors
were presented alone. These two values
sum to 8.8, which is appreciably higher
than the mean of 6.3 shown by the
control Ss on the first transfer trial.
Clearly, dual functionality of the two
components obtained for at least some
Ss. However, just as in the case of the
TC compounds, this apparent duality
may result from direct associations be-
tween the components and the response
term or it may result from mediation
between the stimulus components.
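
The reasoning behind the dual-functionality inference can be made explicit with a small worked calculation. It is only a rough sketch: the three means come from different groups of Ss and are approximate values taken from the text, and the variable names are hypothetical. The idea is that if each association had exactly one functional component, the two single-component means could not sum to more than the control mean.

```python
# Mean correct responses on the first transfer trial (approximate, from the text).
words_alone = 5.0    # Group WC-W
colors_alone = 3.8   # Group WC-C
control = 6.3        # Group WC-WC (compound still present)

# If every association were cued by one component only, the two
# single-component means together could not exceed the control mean.
summed = words_alone + colors_alone
excess = summed - control  # items per S that appear to be cued by both components

print(round(summed, 1), round(excess, 1))
```

On these figures the excess is about 2.5 items per S, which is the sense in which the sum is "appreciably higher" than the control mean.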

The third interpretative problem pre- 
sented by the results of the WC com- 
pounds is the fact that the words showed 
greater transfer than did the colors. As 
stated in the Procedure section, it was 
believed that the words would be more 
meaningful than the colors. We have no 
independent evidence for this and it may 
not be valid. The Ss, being much more
practiced in dealing with word stimuli
than with patches of color as stimuli,
may be biased toward the selection of the




verbal stimuli. The data are quite in 
harmony with such a notion. However, 
a somewhat different approach may be 
taken to the problem. An empirical test 
can be made to determine which stimulus 
compound leads to most rapid learning 
when this learning is not preceded by 
learning in which the compound is 
present. To determine this, three new
groups of 15 Ss each were run. One 
group learned the trigram-number pairs, 
a second the word-number pairs, and the 
third the color-number pairs. The mean 
total correct responses in 10 trials were 
38.80 ± 3.61, 51.20 ± 3.52, and 50.40
± 2.19, respectively. While the F is
significant far beyond the 1% level it is 
clear that most of the variance is pro- 
duced by the trigram-number pairs. The 
words and colors do not differ appreciably 
in their effectiveness as stimuli. The
small difference in favor of the words 
occurred primarily on the first three 
trials. Thus, it seems quite reasonable 
to conclude that when S is given a com- 
pound consisting of common words and 
colors, he is likely to select the words as 
functional stimuli, not necessarily be- 
cause they are more meaningful, but 
because he is more accustomed to dealing 
with such stimuli. 

Finally, it may be stated that the re- 
sults of the present experiment, taken in 
conjunction with the study by Sundland 
and Wickens (1962), would seem to 
indicate that it may be more fruitful to 
view the so-called context experiments as 
experiments investigating the variables 
determining cue selection. 


SUMMARY 


This experiment was based on the assump-
tion that when S is presented a compound


stimulus in a verbal-learning experiment, cue 
selection may occur. Word-color or trigram- 
color compound stimuli were used in learning 
original paired-associate lists with numbers as 
responses, followed by a transfer test in 
which one or the other components alone was 
presented as the stimulus. Control groups 
were also used, these groups being given 
further trials with the original compound 
stimuli. 

The results show: 

1. For the trigram-color compounds, color 
was a completely effective stimulus on the 
transfer test. The trigrams, however, also 
produced a small positive transfer effect. 
The selection of the color component as the 
primary functional stimulus was assumed to 
be due to its higher meaningfulness.

2. For the word-color compounds, transfer 
was higher when the words appeared alone 
than when the colors appeared alone. This 
may be due to a bias Ss have toward dealing 
with verbal material (as compared with the 
color patches used) rather than to higher 
meaningfulness of the words. 

It was concluded that experiments dealing 
with the effects of context changes on reten- 
tion may be viewed as representing cases of 
cue selection. 


REFERENCES 


Sundland, D. M., & Wickens, D. D. Con-
text factors in paired-associate learning and
recall. J. exp. Psychol., 1962, 63, 302-306.

Thorndike, E. L., & Lorge, I. The teacher's
word book of 30,000 words. New York:
Teachers College, Columbia University,
1944.

Underwood, B. J., & Schulz, R. W. Mean-
ingfulness and verbal learning. Chicago:
Lippincott, 1960.

Weiss, W., & Margolius, G. The effect of
context stimuli on learning and retention.
J. exp. Psychol., 1954, 48, 318-322.


(Received September 15, 1961) 


Journal of Experimental Psychology
1962, Vol. 64, No. 4, 410-414


CONSUMMATORY AND INSTRUMENTAL RESPONDING
AS FUNCTIONS OF DEPRIVATION ¹


GEORGE COLLIER


University of Missouri 


The present study investigates the
conjoint effect of deprivation on in-
strumental and consummatory re-
sponses.


METHOD 


Apparatus.—Eight Skinner boxes deliver- 
ing liquid reinforcement were used (Collier & 
Myers, 1961). Electronic switches sensed 
each lick made in the magazine cup. Bar 
presses (BP), reinforcements, and licks were 
recorded by an Esterline-Angus recorder. 

Subjects.—The 12 Ss were naive rats, 120 
days old, of the Sprague-Dawley strain 
(Holtzman Company) maintained on Purina 
lab chow and tap water. 

Procedure.—Two grou 
formed. The first (H 
running. The secon 


H 


[Figure 1: upper panel, total daily bar presses; lower panel, body weight of Ss; both plotted against days.]

FIG. 1. Total daily bar presses and weight
of Ss as a function of deprivation.
1 This investigation was supported in part
by Research Grant M-3328, from the Na-
tional Institute of Mental Health, Bethesda,
Maryland.




10 gm. per day. These changes were required
to keep Ss’ weights relatively constant.

Each S received 5 days of table training,
handling, and deprivation accommodation
followed by 4 days of magazine training,
10 days of reinforced BP on a 1-min. fixed
interval (FI), and 3 days of extinction. Dur-
ing reinforced bar pressing all Ss were given
.3 ml. of 16% sucrose solution per reinforce-
ment. Thus, since by the fifth day of bar
pressing all reinforcements possible were
presented and all those presented were taken,
each S received 15.6 ml. of solution per day
in the 52-min. test period.

All Ss were weighed every day immediately
before running.


RESULTS 


Reinforced bar pressing.—The lower
panel of Fig. 1 presents the weights of 
the two deprivation groups over the 
course of the experiment. Group H 
ranged from approximately 86% of 
the body weight of Group L at the 
beginning of the study to approxi- 
mately 80% at the end. The weight 
of Group L increased significantly 
across sessions. The upper panel of 
Fig. 1 presents the total number of 
BP in 52 min. for each of the depriva- 
tion groups over the course of the 
experiment. It is clear that Group H 
was superior to Group L under all
circumstances.

An analysis of variance of the num-
ber of BP and of licks on Day 17
shows that for bar pressing the main
effects of Deprivation (F = 22.6,
df = 1/10, P < .01) and of Minutes
(F = 243, df = 51/510, P < .01)
were significant, while for licks the
main effect of Minutes only (F = 1.46,
df = 51/510, P < .05) was significant.
For neither measure did the inter-
action of Deprivation × Minutes

approach significance. The total
numbers of BP per session were 439.7
and 174.0 for Groups H and L, re-
spectively, while the total numbers of
licks were 2763 and 2400, respectively.

[Figure 2: panels plot time (seconds), licks, and licks/sec. against minutes of the session.]

FIG. 2. Latency, duration, rate, and burst rate of licking, and bar pressing
as functions of deprivation and satiation.

In order to assess the locus of the
rate differences and changes observed,
the pattern of responding in each
minute of the FI schedule, i.e., be-
tween reinforcements, was examined.
Figure 2 (top left) presents a sche-
matic of 1 min. (between reinforce-
ments) of a typical tape from the
Esterline-Angus records: t1 is the time
from the reinforcement to the first
lick, t2 is the duration of licking, t3 is
the time from the last lick to the first
BP, t4 the time from the first BP

[Figure 3: cumulative bar presses and licks plotted against minutes of extinction.]

FIG. 3. Cumulative number of bar presses
and licks over the course of extinction as a
function of deprivation and satiation.


to the next reinforcement; f1 is the
number of licks, f2 the number of BP.
The average rate of licking (f1/t2) and
of BP (f2/t4) as well as the burst rates
were also determined. A burst was
defined as

licks or BP


a BP burst, an additional measure, the
reciprocal of the time between the last
two responses before reinforcement
(B-rate2), was calculated as a check.
Figure 2 presents the values averaged
over 5-min. blocks. Analyses of vari-
ance on each of the measures were
performed for deprivation and min-
utes of the session.
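
The within-interval measures defined above can be illustrated with a short sketch. It is not the author's scoring procedure, which was read from Esterline-Angus tapes; the function name `interval_measures`, the dictionary keys, and the example timestamps are hypothetical. The sketch assumes that all licks precede the first bar press, that there are at least two licks and at least one bar press in the interval, and it leaves out the burst definition because that passage is not recoverable here.

```python
def interval_measures(lick_times, bp_times, interval_end=60.0):
    """Score one fixed-interval period.

    All times are in seconds from the reinforcement that opens the interval;
    `interval_end` is the time of the next reinforcement.
    """
    t1 = lick_times[0]                    # reinforcement to first lick
    t2 = lick_times[-1] - lick_times[0]   # duration of licking
    t3 = bp_times[0] - lick_times[-1]     # last lick to first bar press
    t4 = interval_end - bp_times[0]       # first bar press to next reinforcement
    f1 = len(lick_times)                  # number of licks
    f2 = len(bp_times)                    # number of bar presses
    measures = {
        "t1": t1, "t2": t2, "t3": t3, "t4": t4, "f1": f1, "f2": f2,
        "lick_rate": f1 / t2,             # average rate of licking, f1/t2
        "bp_rate": f2 / t4,               # average rate of BP, f2/t4
    }
    if f2 >= 2:
        # Reciprocal of the time between the last two bar presses.
        measures["b_rate2"] = 1.0 / (bp_times[-1] - bp_times[-2])
    return measures


# Made-up timestamps for a single 60-sec. interval.
m = interval_measures(
    lick_times=[2.0, 4.0, 6.0, 8.0, 10.0, 12.0],
    bp_times=[30.0, 40.0, 50.0, 55.0],
)
print(m)
```

In this example the four time segments sum to the 60-sec. interval, which is a convenient consistency check on any scored record.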

Summarizing these results, no sig- 
nificant effect of Deprivation on the 
average (H = 6.0/sec, L = 4.8/sec) or 

momentary (H = 8.8/sec, L = 8.4/
sec) Rate of Licking was found; how-
ever, there was a suggestion that there


rate (F = 4.35,
df = 1/10, P < .10). The i




tended to increase significantly to- 
ward the end of the session. The 
average duration (10.1 sec.) of licking
was unaffected by the course of
satiation. That some licking beyond
that required occurred is indicated by
the fact that the average volume per
lick was .0056 ml. per lick and .0065
ml. per lick for Groups H and L,
respectively, where the typical value
in the unrestricted access situation is
usually of the order .0080 ml. per lick.
The average rate declined significantly
over a session while the burst rate
appeared to increase. 

The latency of the first BP following
reinforcement (t3) was significantly
shorter in Group H (15 vs. 26 sec.)
and increased significantly over the
session for both groups. The duration
(t4) of BP was, therefore, necessarily
significantly longer (35 vs. 23 sec.) in
Group H and significantly decreased
over the session for both groups. For
neither measure did the Deprivation
× Minutes interaction approach sig-
nificance, suggesting that the rate of
decline in responding was independent
of the degree of deprivation. No
evidence was found for a difference
between within-cluster (burst) rates
on either measure (B-rate1: H = .29/
sec, L = .27/sec; B-rate2: H = .27/
sec, L = .24/sec). However, both the
number (f2) and the average rate
(f2/t4) were significantly greater for
Group H (8.4 vs. 3.5 and .24/sec vs.
.17/sec, respectively). The average
rate did not decline over the session
while the frequency of responding did.
The difference between the average
rate of responding and the momentary
rate reflects either the burst duration,
interburst interval, or both. The
decline in frequency across the session
appears to be solely due to the
decreasing time available for BP be-
cause of the increasing latency of the
first BP.




Extinction.—The cumulative form 
of the extinction curve is given in Fig. 
3. Both the total number of BP and 
total number of licks differed sig- 
nificantly as a function of deprivation. 
The rate of decline in bar pressing was 
significantly faster for Group L as 
was the interaction of minutes and 
deprivation. No analysis of the effect 
of time and its interaction with dep- 
rivation was performed for licks 
since Group L stopped licking within 
the first few minutes. 

Two things were clear in the ex- 
tinction data. First, the rate at which 
consummatory responding as well 
as bar pressing declined depended 
strongly on deprivation, stopping with 
a few licks of the empty magazine at 
low deprivations. Second, bar press- 
ing fell off at a much slower rate than 
licking, substantial amounts of bar 
pressing occurring after the licking 
response to the magazine had stopped. 


DISCUSSION


When the distribution of responses 
between reinforcements is examined it is 
clear that the momentary rate of re- 
sponding for both the consummatory and 
instrumental response is relatively in- 
sensitive to either deprivation or the 
number of reinforcements consumed. 
The difference in average rate as a 
function of deprivation is the result of 
differences in the duration, or interval 
between bursts, or both, and the differ- 
ences in overall rate of BP as a function 
of deprivation result from both the 
latency of the first response following 
reinforcement and the differences in 
average rate. The decline in the overall
rate of BP as a function of the number of
reinforcements consumed appears to be
solely a function of the latency of the
first response following reinforcement
and the rate of decline is independent of
the degree of deprivation. This distribu-
tion of responses has been reported
previously for the consummatory re-
sponse when deprivation and amount
consumed are examined (e.g., Davis & 
Keehn, 1959; Stellar & Hill, 1952) and 
for the instrumental response on a fixed- 
ratio schedule (Sidman & Stebbins, 
1954). The relatively small difference 
between average rates of licking, the 
fixed duration of licking, and the ap- 
parent lack of difference in the latency of 
the first lick probably result from the 
schedule used in which a fixed, small 
volume is presented which can be con- 
sumed in a single or a few brief bursts of 
licking. That is, the duration required 
is within the range of durations of bursts 
of licking. The failure of the latency 
of the first lick to vary as a function of 
the amount consumed (see Stellar & 
Hill, 1952) is probably the result of the 
fact that the consummatory response in 
the Skinner box situation is under the 
control of a precise discriminative stim- 
ulus, namely the operation of the 
magazine. 

Ingestion ceases before nutrition has 
been accomplished. This fact suggests 
that satiation and deprivation are two 
distinct, independent processes (Collier 
& Myers, 1961). The results of the 
present study provide further evidence 
for this distinction. They suggest that 
satiation, which varies as a function 
of the momentary postingestive load, 
evinces its major effect on the latency 
of the response measured, while depriva- 
tion affects both the latency and burst 
density. The present data further sug- 
gest the possibility that the parameters 
of the consummatory response are in- 
sensitive to such variables as deprivation 
and satiation, and thus it is the ancillary 
food approaching and food producing 
responses which determine the rate, pat- 
tern, and amount of consumption. 


SUMMARY 


Twelve albino rats were divided into two 
deprivation groups, high and low, and were
run in Skinner boxes for a .3-ml., 16% sucrose 
reinforcement. Analysis of the grain of the 
consummatory (licks) and instrumental (bar 
press) responses led to the conclusion that 
for the consummatory response neither the 




latency, duration, average, nor momentary
rate of responding varied significantly as a
function of deprivation. Latency showed a
slight tendency to decrease for high depriva-
tion and increase for low deprivation, average
rate to decrease, and momentary rate to
increase across a session. For the BP re-
sponse, on the other hand, the latency and
therefore necessarily the duration of pressing
i deprivation and
The average rate proved to be
related to deprivation only, while two meas- 
ures of the momentary rate appeared related 
to neither deprivation nor satiation. In 
extinction the rate of occurrence of licking 


showed a much more rapid decline than did 
rate of BP. 




REFERENCES 


Collier, G., & Myers, L. The loci of rein-
forcement. J. exp. Psychol., 1961, 61,
57-66.

Davis, J. D., & Keehn, J. D. Magnitude of
reinforcement and consummatory behavior.
Science, 1959, 130, 269-271.

Sidman, M., & Stebbins, W. C. Satiation
effects under fixed-ratio schedules of
reinforcement. J. comp. physiol. Psychol.,
1954, 47, 114-116.

Stellar, E., & Hill, J. H. The rat's rate of
drinking as a function of water deprivation.
J. comp. physiol. Psychol., 1952, 45, 96-102.


(Received October 1, 1961) 




Journal of Experimental Psychology
1962, Vol. 64, No. 4, 415-420


DEPTH PERCEPTION IN ROTATING DOT PATTERNS: 
EFFECTS OF NUMEROSITY AND PERSPECTIVE * 


MYRON L. BRAUNSTEIN* 


University of Michigan 


The changing shape of a visual pat- 
tern may elicit a three-dimensional 
perception, even when other cues to 
depth are so reduced that any station- 
ary view of the pattern appears two- 
dimensional. Stimuli for the study of 
this effect have generally been pro- 
duced by rotating an object or pattern 
between a light source and a trans- 
lucent screen, thus displaying shadows 
of the transforming pattern. This 
method was employed by Miles 
(1931), who used a two-bladed electric 
fan as a shadow caster, and by 
Metzger (1934), who used a number 
of cylinders to generate his displays. 

The effect was systematically in- 
vestigated by Wallach and O'Connell 
(1953), who termed it the “kinetic 
depth effect,” and by Gibson and 
Gibson (1957). While earlier investi- 
gators regarded the depth effects 
created by such displays as illusions, 
recent experimenters have recognized 
that they involve a cue to depth not 
fully incorporated into the classical 
set of depth cues. Gibson (1950) has 
aptly termed this cue “motion per- 
spective.” 

Green (1959a, 1959b) introduced a 
method which allows greater variety 


1 This paper is based on a dissertation
submitted to the Department of Psychology
at the University of Michigan in partial
fulfillment of the requirements for the PhD
degree. The author gratefully acknowledges
the generous assistance of B. F. Green, Jr.
of the MIT Lincoln Laboratory in the
preparation of the stimuli, and the valuable
guidance of W. L. Hays in the planning and
performance of this research.

* Now at Cornell Aeronautical Laboratory, 
Incorporated, Buffalo, N. Y. 


in the presentation of stimuli and 
greater control of the motion of parts 
of a display than the shadow projec- 
tion method. By means of instruc- 
tions to a high speed digital computer 
equipped with a CRT output recorder, 
a motion picture can be made of a 
two-dimensional projection of any 
mathematically specifiable transfor- 
mation of any figure. Green has 
studied the effects of numerosity, 
speed, and axis of rotation on subjec- 
tive judgments of the extent to which 
the parts of the display maintain the 
same relative position, i.e. coherence, 
for filmed sequences representing pro- 
jections of points or of line segments 
rotating in three-dimensional space. 

Definition of perspective—The per- 
spective used in producing a two-di- 
mensional projection of a three-dimen- 
sional display may be conveniently 
defined as the ratio of the distance 
between the projection point and the 
most distant X-Y (frontal parallel) 
plane to the distance between the 
projection point and the closest X-Y 
plane, along the Z axis (line of sight). 
This is equivalent to the ratio of the 
projection of a distance on the closest 
X-Y plane to the projection of the 
same distance on the most distant 
X-Y plane. Figure 1 illustrates these
definitions of perspective for a four-
point pattern confined to an imag-
inary cube.

If the projection point is at infinity 
and thus equally distant from all 
elements of the pattern, the perspec- 
tive ratio is 1, and the projection of 
the pattern does not change with its 
location in depth. This method of 




Fig. 1. Projection of points in a cube.
(Perspective is defined as a/b, or equivalently,
c/d.)


projection is referred to as parallel
projection. If the ratio is greater than
1, the projection of the pattern varies
with its location in depth, and the
projection is referred to as polar.
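
The ratio defined above can be checked numerically. The following is a minimal sketch, not the author's program: the function name and the coordinate convention are assumptions. It places the projection point on the positive Z axis and takes the nearest and farthest frontal planes of a cube of side 2 centered on the origin (Z = +1 and Z = -1), with the projection-point coordinates reported in the Method section.

```python
def perspective_ratio(projection_z, near_z, far_z):
    """Distance from the projection point to the farthest X-Y plane divided by
    its distance to the nearest X-Y plane, both measured along the Z axis.
    (Hypothetical helper; the coordinate convention is an assumption.)"""
    near_distance = projection_z - near_z
    far_distance = projection_z - far_z
    if near_distance <= 0:
        raise ValueError("projection point must lie beyond the nearest plane")
    return far_distance / near_distance


if __name__ == "__main__":
    # Cube of side 2 centered on the origin: nearest plane Z = +1, farthest Z = -1.
    for z in (2, 4, 16, 512):
        print(f"{z:>3}: {perspective_ratio(z, near_z=1.0, far_z=-1.0):.2f}")
    # prints 3.00, 1.67, 1.13, 1.00
```

As the projection point recedes, the ratio approaches 1, which is the parallel case described in the text.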
Motion perspective—Motion per-
spective is a complex cue to depth
involving several better known
aspects of perception. If a polar
projection of a three-dimensional pat-
tern rotating about an axis perpen-
dicular to the line of sight is displayed,
the velocity of the projection
of a part of the pattern is an inverse
function of its distance from the
projection point. This is essentially
motion parallax, in the classical sense.
Also, in the case of a polar projection,
the distance between the projections
of two parts of a pattern is an inverse
function of the distance of those parts
of the pattern from the projection
point. This cue is similar to linear
perspective, except that it involves
temporal as well as spatial variations
in the display.

With both polar and parallel projec-
tions, changes in the relative distances
between the projections of parts of the
pattern accompany changes in
orientation of the pattern from a fixed
projection point. In the case of
parallel projections, this effect is not
confounded with motion parallax and
linear perspective. Although only
very distant objects are seen in


MYRON L. BRAUNSTEIN


parallel projection in direct vision, the
present study compares polar and
parallel projections in order to sepa-
rate the more familiar depth cues from
the novel aspects of motion per-
spective.

As the projection point is equivalent
to the nodal point of the observer's
eye in retinal projections, variations
in perspective are normally accom-
panied by changes in the visual angle
subtended by the pattern. In the
present study the size of the projection
was controlled by placing O's eye at a
fixed intermediate distance from the
computed location of the three-dimen-
sional pattern, and not at the com-
puted projection points. Perspective
is thus “inappropriate” to O's distance
from the computed location of the
pattern. It is exaggerated by closer
projection points and reduced by more
distant ones.


METHOD 


Stimuli.—The stimuli were produced by
Green’s (1959a) computer method. Specifi-
cally, from two to six points in a cube having
sides of two units, centered about the origin,
were randomly selected. A projection of the
points onto a specified plane perpendicular to
the Z axis was computed, using a projection
point having a Z coordinate of 2, 4, 16, or 512,
and X and Y coordinates of 0. The projection
plane was always equidistant from the 
projection point and the origin so that a 
distance d on the plane Z = 0 was projected 
into a distance d/2, regardless of the distance 
of the projection point from the origin. 
Projections were computed at each 4.5° as 
the hypothetical cube was rotated 360° about 
the Y axis. There were thus 80 projections 
for each display. Each projection was plotted 
on the face of a CRT using an overlapping 
cluster of four spots to represent a point. 
These clusters, hereafter referred to as spots, 
were roughly circular in shape and were
approximately .01 unit in diameter. The
plots were photographed on 16-mm. film and
each sequence of 80 photographs became a 
stimulus display in the final film. 
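
The computation described in this paragraph can be sketched as follows. This is an illustrative reconstruction under stated assumptions, not Green's program: the function names are hypothetical, the random points are assumed to be drawn uniformly within the cube, and the direction of rotation about the Y axis is chosen arbitrarily. The projection plane is placed at half the Z coordinate of the projection point, so that a distance on the plane Z = 0 is halved, as the text states.

```python
import math
import random


def rotate_about_y(point, angle_deg):
    """Rotate a 3-D point about the Y axis (direction of rotation is an assumption)."""
    x, y, z = point
    a = math.radians(angle_deg)
    return (x * math.cos(a) + z * math.sin(a), y, -x * math.sin(a) + z * math.cos(a))


def project(point, projection_z):
    """Central projection from (0, 0, projection_z) onto the plane Z = projection_z / 2."""
    x, y, z = point
    scale = (projection_z / 2) / (projection_z - z)
    return (x * scale, y * scale)


def make_display(n_points, projection_z, step_deg=4.5, rng=None):
    """Return one display: a list of frames, each a list of projected (x, y) spots."""
    rng = rng or random.Random()
    # Assumption: points are uniform within the cube of side 2 centered on the origin.
    points = [tuple(rng.uniform(-1.0, 1.0) for _ in range(3)) for _ in range(n_points)]
    n_frames = round(360 / step_deg)  # 80 frames for 4.5-degree steps
    return [
        [project(rotate_about_y(p, i * step_deg), projection_z) for p in points]
        for i in range(n_frames)
    ]


if __name__ == "__main__":
    display = make_display(n_points=4, projection_z=16, rng=random.Random(0))
    print(len(display), "frames of", len(display[0]), "spots")
```

Because the plane moves with the projection point, the average size of the projected pattern stays roughly constant across the four perspective levels; only the amount of perspective change within a rotation differs.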

Forty such displays were prepared, repre-
senting 20 treatment combinations (5 levels
of numerosity and 4 of perspective), each


DEPTH PERCEPTION IN ROTATING DOT PATTERNS 


produced twice, with different selections of 
random points. The stimulus films consisted 
of pairs of these displays. Each pair was 
composed of an 80-frame display, 28 frames 
of blank film of the same optical density as the 
background of the displays, a second 80-frame 
display, and 108 frames of blank film, again 
of the same density as the background of the 
displays. Presented at 16 frames per sec., 
this would mean a 5-sec. display, a 1.75-sec. 
pause, another 5-sec. display, and a 6.75-sec. 
pause, followed immediately by the first
display in the next pair. 
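
The durations quoted above follow directly from the frame counts. A small sketch of the arithmetic, with hypothetical names, at the nominal 16 frames per sec.:

```python
NOMINAL_FPS = 16

# Frame counts for one stimulus pair, as given in the text.
PAIR_SEGMENTS = [
    ("first display", 80),
    ("blank", 28),
    ("second display", 80),
    ("blank", 108),
]


def segment_durations(segments, fps):
    """Convert frame counts to durations in seconds at the given projection speed."""
    return [(label, frames / fps) for label, frames in segments]


if __name__ == "__main__":
    for label, seconds in segment_durations(PAIR_SEGMENTS, NOMINAL_FPS):
        print(f"{label}: {seconds:.2f} sec")
    total = sum(frames for _, frames in PAIR_SEGMENTS) / NOMINAL_FPS
    print(f"one pair: {total:.2f} sec")
```

At this speed each pair occupies 296 frames, or 18.5 sec.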

Apparatus.—The films were projected by a
Kodak Royal 16-mm. projector in which a
500-w. bulb was used. The lens had a focal
length of 2.5 in. The displays were projected 
onto a translucent plastic screen, 1744 in. from 
the front of the lens. One unit on the theo- 
retical projection plane became 3.76 in. on the 
screen. The O viewed the displays from the 
opposite side of the screen. Background and 
spot luminances were approximately .009 and 
1.1 ft-L, respectively. The projector operated 
at a speed of 16.2 ± .1 frames per sec. The E
operated the projector and recorded O's
responses, which were transmitted to a tape
recorder in E’s cubicle. 

The O viewed the displays monocularly 
through a reduction tunnel. This consisted 
in part of a cylinder 41 in. in length and 2} in. 
in diameter, against which O placed his eye. 
The cylinder was covered on the outside with 
black tape and lined with black paper on the 
inside. It was inserted into a circular aper- 
ture 2 in, in diameter in a wooden disk 15 in. 
in diameter and 14 in. thick. The length of 
the cylinder inserted into the aperture was 
} in., leaving 44 in. protruding on the side on 
which O viewed the displays. Black cloth, 
formed roughly into a cylinder, covered the 
space between the disk and the aperture in the 
wall in front of O, which was also 15 in. in 
diameter. The distance from the front of the
disk to the wall was 21} in. The O's eye was
thus approximately 26 in. from the screen, or
13.8 theoretical units from the origin of the 
display. The O could see neither the borders 
of the aperture in the wall as he looked into 
the reduction tunnel, nor the location of the 
screen as he entered the room. The Os were 
31 male students enrolled in the elementary 
psychology course; 20 served in Exp. I and 11
in Exp. II.
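
The conversion of O's viewing distance from inches to theoretical units can be reproduced under one reading of the figures above. The sketch assumes (this is an interpretation, not something the text states outright) that a unit at the origin of the display is meant, so that the 3.76 in. per unit on the projection plane is halved by the d/2 scaling described under Stimuli. The names are hypothetical.

```python
INCHES_PER_PLANE_UNIT = 3.76   # one unit on the theoretical projection plane, on the screen
PLANE_SCALE_AT_ORIGIN = 0.5    # a distance d on the plane Z = 0 projects to d/2


def inches_to_origin_units(inches):
    """Express a screen distance in units measured at the origin of the display.
    Assumes the halving of distances at Z = 0 applies to the conversion."""
    return inches / (INCHES_PER_PLANE_UNIT * PLANE_SCALE_AT_ORIGIN)


if __name__ == "__main__":
    print(round(inches_to_origin_units(26), 1))  # 13.8
```

Under this reading, 26 in. corresponds to about 13.8 units, which agrees with the value reported in the text.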

Procedure.—Each O was tested individually 
in a 2-hr. session. In Exp. I, O was told that 
he would see pairs of displays, either of which 
could be seen as occurring in space or on a
flat surface in front of him, and was to in-
dicate, by saying “first” or “second,” which




display in each pair gave the stronger im- 
pression of occurring in space. After a 
practice reel, O was shown four reels contain- 
ing the 190 pairs of the present experiment, 
as well as 34 additional pairs not included in 
this report. 

The procedure in Exp. II was identical to 
that used in Exp. I, except for a change in the 
critical portions of the instructions. For each 
pair of displays, O was asked to decide which 
display showed the greater coherence, with 
coherence defined as “the degree to which the 
parts of the display seem to maintain the 


same relative positions as the display moves." 
No mention was made of the possibility of 
seeing the displays either as two-dimensional 
or three-dimensional. 


RESULTS 


Each treatment combination was 
represented in 19 paired comparisons. 
For each of the 20 Os, the number of 
times a stimulus representing each 
treatment combination was chosen³
was calculated. The method em- 
ployed to analyze these frequencies 
is a direct extension to three factors 
(numerosity, perspective, and Os) of 
the median test described by Mood 
(1950) for “two factor experiments, 
one observation per cell” (pp. 399- 
402). 
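
The counts behind the paired-comparison design can be verified with a short sketch. It only enumerates the design and tallies choices; it does not implement the median test. The names are hypothetical, and the perspective labels are the four levels listed in Table 1.

```python
from collections import Counter
from itertools import combinations, product

NUMEROSITY = (2, 3, 4, 5, 6)            # number of points
PERSPECTIVE = (1.00, 1.13, 1.67, 3.00)  # levels as labeled in Table 1

treatments = list(product(NUMEROSITY, PERSPECTIVE))  # 20 treatment combinations
pairs = list(combinations(treatments, 2))            # 190 paired comparisons

# Each treatment combination takes part in 19 of the 190 pairs.
appearances = Counter(t for pair in pairs for t in pair)


def choice_frequencies(chosen):
    """Tally, for one observer, how often each treatment combination was chosen.
    `chosen` holds the treatment picked on each of the 190 pairs."""
    counts = Counter(chosen)
    return {t: counts.get(t, 0) for t in treatments}


if __name__ == "__main__":
    print(len(treatments), len(pairs), set(appearances.values()))
```

Each observer's frequencies therefore range from 0 to 19 per treatment combination and sum to 190.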

In Exp. I significant effects
(P < .05) were found for numerosity
(χ² = 125.96, df = 4), perspective
(χ² = 21.90, df = 3), and the inter-
action of the two treatment dimen-
sions (χ² = 108.30, df = 19). The
proportion of trials on which stimuli 
representing each of the treatment 
combinations were chosen by all Os 
is given in Table 1. A greater effect of 
perspective is indicated for the stimuli 
having smaller numbers of points. 
The relationship between the proportion
of times stimuli representing each
level

relationship for levels of perspective.

The frequencies of trials on which
stimuli representing each treatment
combination
the 11 Os in Exp. II

numerosity (χ² = 25.59, df = 4), per-
spective (χ² , =
their interaction (χ² = 63.22, df = 19).


³ “Chosen” is used to mean “chosen as
giving a stronger impression of occurring in
space than the stimulus with which it was
paired” when reference is made to Exp. I,
and “chosen as appearing more coherent than
the stimulus with which it was paired” when
reference is made to Exp. II.


TABLE 1

PROPORTION OF TRIALS ON WHICH STIMULI
WERE CHOSEN AS GIVING A STRONGER
DEPTH IMPRESSION: EXP. I

                     Number of Points
Perspective     2      3      4      5      6     Mean
1.00          .232   .329   .445   .524   .561   .418
1.13          .245   .508   .547   .537   .632   .494
1.67          .358   .376   .621   .634   .589   .516
3.00          .447   .524   .589   .608   .695   .573


TABLE 2

PROPORTION OF TRIALS ON WHICH STIMULI
WERE CHOSEN AS APPEARING
MORE COHERENT: EXP. II

Perspective
1.00          .541   .606
1.13          .603   .585
1.67          .483   .506
3.00          .201   .303

Fig. 2. Effect of numerosity on depth
and coherence judgments. (Ordinate: proportion
of times chosen; abscissa: number of points.)


ables, depth judgments and coherence


are primarily ordered on perspective
and secondly on numerosity.


Fig. 3. Effect of perspective on depth
and coherence judgments. (Abscissa: levels of
perspective, from parallel to polar.)


The proportion of trials on which a
stimulus was judged to give a stronger
impression of occurring in space than the
stimulus with which it was paired was
an approximately linear
function of the logarithm of the number
of points in the stimulus. This supports
the hypothesis of a direct relationship
between number of points and judged 
strength of the depth impression. Al- 
though the logarithmic relationship was 
not specifically predicted, it is the same 
as that obtained by Green (1959a) using 
larger numbers of spots and ratings of 
subjective coherence. 

Perspective significantly affected the 
choice of stimuli giving stronger depth 
impressions, although it appeared to be 
secondary to number of points in its 
effect on the ordering of the stimuli. 
This would indicate that the interrelated 
factors of motion parallax and linear 
perspective do affect the depth impres- 
sion created by these displays. 

The question arises as to whether 
number of points may be important only 
because increasing numbers of points 
provide additional opportunities to view 
the effects of perspective. This is contra- 
indicated by the findings that larger 
differences between levels of perspective 
occurred for smaller numbers of points 

and that the relationship between num- 
ber of points and proportion of choice 
holds even for parallel perspective. It 
would appear instead that perspective 
becomes more important as the cues to 
change in orientation of the pattern are 
reduced, i.e., with smaller numbers of 
points. 
Some insight into the role of subjective
rigidity or coherence of the patterns
in influencing perceived depth may be
gained from the results of the second
experiment, which showed different ef-
fects of the independent variables when
coherence judgments were used as the
dependent variable. Displays of two
spots were judged most coherent. This
would be expected, as two spots moving
on a plane could always represent the
projections of two points at a constant
distance in three dimensions. The
relationship between numerosity and
coherence judgments for larger numbers
of spots was not very pronounced.

A much greater effect on coherence
judgments was produced by perspective.
Stimuli produced with closer viewing
points were judged less coherent than
those with more distant viewing points.



There was thus an inverse relationship 
between depth judgments and coherence 
judgments across levels of perspective. 
This would seem to result from a com- 
bination of two factors. First, as all 
stimuli were not perceived as three- 
dimensional at all times, the changes in 
distances between the two-dimensional 
projections of the points may have 
influenced average coherence judgments, 
and more variability in the two-dimen- 
sional projections did occur with greater 
perspective. The other factor is the 
distortion resulting from the magnifica-
tion and demagnification of the projec-
tions used to maintain a uniform “aver-
age” size. This would especially affect
the stimuli with maximum perspective, 
which differed furthest from appropriate 
perspective, and the difference between 
depth and coherence judgments is great- 
est in this case. 

The use of perspective has been an area 
of disagreement between J. J. Gibson 
and H. Wallach, the two major experi- 
menters in this area. Gibson (1957) 
uses what he properly regards as appro- 
priate perspective in his displays. A 
translucent screen is placed equidistant 
between O and a point light source and 
shadows of objects rotating between the 
light source and screen are observed. 
This is equivalent to displaying projec- 
tions computed with the projection point 
at the position at which O will place 
his eye when he observes the stimuli. 
Wallach (1953), on the other hand, 
uses a distant projection point which is 
further from the screen than O's eye. 
The resulting situation is similar to the 
use of parallel perspective in the present
research. The present results would indicate
that Gibson's method would in
general lead to a stronger impression of
depth than does Wallach’s method. The
“kinetic depth effect,” as formulated by
Wallach, does however require parallel
projections of the three-dimensional dis-
plays for its isolated study.

The choice of perspective should then 
be made according to the purpose of 
the experimenter in these respects. The 




present results indicate that the im- 
portance of variations in perspective will 
depend upon the type of judgments used. 
When the effects of variation in per- 
spective are of interest, at least for 
displays involving small numbers of 
elements, depth judgments and rigidity 
judgments cannot be assumed equiv- 
alent. 


SUMMARY 


Motion picture sequences of spots repre-
senting projections of points rotating in three
dimensions were produced using the CRT out-
put of a digital computer. The sequences
varied in number of points and in perspective,
defined as the ratio of the distances from the
projection point to the most distant and closest planes
of the display. A paired comparison method
was used to elicit judgments of the relative 
strength of the depth impressions created by 
the sequences and of the relative coherence 
of the patterns while in motion. 

Judged strength of the depth impression 
increased with increasing numbers of spots 
and, to a lesser degree, with increasing 
perspective. Subjective coherence decreased 
with increasing perspective. The role of 
perceived coherence of moving patterns in 
depth perception and the effects of variations 


in perspective on depth judgments are
discussed.


REFERENCES 


Gibson, J. J. The perception of the visual
world. Boston: Houghton Mifflin, 1950.

Gibson, J. J., & Gibson, E. J. Continuous
perspective transformations and the percep-
tion of rigid motion. J. exp. Psychol.,
1957, 54, 129-138.

Green, B. F. Kinetic depth effect. (Psy-
chology Group 58) Quarterly Progress Re-
port, Massachusetts Institute of Tech-
nology, Lincoln Laboratory, 1959. (a)

Green, B. F. Mathematical notes on 3-D
rotations, 2-D perspective transformations,
and dot configurations. Group Report No.
58-5, Massachusetts Institute of Tech-
nology, Lincoln Laboratory, 1959. (b)

Metzger, W. Tiefenerscheinungen in op-
tischen Bewegungsfeldern. Psychol. For-
sch., 1934, 20, 195-260.

Miles, W. R. Movement interpretations of
the silhouette of a revolving fan. Amer. J.
Psychol., 1931, 43, 392-405.

Mood, A. M. Introduction to the theory of
statistics. New York: McGraw-Hill, 1950.

Wallach, H., & O’Connell, D. N. The
kinetic depth effect. J. exp. Psychol., 1953,
45, 205-217.


(Received October 2, 1961) 


Journal of Experimental Psychology
1962, Vol. 64, No. 4, 421


SUPPLEMENTARY REPORT: PROACTIVE INHIBITION AS A FUNCTION
OF THE METHOD OF REPRODUCTION ¹


JOHN L. WIPF AND WILSE B. WEBB
University of Florida


Greenberg and Underwood (1950) found 
that paired-associate recall, after 5 hr., was 
inversely related to the number of lists previ- 
ously learned. Thus, proactive inhibition 
occurred. Their learning materials were two- 
syllable adjectives arranged in four paired-
associate lists of 10 pairs each. The anticipa-
tion method was used to test recall.

Method.—The present study is a replica-
tion of the Greenberg and Underwood study
in that it employed a 5-hr. retention interval, 
a 2-sec. presentation rate on a memory drum, 
the same learning-relearning criteria (8/10 
correct and 10/10 correct, respectively), and 
four lists over 4 days. However, in the pres- 
ent study, the method of presentation was 
modified to permit the use of free recall or 
reproduction as a retention measure. The 
10 Greenberg-Underwood response words in a
list were shown one at a time. After each 
trial (one presentation of the 10 words), Ss 
were required to reproduce in writing the 
words they could remember. A 3-min. limit 
was set on this recall. Successive trials, using 
four different serial arrangements, were given 
until learning criterion was met. 

The method of reproduction was similarly
employed in testing recall, i.e., after 5 hr. Ss
reproduced, in writing, as many words as
they could recall. Recall was immediately
followed by relearning. A different list was
used on each of 4 successive days. Twenty-
four different orders of the four lists were
used. Twenty-four Ss (11 men and 13 women
introductory psychology students) each learned
the lists in a different order.

Results.—Separate analyses of three de-
pendent measures, number of trials to learn,
number of words correctly recalled, and num-
ber of trials to relearn, were performed. The
means and SDs of these measures are given
in Table 1. An analysis of variance showed
the differences among the mean number of
words correctly recalled over days to be
significant (P < .001). Thus, significant
PI at recall occurred from day to day, after
the first day. Recall differences among Ss
were significant. Recall differences among
lists were not significant.


¹ Based on a thesis submitted to the Graduate School,
University of Florida, in partial fulfillment of the re-
quirements for the Master of Arts degree.


TABLE 1

MEANS AND SDs OF TRIALS TO LEARN, WORDS
RECALLED, AND TRIALS TO RELEARN

        Trials to Learn     Words Recalled
Day      Mean     SD         Mean     SD
1        2.46     .16        6.17    2.15
2        2.91    1.55        4.95    2.42
3        2.58     .81        4.62    2.64
4        2.21     OL         3.92    2.44

The analysis of the learning measures
showed differences due to days to be signifi-
cant (P < .05). These differences appear
to result from a facilitation effect (see Table 1
means, Days 2 to 4). List differences were
significant (P < .01). Thus, word lists dif-
fered in difficulty in terms of trials-to-learn.
Subject differences were again significant.

List difficulty in terms of trials-to-learn
should have had no differential effect because
all possible sequences of the four lists were 
used; hence list differences were counter- 
balanced throughout the study. 
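
The counterbalancing argument rests on the fact that four lists admit exactly 24 orders, one per S. A brief sketch, with hypothetical list labels, confirms that under complete counterbalancing every list falls on every day equally often:

```python
from collections import Counter
from itertools import permutations

LISTS = ("A", "B", "C", "D")  # hypothetical labels for the four word lists

# All possible orders of the four lists, one for each of the 24 Ss.
orders = list(permutations(LISTS))

# How often each list is learned on each of the 4 days across all orders.
position_counts = Counter(
    (day, word_list)
    for order in orders
    for day, word_list in enumerate(order, start=1)
)

if __name__ == "__main__":
    print(len(orders), "orders")
    print(position_counts[(1, "A")], "Ss learn list A on Day 1")
```

Each list appears on each day for 6 of the 24 Ss, so differences in list difficulty are spread evenly over days.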

The analysis of the relearning measures 
showed Day and List differences not to be 
significant. Subject differences were sig- 
nificant. 

Allowing ample time for response and 
written reproduction, the use of unpaired 
presentations increased the number of words 
recalled and decreased the number of trials 
required to learn and relearn (compare Table 
1 with the Greenberg-Underwood results). 
However, a PI effect was demonstrated. A 
minimal conclusion which may be drawn from 
these data is that the PI effect demonstrated 
by Greenberg and Underwood is not a func- 
tion of the particular learning-recall procedure 


utilized.


REFERENCE

GREENBERG, R., & UNDERWOOD, B. J. Retention as a
function of stage of practice. J. exp. Psychol., 1950,
40, 452-4


(Received July 13, 1961) 




Journal of Experimental Psychology
1962, Vol. 64, No. 4, 422-423


NDI-


JAMES D. WYNNE AND W. J. BROGDEN
University of Wisconsin 


Hoffeld, Thompson, and Brogden (1958) 


de of sen 


ince tone 
the i 


for 
maximum at 4 
however, a confo; 
duration, and the 
represent con 
Tf a fixed d 


magnitude 
Sec, 


Re! 
ng and pre, 
Preconditioning, 


circles 


is taken from H 
circles 


resen: 
The smooth curve 
for the data of the pre. 
or which is given in th 


experi , 

€ Upper left corner or the figure 
! This research was 

the National Scien, 

Committee of 

by the Wisc On 


Supported in rt 
ce Foundation and 


p ie 
the Graduate School from funds 
sin Alumni Research Foundation. 


ous and a 
relation; (c) 
training trial 


"4 . 
ch group was 8; . 
on to the rotator 
and (f) following the test 
given tone-shock avoidance 
ttempt to obtain an addi- 
nal measure of SP, Since this procedim 
Produced no significant results, no report 0 
is made. b 
Results —Magnitude of SP is nan 
frequency of response in the cross-modi 
generalization tests of tone alone, E: 
following avoidance conditioning to the igl 


An orthogonal polynomial analysis of
variance (Grant, 1956) of the data of the
experimental groups shows significant differ-
ences between groups and a significant quad-
ratic trend (P = .05). The experimental
group means are plotted in Fig. 1 as is
the curve of the best-fitting quadratic equa-
tion. Range tests (Duncan, 1951) show
that the mean for 4 sec. precedence is signi-
ficantly different from all other means, none
of which differ significantly from each other.

The means for precedence conditions —4,
—2, and 16 sec. are either
equal to or less than the mean for the control
group (9), so it is improbable that any SP
occurs for these conditions.
The control group of the present study
shows greater cross-modal generalization
than similar groups have shown in previous
studies (Brogden, 1939; Hoffeld et al., 1958).
If the en of Hoffeld et al. (1958) are 


S 58. 
toss-modal gene: 
test is 93 
clearly excludes evidence of SP at precedes 
els of —4 and —2 sec. (backward cond c 
tioning) and long trace conditioning at 16 a 
ce. If the combined control gro! f 
data are used in One-tailed ? tests with that 0 


sia ans 
the remaining Experimental groups, the me 


sec. 
for ence conditions 0, 1, 2, pe a 
exceed t ontrol group mean at the 5 ei 
and for nce conditions — 1 and 




exceed the control mean at the 20% and 10% 
levels, respectively. 

Discussion.—The present results are in 
good agreement with those of Hoffeld et al. 
(1958) which are plotted in Fig. 1. In com- 
paring the two studies it is evident that the 
confounding of duration of tone with its 
precedence over light during preconditioning
training has little if any effect upon the 
magnitude of SP. Saying it another way, 
delay and trace conditioning procedures 
during preconditioning make little if any 
difference as long as the time relations be- 
tween the onset of the CS (tone) and onset 
of the UCS (light) are the same. 

A difference in magnitude of SP, favoring 
the present experiment, was expected on the 
basis of parametric differences in the number 
of preconditioning trials, since Hoffeld et al. 
(1960) found this variable to have a marked 
effect upon the magnitude of SP. They 
obtained a mean frequency of response to 
trials of tone alone of 24.17 for 4 precondi- 
tioning trials (value used in the present study) 
and a mean of 10.50 responses for 200 trials 
(value used by Hoffeld et al., 1958). The 
time relations during preconditioning of the 
Hoffeld et al. (1960) experiment were those 
found optimal by Hoffeld et al. (1958) and 
involved a 6-sec. tone and 2-sec. light with 
4 sec. precedence. In view of the minor 
differences in amount of SP for comparable 
time relations between the present study and 
that of Hoffeld et al. (1958), the possibility 
of interactions between trace and delayed
time relations with number of preconditioning 
trials must be considered. The possibility 
also exists that an unknown variable or 
parameter present in the Hoffeld et al. (1960) 
study accounts for the high level of SP. 


Journal of Experimental Psychology
1962, Vol. 64, No. 4, 423-424


SUPPLEMENTARY REPORT:


Although a significant quadratic trend in 
magnitude of SP as a function of time rela- 
tions was found in the present experiment, 
the best-fitting quadratic equation does not 
give a satisfactory description of the rela- 
tionship in the data. Other analyses estab- 
lish a maximum effect at tone precedence of 
4 sec., with no effect for the backward condi- 
tions of —4 and —2 sec. precedence or for 
the long trace condition of 16 sec. precedence. 
Whether SP occurs at the backward condition 
of —1 sec. precedence and the trace condition 
of 8 sec. precedence is dubious. These two 
conditions represent the extremes between 
which fall the time relations during precondi- 
tioning that are effective in producing SP. 
A quadratic function with a maximum around 
4 sec. precedence and zero at the backward 
condition of approximately —1 sec. pre-
cedence and the forward condition of approxi-
mately 8 sec. precedence may be reasonable,
but is not clearly supported by the data 


of the experiment. 


REFERENCES 


BROGDEN, W. J. Sensory preconditioning. J. exp.
Psychol., 1939, 25, 323-332.
DUNCAN, D. B. A significance test for differences be-
tween ranked treatments in an analysis of variance.
GRANT, D. A. Analysis-of-variance tests in the analysis
and comparison of curves. Psychol. Bull., 1956, 53,
HOFFELD, D. R., KENDALL, S. B., THOMPSON, R. F., &
BROGDEN, W. J. Effect of amount of preconditioning
training upon the magnitude of sensory precondi-
tioning. J. exp. Psychol., 1960, 59, 198-204.
HOFFELD, D. R., THOMPSON, R. F., & BROGDEN, W. J.
J. exp. Psychol., 1958, 56,
SILVER, C. A., & MEYER, D. R. Temporal factors in
sensory preconditioning. J. comp. physiol. Psychol.,
1954, 47, 57-59.


(Received July 13, 1961) 


SEMANTIC GENERALIZATION IN 


PROBABILITY LEARNING 
J. P. DAS 
Utkal University, Cuttack, India


This experiment is planned to find out 
whether the introduction of semantic stimuli 
changes the course of generalization in 
probability learning. The present study is 
similar in design and procedure to one by 
Popper and Atkinson (1958). Both are based 
on predictions from a discrimination model 
and its application given in Estes and Burke 
(1955). Every trial begins with the presen-
tation of a T₁ or a T₂ stimulus, the probability
of each being .50. Following a T₁ stimulus,
the letter X occurs with a probability of .90
and Y with a probability of .10. But follow-
ing a T₂ stimulus, X occurs with a probability
of .70 for one group and .30 for another
group; the corresponding probability of Y
being .30 and .70, respectively. These prob-
ability schedules are identical with those of
Groups II and IV of a related experiment 
by Atkinson, Bogartz, and Turner (1959). 
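
The schedule just described can be made concrete with a short sketch. It is illustrative only and not the procedure of the original study: the function name is hypothetical, and it assumes that each 40-trial block holds 20 T₁ and 20 T₂ cards with the outcome proportions fixed exactly within the block (for example, 18 X and 2 Y after T₁) and the order shuffled.

```python
import random


def make_block(p_x_after_t2, rng, trials_per_stimulus=20, p_x_after_t1=0.90):
    """Build one shuffled block of (stimulus, outcome) cards.
    Assumption: outcome proportions are exact within a block rather than sampled."""
    cards = []
    for stimulus, p_x in (("T1", p_x_after_t1), ("T2", p_x_after_t2)):
        n_x = round(p_x * trials_per_stimulus)
        cards += [(stimulus, "X")] * n_x
        cards += [(stimulus, "Y")] * (trials_per_stimulus - n_x)
    rng.shuffle(cards)
    return cards


if __name__ == "__main__":
    block = make_block(p_x_after_t2=0.70, rng=random.Random(1))
    print(len(block), block.count(("T1", "X")), block.count(("T2", "X")))
```

With the .70 schedule a block contains 18 T₁-X and 14 T₂-X cards; with the .30 schedule the T₂ counts are reversed.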




TABLE 1 
MEANS AND SDs OF X PREDICTIONS OVER 20 TRIALS FOLLOWING T₁ AND T₂


Trial Blocks 


4 
= 
ey, Mean sp] 
2.18 18.73 | 1.61 
3.44 15.27 2.93 
2.33 17.86 2.17 
2.34 3.33 2.71 
1.70 17.92 2.23 
1.56 17.46 115 
3.30 16.46 | 3.22 
3.63 5.38 3.58 


However, there is a major difference between
the Popper and Atkinson experiment and the
present one: words are
used here in place of nonsense syllables
as T₁ and T₂ stimuli.

Method.—The T₁ stimulus was an Indian
word, RAJANI, meaning night. The two T₂
stimulus words, NIGHT and DAY, were synon-

R90-N30
in which T₁ was RAJANI and probability of X
.90, whereas T₂ was DAY and
.70. The
defined.
group of Ss received 160 trials in all,
40-trial blocks. Within
20 T₁ and 20 T₂ trials were
randomly distributed, and in the T₁ and T₂
example: in Group I,
RAJANI-X a combination
in 18 cards whereas the
had RAJANI-Y. Similarly
of DAY cards had 14 cards of DAY-X and 6
The Ss, students, had four strips
of paper for the four blocks of trials. On
strip they found two columns, marked
T₁ and T₂ stimuli at the top. Their task
was to guess whether X or Y would come and
record it under the appropriate columns
immediately after E read out the T₁ or T₂
stimulus. Then E exposed the card, and at
the same time read out whatever was written
took about 10 sec.: 3 sec. to read out the
stimulus word, 4 sec. allowed to Ss for putting
down their guesses, and 3 sec. to expose the
card to Ss and begin the next trial. The Ss
covered their previous choices by folding
the recorded portion of the paper after each
trial.

Results.—A summary of the findings is
given in Table 1. The Ss' predictions were
expected to vary on two accounts—semantic
relation between T₁ and T₂, and similarity
between the probabilities of X after T₁ and
T₂. Groups I and III, and Groups II and IV,
therefore, may be meaningfully compared.
The final proportion of R choices in Group I
should be higher than the same in Group III
because there will be a greater generalization
between the synonyms R and N than between
the antonym D. Similarly, the mean
prediction of R in the final trial block for
Group IV will be less than that for Group I,
whereas the reverse would be true for N and
D. The series of t's computed shows that
although the mean differences are in the
expected direction, only the difference be-
tween N70 and D70 reached at least the .0
level of significance. This result may be
interpreted as offering limited support to
the hypothesis that semantic stimuli influence
generalization.


REFERENCES

ATKINSON, R. C., BOGARTZ, W. H., & TURNER, R. N.
Discrimination learning with probabilistic reinforce-
ment schedules. J. exp. Psychol., 1959, 57,
ESTES, W. K., & BURKE, C. J. Application of a sta-
tistical model to simple discrimination learning in
human subjects. J. exp. Psychol., 1955, 50, 81-88.
POPPER, J., & ATKINSON, R. C. Discrimination learn-
ing in a verbal conditioning situation. J. exp.
Psychol., 1958, 56,

(Received August 14, 1961) 


Journal of
Experimental Psychology


Vol. 64, No. 5                NOVEMBER 1962


THE EFFECTS OF DIFFERENTIAL VISUAL STIMULATION
AFTER INDUCTION OF VISUAL AFTEREFFECTS ¹


HERBERT L. PICK, JR., MAVIS HETHERINGTON,
AND ROLAND BELKNAPP

University of Wisconsin


The relationship between Gibson’s
negative aftereffect and the Köhler-
Wallach figural aftereffect has been
the subject of controversy.


Gibson (1933) found that a curved line,
when perceived for a period of time, becomes
phenomenally less curved than it was origi-
nally, and that after such an inspection period
an objectively straight line appears curved in
the opposite direction from the inspection
curve. These negative aftereffects were at-
tributed to a process similar to sensory
adaptation in which curvature perceptions
when long continued tend to approach the
norm of a straight line. Gibson regards the
straight line as a neutral from which other
lines deviate. Thus a straight line will serve
as an anchoring point in the perception of
curved lines. A frequent condition of the
environment tends to become a norm of the
phenomenal world and new stimuli are per-
ceived in relation to it.

Köhler and Wallach (1944) in their studies
of figural aftereffects postulated an electrical
field process in the visual cortex which satiates
the cortex in the immediate area of the
cortical representation of the inspection figure.


¹ This research was supported in part by
the Research Committee of the Graduate
School of the University of Wisconsin with
funds provided by the Wisconsin Alumni Re-
search Foundation. Authors are indebted to
J. J. Gibson for a critical and helpful reading
of the manuscript.


This satiation results in increased resistance
to further stimulation in this area and to
displacement of the cortical representation to
neighboring regions upon subsequent stimulation.
Köhler and Wallach proposed that
Gibson's curved line effect could be adequately
explained by their satiation theory,
if the test line were considered to be displaced
from the satiation area of the inspection
curve.

Although Osgood and Heyer (1952) suggest
an alternative mechanism to explain
figural aftereffects, both their theory and the
Köhler-Wallach theory are based upon the
same sort of physiological satiation by
prolonged inspection of contours.

Several recent attempts have been made
to differentiate experimentally the Gibson
negative aftereffect from the Köhler-Wallach
figural aftereffect. Sagara and Oyama (1957)
in their survey of studies of figural aftereffects
in Japan cited considerable evidence to
indicate that the curved line effect could not be
explained by the Köhler-Wallach theory.
Bergman and Gibson (1959), using a slanted
textured surface as a stimulus, demonstrated
that one type of negative aftereffect could not
be explained on the basis of satiation and
contour displacement. Gibson (1959a) has
argued on logical grounds that the two kinds
of aftereffect cannot possibly be the same,
and has recently (Gibson, 1959b, pp. 489-491)
restated the normalization hypothesis.


Such evidence as that reported by
Sagara and Oyama (1957) and Bergman
and Gibson (1959) suggests that
different processes are involved in the
two types of aftereffect. Further
investigation seemed warranted to elucidate
the nature of these differences
and to clarify the theoretical bases of
the phenomena.

The Köhler-Wallach theory suggests
a gradual dissipation with time
of the differential satiation in the
visual cortex. If the dissipation is
spontaneous, and if satiation explains
both contour-displacement and
curvature-straightening, it could be
predicted that with homogeneous
stimulation (e.g., a “ganzfeld”) interpolated
between the viewing of the
inspection figure and of the test figure,
both types of visual aftereffects should
decrease. The Gibson theory, on the
other hand, suggests that the normalization
of a curved line would not
dissipate during an afterperiod of
homogeneous stimulation since the


METHOD


In order to test the above predictions three
different postinspection visual experiences
were introduced following the ...
Köhler-Wallach aftereffect ...
curved line aftereffect, a ...
Gibson negative aftereffect ...
school students enrolled ...
psychology or education
courses.


Apparatus.—The apparatus for the
Köhler-Wallach effect was
a modified Dodge type


tachistoscope in which the inspection (I)
figure was shown binocularly at a distance of
58 cm. from the eye. The test (T) figures
were exposed at an equal distance. In both
cases the visual field was 19.7 cm. square.
The I figure consisted of a circle 5.5 cm. in
diameter which was placed to the left of a
fixation point in the center of the visual field.
The width of the line defining the circumference
of the circle was 2 mm. Each T figure
was drawn on white cardboard and consisted
of two circles placed on opposite sides and
equally distant from a fixation point. The
width of the line in the T circles was .75 mm.
In half the test trials the circle on the left was
7.6 cm. in diameter and would tend to be
expanded outward to produce an aftereffect of
increased size. These circles were always
concentric with the original I figure. When
the left circle was 7.6 cm. in diameter, the
circle on the right was 7.6, 7.8, 8.0, 8.2, or
8.4 cm. in diameter. In the other half of the
test trials the circle on the left was 4.0 cm. in
diameter and would tend to be compressed
inward to produce an aftereffect of decreased
size. When the circle on the left of the T
figure was 4.0 cm. in diameter the circle on the
right was 3.6, 3.7, 3.8, 3.9, or 4.0 cm. in
diameter. In both cases the size of the right
circle was varied randomly from trial to trial.
The use of two different size circles on the left
was necessitated by the possibility of response
bias occurring if the figural aftereffects always
occurred in the same direction. A measure
of the magnitude of the figural aftereffect
could be determined by noting size judgments
of circles on the right in relation to circles on
the left. The method of obtaining this measure
is discussed in the section on scoring of
performance.
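
The linear sizes and the 58-cm. viewing distance given above can be converted to visual angle, which is often the more convenient unit when comparing displays. The following is only an illustrative sketch: the function name is hypothetical, and the calculation assumes a flat figure viewed head-on and centered on the line of sight, which the fixation arrangement described here only approximates.

```python
import math


def visual_angle_deg(size_cm: float, distance_cm: float) -> float:
    """Angle subtended at the eye by an object of the given linear size, in degrees.

    Assumes the object is flat, frontal, and centered on the line of sight.
    """
    return math.degrees(2 * math.atan((size_cm / 2) / distance_cm))


# Dimensions taken from the apparatus description.
inspection_circle = visual_angle_deg(5.5, 58)   # I circle, 5.5 cm. at 58 cm.
visual_field = visual_angle_deg(19.7, 58)       # side of the 19.7-cm. square field

print(f"I circle: {inspection_circle:.2f} deg; field side: {visual_field:.2f} deg")
```

Under these assumptions the 5.5-cm. inspection circle subtends roughly 5.4 degrees.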


The Gibson visual aftereffect was produced
by having Ss fixate an I figure consisting of a
black curved line 3 mm. in width and convex
to the left drawn on white cardboard. In
order to eliminate straight reference lines, the
line was presented through a circular
aperture 30 cm. from the eye. The diameter
of the aperture was 25.5 cm. The curved
line extended the full length of the aperture
and was bowed 2 cm. at its center. A series
of five T lines was constructed and presented
in the same manner. The T lines, 3 mm. in
thickness, were also bowed to the left by 0.0,
0.1, 0.2, 0.3, and 0.5 cm. The T lines were
presented in a random order. The magnitude
of the aftereffect could be determined by
noting which of these lines were perceived as
straight.

The brightness of the larger surfaces (walls,
table-top) to which S was exposed under
conditions of normal stimulation varied from
40 to 4.4 ft-c as measured by a Macbeth
illuminometer. The goggles for the ganzfeld
condition passed approximately 25% of the
illumination. The brightness of the I and T
fields for the Köhler-Wallach figural aftereffects
was 2.7 ft-c. The brightness of the I
and T fields for the Gibson aftereffects was
3.2 ft-c.

Procedure.—Hammer (1949), in the only
published study of the dissipation of figural
aftereffects, found that they decreased to
zero in 150 sec. Pilot studies, using the
conditions outlined above, indicated that with
5-min. I periods and subsequent normal
visual conditions the Köhler-Wallach aftereffect
diminished to zero in approximately
30 min.; the Gibson effect in 15 min. These
time intervals were consequently used for the
postinduction visual exposure periods in the
respective experiments. Each experiment
consisted of five steps in the following order:
an initial control test with the T figures, a
5-min. fixation of the I figure, an immediate
test of the magnitude of the aftereffect, a
postinspection period in one of three visual
conditions, and a final test of the magnitude
of the aftereffect.

In the Gibson procedure the control test
consisted in making judgments of the direction
of curvature of the five curved test figures
convex to the left plus two curved figures
convex to the right. The latter curved lines
were included to accustom S to perceive lines
curved in both directions. These lines were
presented in random order. This procedure
required approximately 20 sec. depending on
the speed of judgment of S. During the
5-min. I period which followed the control
test, S was instructed to run his eyes slowly
up and down the middle portion of the I
line. The immediate test of the magnitude
of the aftereffect was the same as the control
test except for the omission of the curves
convex to the right. One-third of the Ss then
received each of the following postinspection
conditions for 15 min.: Cond. I, normal
stimulation, S looked around the room; Cond.
II, homogeneous light stimulation, S’s vision
was limited to a homogeneous ganzfeld,
produced by goggles which covered each eye
with a concave section of a translucent ping
pong ball; Cond. III, homogeneous lack of
stimulation, S had a black blindfold over open
eyes. A final test of the magnitude of the
aftereffect was administered as before.

In the Köhler-Wallach procedure the
initial control test consisted of viewing the 10
test cards with two black circles on them.
Cards were presented in random order at
4-sec. intervals. The S was instructed to fixate
on the cross between the circles and report
which of the circles was larger or if they were
equal in size. During the 5-min. I period S
was directed to fixate on the cross on the I
figure. The subsequent test utilized the
same procedure and stimuli as the control
test. One-third of the Ss were then exposed
for 30 min. to each of the three postinspection
conditions described above and finally the
third test of the magnitude of the figural
aftereffect was administered in the same
manner as the previous ones.

Scoring of performance.—The scoring
procedure for the Köhler-Wallach aftereffect was
as follows: An arbitrary scoring system
assigned numerical values to the size of the
figural aftereffect. If the 7.6-cm. circle on the
right was judged to be equal to the 7.6-cm.
circle on the left (standard), a score of zero
was assigned; if it was judged smaller (this
being evidence of a figural aftereffect), a
score of +1 was assigned. If the 7.8-cm.
circle on the right was judged smaller than the
standard, a score of +2 was assigned. Thus
the weights were increased by integral units
as the magnitude of figural aftereffect
increased. An identical procedure was used
with the 4.0-cm. standard circle, but since the
effect of the inspection was to make the
4.0-cm. standard appear smaller, evidence of the
visual aftereffect was a tendency of the circle
on the right to appear larger.

A similar procedure was used to score the
magnitude of the aftereffect of the Gibson
type.
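
The scoring rule for the Köhler-Wallach cards can be written out as a short sketch. The text gives only the first two weights (+1 and +2) and states that weights rose by integral units, so the weights of +3 to +5 for the remaining comparison circles are an extrapolation. Scoring every other judgment as zero is an assumption as well, since the report does not say how, for example, an "equal" judgment of the 7.8-cm. circle was treated. All names below are hypothetical.

```python
# Comparison circles (cm.), ordered from the one physically equal to the
# standard to the one that differs most from it.
COMPARISONS = {
    7.6: [7.6, 7.8, 8.0, 8.2, 8.4],
    4.0: [4.0, 3.9, 3.8, 3.7, 3.6],
}

# Judgment of the right-hand circle that counts as evidence of an aftereffect.
EVIDENCE = {7.6: "smaller", 4.0: "larger"}


def score_card(standard, comparison, judgment):
    """Score one test card; judgment is 'smaller', 'equal', or 'larger'.

    Assumption: only the judgment in the aftereffect direction earns points,
    and the weight is 1 plus the step of the comparison circle.
    """
    step = COMPARISONS[standard].index(comparison)
    return step + 1 if judgment == EVIDENCE[standard] else 0


def total_score(responses):
    """Sum the card scores for one S over all test cards."""
    return sum(score_card(std, comp, judg) for std, comp, judg in responses)


# Two cards from one hypothetical test series.
example = [(7.6, 7.8, "smaller"), (4.0, 4.0, "equal")]
print(total_score(example))
```

The total over the 10 cards corresponds to the per-S value described in the results, under the assumptions stated above.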
RESULTS AND DISCUSSION 


With the scoring procedures a value
was computed for each S for the
control test, the initial aftereffect test,
and the final aftereffect test. These
were computed by totaling the number
of points obtained by S on all the
test stimuli. The average values
obtained and their SDs are shown in
Table 1. In each experiment the
values obtained for each of the three
groups could be compared. The differences
between the magnitude of the
aftereffect (immediately after inspection)
and the final magnitude (after
the interpolated visual experience)
were analyzed with each S serving as

TABLE 1
AVERAGE MAGNITUDES AND SDs OF AFTEREFFECT SCORES

                              Köhler-Wallach                          Gibson
                       Normal    Blindfold   Ganzfeld      Normal    Blindfold   Ganzfeld
Test                  Mean  SD   Mean  SD    Mean  SD     Mean  SD   Mean  SD    Mean  SD
Control                2.8  3.5   2.6  3.8    2.0  2.1     0.1  0.3   0.1  0.7    0.2  0.4
Initial aftereffect    6.6  4.2   7.2  5.4    5.7  3.3     3.0  2.0   2.8  1.9    4.4  ...
Final aftereffect      1.8  4.1   5.6  4.2    8.1  3.0     0.9  1.0   0.6  0.5    0.6  ...


his own baseline. Negative differ- 
ences occurred where the final after- 
effect test resulted in a lower score 
than the immediate postinspection 
test, i.e., a decrease of the aftereffect 
over time. Conversely, positive dif- 
ferences indicated an increase of after- 
effect over time. 
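
The sign convention for these difference scores can be illustrated with the Köhler-Wallach group means. This is a sketch only: the means are those read from the scanned copy of Table 1, the names are hypothetical, and the analysis in the article was carried out on each S's own difference rather than on group means (the mean of the individual differences equals the difference of the means when the same Ss contribute to both tests).

```python
# Köhler-Wallach group means as read from Table 1
# (initial = immediately after inspection, final = after the 30-min. period).
kw_means = {
    "normal":    {"initial": 6.6, "final": 1.8},
    "blindfold": {"initial": 7.2, "final": 5.6},
    "ganzfeld":  {"initial": 5.7, "final": 8.1},
}


def change(initial, final):
    """Final minus initial: negative means decay, positive means enhancement."""
    return round(final - initial, 1)


kw_change = {group: change(m["initial"], m["final"]) for group, m in kw_means.items()}
print(kw_change)
```

On these figures only the ganzfeld group shows a positive difference, which is the pattern the text goes on to describe.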

The results shown in Table 1 sug- 
gest that with the Köhler-Wallach 
experiment the ganzfeld condition 
serves to enhance the aftereffect, 
and although the blindfold condition 
does not enhance the aftereffect it does 
reduce the amount of decay. The 
greatest decrease in aftereffect occur- 
red with the normal vision group. 
With the Gibson aftereffect the great- 
est decay occurred under the ganzfeld 
condition. These findings are verified 
when the data are subjected to 
analysis of variance. 

With the Köhler-Wallach effect the 
three groups are significantly different 
from each other (P < .01) and the 
ganzfeld group shows a significant 
enhancement effect (P < .05), i.e., an
increase in magnitude of aftereffect 
over the 30-min. period. A significant 
difference (P < .025) between groups 
found with the Gibson aftereffect is 
apparently due to the fact that the 
ganzfeld group had somewhat inflated 
values in the initial aftereffect test. 
In any case no significant differences 
in final level of aftereffect occurred 
with the Gibson aftereffect, i.e., the 


aftereffect had vanished completely 
under all three visual conditions. 


Since conditions of homogeneous
stimulation affect these two visual aftereffects
in radically different ways, it is
strongly implied that the two phenomena
are basically different.

The results are, in general, opposite to
what might be expected if each theory
predicted the outcome for its own type
of aftereffect. The inference from the
Köhler-Wallach theory was that a diminution
of the aftereffect would occur
following conditions of homogeneous
stimulation. However, the Köhler-Wallach
aftereffect was enhanced by the
ganzfeld situation. Gibson's normalization
theory would predict a maintenance
of the curved line aftereffect under
homogeneous stimulation but instead this
effect had essentially vanished.

The nature of the perceptual process
which might account for the paradoxical
enhancement or the maintenance of the
Köhler-Wallach effect warrants further
investigation. It may be related to the
higher susceptibility to figural aftereffects
found after a period of sensory
deprivation (Doane, Mahatoo, Heron, &
Scott, 1959). In the present study, if
the ganzfeld and blindfold conditions
were considered conditions of sensory
deprivation it would suggest that stimulation
just prior to such deprivation has a
particularly strong effect. If the homogeneous
conditions are related to sensory
deprivation the differences obtained
under the two conditions require further
investigation.




SUMMARY 


The effects of three conditions of poststimulation
on a Köhler-Wallach figural aftereffect
and a Gibson negative aftereffect were
investigated. Condition I was normal stimulation,
obtained by looking around the room;
Cond. II was homogeneous lack of stimulation,
obtained by wearing a black blindfold;
Cond. III was homogeneous light stimulation
obtained by exposure to a “ganzfeld.” The
Gibson aftereffect decreased normally under
all three conditions. The ganzfeld enhanced
the Köhler-Wallach aftereffect, the blindfold
retarded the decrease in the aftereffect, and
looking around the room permitted the normal
disappearance of the figural aftereffect.


REFERENCES 


BERGMAN, R., & GIBSON, J. J. The negative
after-effect of the perception of surface
slanted in the third dimension. Amer. J.
Psychol., 1959, 72, 364-374.

DOANE, B. K., MAHATOO, W., HERON, W., &
SCOTT, T. H. Changes in perceptual function
after isolation. Canad. J. Psychol.,
1959, 13, 210-219.

GIBSON, J. J. Adaptation, after-effect, and
contrast in the perception of curved lines.
J. exp. Psychol., 1933, 16, 1-31.

GIBSON, J. J. After-effects: Figural and
negative. Contemp. Psychol., 1959, 4,
294-295. (a)

GIBSON, J. J. Perception as a function of
stimulation. In S. Koch (Ed.), Psychology:
A study of a science. Vol. 1. New York:
McGraw-Hill, 1959. Pp. 456-501. (b)

HAMMER, E. R. Temporal factors in figural
after-effects. Amer. J. Psychol., 1949, 62,
337-354.

KÖHLER, W., & WALLACH, H. Figural after-effects.
Proc. Amer. Phil. Soc., 1944, 88,
269-357.

OSGOOD, C. E., & HEYER, A. W., JR. A new
interpretation of figural after-effects. Psychol.
Rev., 1952, 59, 98-118.

SAGARA, M., & OYAMA, T. Experimental
studies on figural after-effects in Japan.
Psychol. Bull., 1957, 54, 327-338.


(Received October 9, 1961) 


Journal of Experimental Psychology
1962, Vol. 64, No. 5, 430-433


EFFECTS OF PROBABLE OUTCOME INFORMATION
ON TWO-CHOICE LEARNING


RICHARD C. NIES 1


University of California, Los Angeles 


The impact of statistical learning 
theory (Bush & Mosteller, 1955; 
Estes, 1950) has focused attention 
upon probability learning in the two- 
choice situation (Goodnow, 1958; 
Humphreys, 1939). Although success 
is maximized by consistent choice of 
the more likely alternative, Ss typ- 
ically approximate the actual propor- 
tions of the reinforcement schedule 
(Hake, 1955; Jenkins & Stanley, 
1951). However, when the task is 
presented in a gambling context (i.e., 
total correct choices are maximized), 
Ss tend to predict the more frequent 
event at a significantly higher propor- 
tion of the trials (Goodnow, 1955; 
Siegel & Goldstein, 1959). The pres- 
ent study was designed to further 
explore the conditions which maximize 
success by manipulating information 
about the probability of the events.
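
The gap between probability matching and consistent choice of the more likely alternative can be made concrete with a small calculation. The sketch below assumes that S's prediction on a trial is independent of the event on that trial; the function name is hypothetical, and the .70 value is the event probability used in the present study.

```python
def expected_correct(p_event: float, p_predict: float) -> float:
    """Expected proportion of correct predictions.

    p_event   -- probability of the more frequent event
    p_predict -- proportion of trials on which S predicts that event
    Assumes predictions are independent of the events.
    """
    return p_event * p_predict + (1 - p_event) * (1 - p_predict)


matching = expected_correct(0.70, 0.70)    # S matches the event proportions
maximizing = expected_correct(0.70, 1.00)  # S always predicts the frequent event

print(f"matching: {matching:.2f}, maximizing: {maximizing:.2f}")
```

Under this assumption matching yields about 58% correct predictions against 70% for the consistent strategy.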
Although responses in two-choice 
situations appear to be relatively 
independent of experimental instruc- 
tions (Anderson & Grant, 1957; 
Neimark & Shuford, 1959) and per- 
formance information (Das, 1961), 
Koehler (1961) was able to predict 
mean terminal response rates by vary- 
ing instructions which dealt with how 
Ss should consider the nonreinforced 
trials in a two-choice contingent 
partial reinforcement situation. In 
view of this finding, the stability of 
behavior to the manipulative effects 
of differential information in such 
situations is to be questioned. It was
therefore hypothesized that specific
information about the probabilities of
two alternative outcomes would produce
a shift toward consistent selection
of the more likely event.


1 The author expresses his appreciation to
Allen Parducci and Norman H. Anderson for
their invaluable guidance in the design and
analysis of this experiment.


METHOD 


Apparatus.—A box containing 100 marbles, 
70 of one color and 30 of another color, was 
mounted so that the marbles could be mixed 
by turning a crank protruding from one end. 
A trough at the other end received 1 marble 
whenever the box was tilted forward, the 
marble rolling back into the box when the 
apparatus was returned to its normal position. 

Subjects.—Eight experimental and four 
control sessions were held, with groups of 16 
Ss drawn for each session from the course in 
introductory psychology at the University of 
California, Los Angeles.

Procedure.—The following instructions
were read aloud to all Ss in the experimental
groups:


This experiment is designed to study
human guessing habits. In this box are a
number of blue and red marbles (shown
briefly). For each trial, I will shake these
thoroughly and allow one marble to roll into
the trough (demonstrated). While I am
shaking the box I will count to three
(demonstrated), and by the time I say
“three,” you write down on the response
sheet, which will be given you shortly,
whether you think a blue marble or a red
marble will roll out. After you have written
your choice, I will allow a marble to roll out
and you will then be told its color. Your
task is to get as many correct predictions
as you can.


To assure the 70-30 ratio of blue to red ...
schedule, Ss were led to believe that he was
reporting the color of the marble that actually
rolled out. This procedure insured that the
different groups were exposed to identical
sequences.

The Ss in each experimental session were
randomly divided into four equal subgroups,
differentiated with respect to the additional
information printed at the top of their
response sheets. The No-Information group
had no additional information. The Pattern
group had the following sentence added: “It
has been found in experiments of this kind
that the marbles roll out in definite patterns.”
The Ratio group had the statement: “There
are 100 marbles in the box: 70 are blue and 30
are red.” And the Ratio-Explanation group
was informed that: “There are 100 marbles in
the box: 70 are blue and 30 are red. This
means that the chances of a blue marble
... 10, and the chances of ...
to the box, these odds will be the same for
each trial. Furthermore, since the box is
shaken thoroughly each time, there can be no
fixed pattern in which the marbles roll out.”

For all conditions, S was to indicate his
prediction by recording either an “R” or a
“B” on each trial.

To control for color preferences and
sequence peculiarities, half the experimental
Ss were run with colors reversed (i.e., 70 reds
to 30 blues), ... run using a
different 70-30 sequence ... for
unique group differences, ...
experimental combinations ...
with a second group. The result was a
2 × 2 × 4 factorial design (Color × Sequence
× Information) replicated with a second set
of four experimental groups.
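
The layout of the experimental groups can be checked by enumerating the design. This is a sketch under stated assumptions: the labels for the two sequences and the two replications are hypothetical, and each session of 16 Ss is taken to be split evenly over the four information subgroups, as the procedure describes.

```python
from itertools import product

COLORS = ["70 blue", "70 red"]
SEQUENCES = ["sequence 1", "sequence 2"]         # hypothetical labels
REPLICATIONS = ["first group", "second group"]   # hypothetical labels
INFORMATION = ["No-Information", "Pattern", "Ratio", "Ratio-Explanation"]
SS_PER_SESSION = 16

# One experimental session per Color x Sequence x replication combination.
sessions = list(product(COLORS, SEQUENCES, REPLICATIONS))

# Each session is divided into the four information subgroups.
cells = list(product(sessions, INFORMATION))
ss_per_cell = SS_PER_SESSION // len(INFORMATION)

total_ss = len(cells) * ss_per_cell
pattern_ss = sum(ss_per_cell for _, info in cells if info == "Pattern")
df_ss_within_groups = len(cells) * (ss_per_cell - 1)

print(len(sessions), len(cells), total_ss, pattern_ss, df_ss_within_groups)
```

The counts agree with figures given elsewhere in the article: eight experimental sessions, 128 experimental Ss, 32 Ss in the Pattern group, and 96 df for Ss within groups.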

Four control groups (two groups for each
... reinforcement sequences ...
condition only) were also run ...
baseline against which the effect of the
marble box could be assessed. ...
read the following instructions:


This experiment is designed to study
human guessing habits. For each trial, I
will call out one of two colors, blue or red.
Before I announce the color, I will count to
three (demonstrated). While I am counting,
you write down on the response sheet,
which will be given you shortly, whether
you think I will call out blue or red. After
you have written your choice, I will then
announce the color. Your task is to get as
many correct predictions as you can.


Fig. 1. Mean proportional choices for the
.70 marble by trial blocks under various levels
of information. (Abscissa: blocks of trials;
ordinate: mean proportional choices for the .70 marble.)


RESULTS AND DISCUSSION


Figure 1 shows the mean proportion 
of choices for the more likely alterna- 
tive over each successive block of 
trials. Separate analyses of variance 
were performed for each block of 50 
trials, using the mean proportions as 
raw scores. Table 1 summarizes the 
analysis for the first 50 trials, showing 
the significant effect of Information 


and also of the interaction between 
Information and Sequence. This 
interaction was found only in the first 


TABLE 1

ANALYSIS OF VARIANCE FOR TRIAL BLOCKS
1-50 FOR EXPERIMENTAL GROUPS

Source                            df       MS        F
Instructions (I)                   3     639.80    14.55**
Color (C)                          1      54.82     1.44
Sequence (S)                       1       2.32      .06
I × C                              3       5.55      .12
I × S                              3     156.18     3.53*
C × S                              1      19.33      .51
I × C × S                          3      40.88      .92
Groups within treatments a         4      38.10      .86
Pooled Groups × Instructions      12      51.91     1.17
Ss within groups b                96      44.28

a Error term for C, S, and C × S.
b Error term for I, I × C, I × S, I × C × S,
Groups within treatments, and Pooled Groups × Instructions.
* Significant at .05 level.
** Significant at .001 level.


block of 50 trials, so it was not judged 
as critical in modifying the interpreta- 
tion of the main effects. In the other 
blocks, the only significant effect that 
appeared was that for Information in 
Trial Blocks 51-100 and 101-150 
(F = 3.64 and 3.61, respectively; 
df = 3/96; P < .05). The different
levels of information thus produced
significant differences in betting behavior
over the first three trial blocks.
Since the experimental conditions 
revealed no significant effects for 
Trials 201-250, the scores for this 
block were pooled (using the .70-red 
condition only) and tested against the 
control scores for the same block. 
The difference was highly significant 
(F = 16.33; df = 1/100; P < .001). 
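
The F ratios of Table 1 can be rechecked from the mean squares and the error terms named in the table's footnotes. The sketch below uses the mean squares as read from the scanned table; the dictionary keys and function name are hypothetical.

```python
# Mean squares as read from Table 1 (Trial Blocks 1-50).
MS = {
    "I": 639.80,
    "C": 54.82,
    "S": 2.32,
    "IxC": 5.55,
    "IxS": 156.18,
    "CxS": 19.33,
    "IxCxS": 40.88,
    "groups_within_treatments": 38.10,
    "pooled_groups_x_instructions": 51.91,
    "ss_within_groups": 44.28,
}

# Footnote a: C, S, and C x S are tested against Groups within treatments.
# Footnote b: all other effects are tested against Ss within groups.
TESTED_AGAINST_GROUPS = {"C", "S", "CxS"}


def f_ratio(effect: str) -> float:
    """F for an effect: its mean square over the appropriate error mean square."""
    if effect in TESTED_AGAINST_GROUPS:
        error = "groups_within_treatments"
    else:
        error = "ss_within_groups"
    return round(MS[effect] / MS[error], 2)


for effect in MS:
    if effect != "ss_within_groups":
        print(f"{effect}: F = {f_ratio(effect):.2f}")
```

Most of the recomputed ratios reproduce the tabled values to two decimals. The ratio for the main effect of Instructions comes out near 14.45 on these figures, against the 14.55 legible in the scan, and the I × C ratio near .13; whether those gaps arise in the scan or in the original table cannot be settled from the text.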
These analyses support the following
conclusions: (a) The information
about probable event outcomes affects
Ss’ responses during the early blocks
of trials (up to 150 trials), but shows
no significant effect in later blocks.
These early differences appear to be
relatively independent of color prefer- 
ences, sequence peculiarities, and 
group differences. (b) While the con- 
trol groups (no marbles) just reach 
the probability matching level on the 
last block of trials, the experimental 
groups (marbles) exceed this level for 
all the trial blocks. Relevant to this 
finding is the proposal by Flood (1954) 
and Rubinstein (1959) that the aware- 
ness of randomness in a two-choice 
probabilistic outcome situation tends 
to elevate S’s response predictions, 
the impossibility of a complete solu- 
tion increasing S's caution against 
betting on the less likely alternative. 
Randomness was made explicit in the 
present study by the use of a box in 
which marbles were thoroughly mixed 
in Ss’ presence before each prediction 
was made. In the control groups 
with no marbles, Ss displayed typical 
probability matching behavior. 


While consistent with the interpreta- 
tion made by Flood and Rubinstein, the 




present experiment advances their argu- 
ment by demonstrating an early facilita- 
tion of optimal betting through the 
introduction of specific information re- 
levant to the probable outcome of the 
events. While the presence of the marble 
box can account for the higher response 
level of the experimental groups, pre- 
sumably through its contribution to the 
random appearance of the events, the 
perception of randomness was hastened 
by the critical information supplied to 
Ratio and Ratio-Explanation groups. 
When information is effective in deter- 
mining behavior in a two-choice, un- 
certain outcome situation, its effective- 
ness may be based on its contribution to 
the perception of randomness. The fact 
that the instructions used by Anderson 
and Grant (1957) and Neimark and 
Shuford (1959) did not elevate Ss’ 
response levels may result from a failure 
to make explicit the randomness and 
impossibility of a complete solution. 
Although the experimental conditions 
elevated the proportion of correct antici- 
pations, only 4 Ss (all from the Ratio- 
Explanation group) learned to bet 100% 
on the more likely alternative. Two 
suggestions for this relative lack of 
optimal betting find support in the 
present research: (a) The Ss seem sur- 
prisingly expectant of patterns and sys- 
tems when there is any challenge of a 
problem to be solved (Goodnow, 1958). 
This is consistent with the responses to a 
questionnaire given at the conclusion of 
the present experiment where 75 of the 
128 experimental Ss reported looking for 
a pattern whereas only 32 had been told 
(falsely) that there was one. Also, as 
shown in Fig. 1, the betting curve for the 
Pattern group is fairly close to that of the 
No-Information group. Moreover, while 
patterns may be expected from the prob- 
lem solving nature of the task, certain 
characteristics of the data support it. In 
contrast to the method of constrained 
randomization used by Edwards (1961; 
Lindman & Edwards, 1961) and Nicks 
(1959), the method used in this research 
involved an unrestrained randomization, 
resulting in a distribution of lengths of 
homogeneous outcome runs which in- 
cluded too many short runs and not 
enough long runs. Thus there were 




patterns in the outcome sequence which 
allowed for the gambler's fallacy. The 
point to be made here is that this reduc- 
tion from optimal betting is not “irra- 
tional” in terms of the Ss’ set to expect 
patterns. (b) Furthermore, the Ss prefer 
the occasional success of guessing the 
unlikely alternative to the monotony of 
repetitiously predicting the more certain 
event where no tangible inducements are 
offered to maximize their correct predic- 
tions (Goodnow, 1955; Siegel & Gold- 
stein, 1959). Thus, there is a second 
source of reinforcement at work, the 
utility of correctly predicting the occur- 
rence of the less frequent event, which 
subtracts from maximum gain responding 
(Brackbill, Kappy, & Starr, 1962). This 


also is consistent with the questionnaire 
responses; 77 of the experimental Ss 
reported that they knew the wisest 
procedure would be to bet consistently 
on the more likely alternative. 
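
The point about runs of homogeneous outcomes can be examined by tallying run lengths in an outcome sequence. The following is only a sketch of such a tally: the function names are hypothetical, and the short sequence is made up for illustration and is not taken from the experiment.

```python
from collections import Counter
from itertools import groupby


def run_lengths(outcomes):
    """Lengths of successive runs of identical outcomes, in order of occurrence."""
    return [len(list(group)) for _, group in groupby(outcomes)]


def run_length_distribution(outcomes):
    """Number of runs observed at each run length."""
    return Counter(run_lengths(outcomes))


# Made-up sequence of blue (B) and red (R) outcomes.
example = "BBRBBBRRB"
print(run_lengths(example))
print(run_length_distribution(example))
```

A tally of this kind, compared with the distribution expected for independent draws, is one way the excess of short runs mentioned above could be assessed.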

The data, thus, suggest a compromise 
between the expedience of betting 100% 
for the more likely alternative (because 
of the “chance” or “random” structure 
made patent by the experimental ap- 
paratus) and the challenge of betting 
70% for the more likely alternative (i.e., 


trying out different patterns in an effort 
to “beat” the game). 


SUMMARY 


The experiment investigated the effect of 
different levels of probability information on 
response frequencies in a random, two-choice 
situation with unequal event probabilities. 
This information was demonstrated to have 
differential effects during the early trials. 
In addition, it was found that Ss reached a 
significantly higher response level when the 
outcome of a trial appeared to depend upon 
the chance drawing of a marble from a box 
than when the marble box was absent. These 
findings were interpreted in terms of the 
perception of “randomness.”


REFERENCES 


ANDERSON, N. H., & GRANT, D. A. A test of
a statistical learning theory model for
two-choice behavior with double stimulus
events. J. exp. Psychol., 1957, 54, 305-317.

BRACKBILL, Y., KAPPY, M. S., & STARR, R. H.
Magnitude of reward and probability
learning. J. exp. Psychol., 1962, 63, 32-

BUSH, R. R., & MOSTELLER, F. Stochastic
models for learning. New York: Wiley,
1955.

DAS, J. P. Mathematical solution in the
acquisition of a verbal CR. J. exp.
Psychol., 1961, 61, 376-378.

EDWARDS, W. Probability learning in 1000
trials. J. exp. Psychol., 1961, 62, 385-394.

ESTES, W. K. Toward a statistical theory of
learning. Psychol. Rev., 1950, 57, 94-107.

FLOOD, M. M. Environmental non-stationarity
in a sequential decision-making
experiment. In R. M. Thrall, C. H.
Coombs, and R. L. Davis (Eds.), Decision
processes. New York: Wiley, 1954. Pp.
287-299.

GOODNOW, J. J. Determinants of choice
distributions in two-choice probability
situations. Amer. J. Psychol., 1955, 68,
106-116.

GOODNOW, J. J. A review of studies on
probable events. Aust. J. Psychol., 1958,
10, 111-125.

HAKE, H. W. The perception of frequency
of occurrence and the development of
“expectancy” in human experimental subjects.
In H. Quastler (Ed.), Information
theory in psychology. Glencoe, Ill.: Free
Press, 1955. Pp. 257-274.

HUMPHREYS, L. G. Acquisition and extinction
of verbal expectation in a situation
analogous to conditioning. J. exp. Psychol.,
1939, 25, 294-301.

JENKINS, W. O., & STANLEY, J. C., JR.
Partial reinforcement: A review and a
critique. Psychol. Bull., 1951, 41, 291-297.

KOEHLER, J., JR. Role of instructions in
two-choice verbal conditioning with contingent
partial reinforcement. J. exp. Psychol.,
1961, 62, 122-125.

LINDMAN, H., & EDWARDS, W. Supplementary
report: Unlearning the gambler’s
fallacy. J. exp. Psychol., 1961, 62, 630.

NEIMARK, E. D., & SHUFORD, E. H. Comparison
of predictions and estimates in a
probability learning situation. J. exp.
Psychol., 1959, 57, 294-298.

NICKS, D. C. Prediction of sequential
two-choice decisions from event runs. J. exp.
Psychol., 1959, 57, 105-114.

RUBINSTEIN, I. Some factors in probability
matching. J. exp. Psychol., 1959, 57,
413-416.

SIEGEL, S., & GOLDSTEIN, D. A. Decision-making
behavior in a two-choice uncertain
outcome situation. J. exp. Psychol., 1959,
57, 37-42.


(Early publication received March 16, 1962) 


Journal of Experimental Psychology 
1962, Vol. 64, No. 5, 434-440 


COLOR CODING AND VISUAL SEARCH 1


SIDNEY L. SMITH 
The MITRE Corporation, Bedford, Massachusetts 


There is relatively little available 
data on the effectiveness of color as a 
coding dimension for information 
transmission in visual displays. Some 
studies have been published exploring 
the limitations of our ability to 
identify different hues on the basis of 
absolute discrimination. They sug- 
gest that under good viewing con- 
ditions 5 to 8 colors can be dis-
tinguished reliably (Conover, 1959;
Conover & Kraft, 1958; Eriksen & 
Hake, 1955), or perhaps even as many 
as 9 to 12 under optimal conditions 
(Chapanis & Halsey, 1956; Halsey & 
Chapanis, 1951). 

Whichever number we accept, this 
means that color as a coding dimen- 
sion can be used to distinguish only a 
relatively small number of categories, 
as compared with shape coding (sym- 
bology) which carries the bulk of 
information in visual displays. How-
ever, within this limitation, there is
evidence that color is better than
shape in tasks which involve locating
displayed data (Christner & Ray,
1961; Hitt, 1961).

The effectiveness of color coding 
for locating particular displayed items 
is related, by extension, to its use for 
providing visual separability among 
data classes. Results on this question 
have been reported by Green and 
Anderson (1956). They examined the
degree to which color coding permitted 
visual separability of displayed two- 


¹ The research reported in this article was
supported by the Department of the Air
Force under Contract AF-33(600)39852. A
more detailed account of this research was
published as a MITRE Technical Series
Report, MTS-7, "Display Color Coding for a
Visual Search Task," June 1962.




digit numbers, as measured by de- 
creases in average visual search time 
when the color of a “target” number 
was known beforehand (relevant) as 
compared with other occasions when 
it was not known (nonrelevant). 
They (Green & Anderson, 1956) sum- 
marized their conclusions as follows: 


When Os know the color of the target, the 
search time is approximately proportional to 
the number of symbols of the target’s color.
There is also a slight increment in search time
due to the presence of the wrong-colored
targets. When Os do not know the target's
color, search time depends primarily on the 
total number of symbols on the display. 
However, search times are slightly longer for 
multicolored displays than for comparable 
single-colored displays (p. 24). 


In an attempt to confirm these re- 
sults, this present experimental study 
was conducted. This study accepts 
the basic premise of Green and Ander- 
son that visual search time is a funda- 
mental measure of the potential value 
of display color coding. It simply 
expanded their model to include a 
greater range of displayed densities, 
more displayed colors, both light and 
dark display backgrounds, and certain 
modified techniques of display pres- 
entation. 

PROCEDURE 

Twelve Ss participated in this study, 11 
men and 1 woman. Preliminary testing 
using the American Optical Company H-R-R 
pseudoisochromatic plates confirmed that all 
Ss had normal color vision. In the course of 
the study, each S made a total of 300 visual 
searches, using a variety of different displays. 
This required several experimental sessions 
for each S. Individual Ss worked for no
longer than 1 hr. at a time, and for no more 
than 2 hr. per day. 

The displays consisted of varying arrays of 
three-digit numbers. These numbers were
randomly placed in a square field, which can 
be imagined as comprising 13 columns and 
27 rows, for a total of 351 possible positions. 
The numbers themselves were chosen ran- 
domly from the 1000 possibilities (000 through 
999) with certain restrictions: the numbers on 
each particular display were all different; 
they were unique in terms of their first two 
digits; their third digits represented an equal 
sample of each of the 10 possibilities (0 
through 9). In the case of multicolored 
displays, the particular color of each three- 
digit number was also chosen at random, 
with the restriction that all colors were 
equally represented. 
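
The sampling restrictions just described can be sketched in code. This is only an illustration under stated assumptions, not the procedure actually used to prepare the slides: the function names, the use of Python's `random` module, and the treatment of the 13 x 27 field as a set of grid cells are all hypothetical.

```python
import random


def make_display_numbers(n_items, rng):
    """Draw n_items three-digit strings such that the first two digits are
    unique on the display and the third digits are an equal sample of 0-9."""
    if n_items % 10 != 0 or not 0 < n_items <= 100:
        raise ValueError("n_items must be a multiple of 10 between 10 and 100")
    prefixes = rng.sample(range(100), n_items)      # unique first-two-digit pairs
    thirds = list(range(10)) * (n_items // 10)      # each final digit equally often
    rng.shuffle(thirds)
    return [f"{p:02d}{t}" for p, t in zip(prefixes, thirds)]


def assign_colors(numbers, colors, rng):
    """Assign colors at random with every color equally represented."""
    if len(numbers) % len(colors) != 0:
        raise ValueError("items must divide evenly among colors")
    pool = [c for c in colors for _ in range(len(numbers) // len(colors))]
    rng.shuffle(pool)
    return list(zip(numbers, pool))


def place_items(n_items, rng, columns=13, rows=27):
    """Pick distinct (row, column) cells from the 351 possible positions."""
    cells = [(r, c) for r in range(rows) for c in range(columns)]
    return rng.sample(cells, n_items)


rng = random.Random(0)                  # fixed seed, for repeatability only
nums = make_display_numbers(60, rng)
colored = assign_colors(nums, ["red", "green", "blue"], rng)
cells = place_items(60, rng)
```

Because the first two digits are unique, the numbers on a display are necessarily all different, so that restriction needs no separate check.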

The displays were made up as 2 X 2 in. 
color slides, and presented to Ss by rear 
projection on the screen of an experimental 
console. The dimensions of the digits as
projected on this screen were 4 in. high X
1 in. wide. Viewing distance was about 18 in.
The overall display field as projected was 
12 in. square. 

The colors used on the various displays
were red, green, blue, orange, and either
white (on slides with a black background) or
black (on slides with a white background).
Visual [...]ted with
standard colors resulted in agreement by 2
Os on the descriptive specifications shown in
Table 1.

Because rear projection was used, a
moderately high ambient illumination was
maintained, by diffuse overhead lighting from
dimmed fluorescents: over .5 ft-c as measured
at the experimental console.

The experimental routine for S began when 
E signaled to him the first two digits of the 
target number, the number he was to find on 
the display. These two digits were displayed 
on an auxiliary panel. The S also saw on this 
panel either a colored indicator showing what 
color the target number would be, or else a 
statement "color unknown." The E then
exposed the slide on the screen in front of S
and started a clock. The S searched for the 


target number, indicated when he had found 


TABLE 1
DISPLAY COLORS USED (MUNSELL NOTATION)

Display Color      [column headings illegible]
Red                [..]R 5/10       5R 5/12
Green              [..]GY 8/8       5GY 7/8
Blue               [illegible]      [illegible]
Orange             [..]YR [..]      [..]YR [..]
Black/White        [illegible]      N 9/0


TABLE 2
DISPLAYS USED OF EACH TYPE

Number of          Number of Displayed Items
Colors           20     40     60     80    100
   1              5      5      5      5      5
   2                    10
   3                           10
   4                                   5
   5              5      5      5      5      5

it by pushing one of 10 numbered buttons, 
corresponding to the third digit of the target 
number. If this response was correct, the 
clock stopped, a chime sounded, and the 
screen went blank. The E recorded the time,
and then set up the auxiliary display panel to 
begin the next trial. In the very rare case 
when S pushed the wrong response button, a 
loud buzzer sounded, and the trial was run 
again later in the experimental series. 

The displays were presented in a series of 
150 slides, half with white backgrounds and 
an identical group with black backgrounds. 
Each group of 75 slides contained both single- 
and multicolored displays with display 
densities varying from 20 to 100 three-digit 
numbers (which will be called “items” in the 
next few paragraphs to avoid confusion). 
The types of displays used are summarized in 
Table 2. The entries in the matrix represent 
the number of different displays used. 
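
As a check on the counts in Table 2, the number of displays of each type can be tallied as below. This is a sketch only: it assumes that the two-, three-, and four-colored displays exhaust the possible combinations of the five colors, which the text states for the three- and four-colored cases and which agrees with the tabled count of 10 for the two-colored case. The variable names are hypothetical.

```python
from math import comb

N_COLORS = 5
DENSITIES = (20, 40, 60, 80, 100)

# Upper row of the matrix: one single-colored display per color at each density.
single_colored = {d: N_COLORS for d in DENSITIES}

# Lower row: five-colored displays, 5 at each density to match the first set.
five_colored = {d: 5 for d in DENSITIES}

# Diagonal: k colors at 20 items per color, one display per color combination.
diagonal = {k: comb(N_COLORS, k) for k in (2, 3, 4)}

per_background = (sum(single_colored.values())
                  + sum(five_colored.values())
                  + sum(diagonal.values()))
total_slides = 2 * per_background       # white and black backgrounds
```

The tally gives 75 displays per background and 150 slides in all, in agreement with the series described above.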

There are really three sets of displays 
represented in the matrix. The upper row 
consists of single-colored displays of increasing 
display density, with 5 displays (one of each 
color used) at each density level. The lower
row consists of a comparison set of displays, 
each with items of all five colors on it. It was 
arbitrarily decided to use 5 displays at each 
density level, to match the first set. The 
diagonal set of the matrix represents those 
displays where an increasing number of dis-
played colors is associated with increasing
display density, with the advantage that a 
constant number (20) of items of any par-
ticular color [...]
10 three-colored displays represented all the
possible triple combinations; the 5 four-
colored displays represented the cases where
each of the five available colors had been
omitted.

The series of 150 displays described above
were presented in a random order, which in 
turn was randomized (in five blocks) differ- 
ently for different Ss. Each S went through 
this series twice in the course of the experi- 
ment, for a total of 300 searches. The first 
time through, for half of the displays (ran- 
domly chosen) he was told in advance the 
color of the target number, the number-to-be- 
searched-for. For the other half of the dis- 
plays, the color of the target number was 
unknown to him. During his second run 
through the series, these conditions were re- 
versed for each particular display—where the 
color of the target had been known before, it 
was now unknown, and vice versa. 
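
The reversal of prior-knowledge conditions between the two runs can be sketched as follows. This is a minimal illustration under assumptions: the function name and the use of Python's `random` module are hypothetical, and the five-block randomization of presentation order is not modeled because the text does not specify it further.

```python
import random


def knowledge_schedule(n_displays, rng):
    """Return two dicts mapping display index -> True when the target color
    is told in advance, for the first and second run through the series."""
    ids = list(range(n_displays))
    known_first = set(rng.sample(ids, n_displays // 2))   # random half
    first_run = {i: (i in known_first) for i in ids}
    second_run = {i: not first_run[i] for i in ids}        # conditions reversed
    return first_run, second_run


first_run, second_run = knowledge_schedule(150, random.Random(1))
```

Across the two runs every display is searched once with the target color known and once with it unknown, so that each S contributes 150 searches to each condition.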

In summary, visual search time data were 
obtained from 12 Ss, each viewing a series of 
300 displays, which varied in display density, 
in number of colors used, in the particular 
color of the target, with either a white or 
black background, under conditions where 
S either knew the color of the target number 
in advance or did not.


RESULTS AND DISCUSSION


Because of the relative complexity 
of the experimental design, no single 
statistical treatment would suffice to 
answer all the questions we might 
wish to raise. In fact, five separate 
analyses of variance were made, each 
using some portion of the data, in 
order to examine the effect of different 
combinations of the experimental 
variables. 

The most extensive analysis of 
variance, and one basic to the sub- 
sequent data analysis, was carried out 
using individual search times obtained 
under all conditions involving either 
single-colored or five-colored displays. 
These data represent a factorial design 
of 12 Ss by 2 conditions of prior 
knowledge (target color either known 
or unknown) by 2 types of display 
background by 5 degrees of display 
density by 2 types of display (either 
single-colored or five-colored) by 5 
possible colors used for the target 
number. Taken together they com- 
prise 2400 measures. 

The data initially available for this 


analysis were search times expressed 
in .01 sec. Because of the inherently 
high correlation between mean and 
variance in search time data, a log 
transform was used prior to the vari- 
ance analysis computations. The 
computations themselves were carried 
out by an electronic computer, and 
sums of squares were obtained for the 
six experimental variables and their 57 
various interaction terms, including 
the six-way residual. Following a 
procedure suggested by Edwards 
(1950) for the treatment of repeated 
measures obtained from the same Ss, 
all sums of squares representing 
interactions of Ss with other experi- 
mental variables were pooled to form 
one residual representing the Ss 
X Conditions interaction. The mean 
square of this interaction was used as 
the error term to test the significance 
of all others. (As it happens, if Ss had
been treated as a legitimate variable
in the analysis, none of the interac-
tions involving Ss would have proved
statistically significant.) Because of
the large number of F ratio compari-
sons made in this analysis, only
significance levels of at least .001 were
accepted as persuasive evidence of
statistical reliability. The summa-
rized results of this analysis are
presented in Table 3. 
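
The size of this design and the degrees of freedom that follow from it can be verified with a short computation. The sketch below only does bookkeeping on the factor levels given in the text; the dictionary keys are shorthand labels chosen here and are not taken from the original analysis program.

```python
from itertools import combinations
from math import prod

# Levels of the six experimental variables named in the text.
LEVELS = {"Ss": 12, "K": 2, "B": 2, "D": 5, "N": 2, "C": 5}

n_measures = prod(LEVELS.values())

# Interaction terms among six variables: every subset of two or more.
n_interactions = sum(1 for r in range(2, 7) for _ in combinations(LEVELS, r))


def df(term):
    """Degrees of freedom of a main effect or interaction term."""
    return prod(LEVELS[f] - 1 for f in term)


conditions = [f for f in LEVELS if f != "Ss"]
df_conditions = prod(LEVELS[f] for f in conditions) - 1
df_main = sum(df((f,)) for f in conditions)
df_two_way = sum(df(t) for t in combinations(conditions, 2))
df_higher = df_conditions - df_main - df_two_way
df_error = df(("Ss",)) * df_conditions      # pooled Ss X Conditions term
```

These give 2400 measures, 57 interaction terms, 145 df for the higher-order interactions among conditions, and 2189 df for the pooled Ss X Conditions error term, matching the figures reported.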

We may note, first, that neither the 
particular target color used, nor the 
display background, nor any inter- 
action term including these variables, 
had any statistically significant effect. 
We might have expected, for example, 
that some colors on a light back- 
ground would be less legible than on a 
dark background, and hence more 
difficult to scan quickly, since it is 
clear that visual contrast is an im- 
portant variable in legibility. How- 
ever, the present data suggest that we 
need not expect any measurable effect 
related to different contrast ratios 




TABLE 3

ANALYSIS OF VARIANCE OF TRANSFORMED
SEARCH TIMES FOR ALL EXPERIMENTAL
CONDITIONS INVOLVING ONE- AND
FIVE-COLORED DISPLAYS

Source of Variance                       df       MS         F
Ss                                       11     0.708        —
Conditions                              199       —          —
  Knowledge of target color (K)           1    21.787     229.34*
  Display background (B)                  1  [illegible] [illegible]
  Display density (D)                     4    16.791     176.75*
  Number of colors (N)                    1    14.956     157.43*
  Color of target (C)                     4     0.063       0.66
  K X B                                   1     0.018       0.16
  K X D                                   4     0.619       6.52*
  K X N                                   1    15.549     163.67*
  K X C                                   4     0.106       1.12
  B X D                                   4     0.071       0.75
  B X N                                   1     0.000       0.00
  B X C                                   4     0.095       0.98
  D X N                                   4     0.129       1.36
  D X C                                  16     0.129       1.36
  N X C                                   4     0.292       3.06
  Higher-order interactions
    among conditions                    145     0.096       —ᵃ
Ss X Conditions                        2189     0.095        —

ᵃ None was significant when tested individually.
* P < .001.


that are all well above some sort of 
legibility threshold. 
Next, consider the marked effect on 


search time of increasing display 
density, confirmed by the statistical 
analysis and illustrated in varying 


degrees by Fig. 1-3. It is the nature 
of these data that there is greater in- 
herent variability of the search time 
measure in those situations where the 
average search takes longer. Taking 
this into account, it seems that the 
average search time data closely 
approximate a direct linear relation 
with display density. This was also 
the case in the Green and Anderson 
study and has been noted before by 
other investigators (e.g., Green, Mc-
Gill, & Jenkins, 1953).

If the “curves” in Fig. 1 are extra- 
polated backward to a display density 
of zero, they would intersect the 
ordinate axis at a value of about 1 
sec., which presumably represents the 
simple button-pushing reaction time. 
This is confirmed by the observation 
that only one or two of the individual 
search time measures obtained in this 


437 


study were less than 1 sec., and those 
were smaller by only a slight margin. 
The same observation was made by 
Green and Anderson (1956) under 
similar circumstances. It should be 
pointed out that their reported search 
times were in general somewhat 
shorter than those in the present 
study. In part, this is because they 
reported geometric rather than arith- 
metic means. And, in part, this 
difference is probably attributable to 
the fact that their display field 
(12 X 16Ẹł in., viewed from a distance 
of 10 ft.) subtended a smaller visual 
angle than was the case in this present 
study. Hence, fewer eye movements 
were required to scan their displays. 
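
The two explanations offered here can be made concrete with a small calculation. The sketch below is illustrative only: it uses the 12-in. dimension of each display field (the second dimension of the Green and Anderson field is not modeled), and the sample search times passed to the two averaging functions are hypothetical values, not data from either study.

```python
import math


def visual_angle_deg(size, distance):
    """Visual angle in degrees subtended by an extent `size` viewed from
    `distance` (same units), centered on the line of sight."""
    return math.degrees(2 * math.atan(size / (2 * distance)))


def arithmetic_mean(xs):
    return sum(xs) / len(xs)


def geometric_mean(xs):
    """Antilog of the mean logarithm; never exceeds the arithmetic mean."""
    return math.exp(sum(math.log(x) for x in xs) / len(xs))


present_study = visual_angle_deg(12, 18)        # 12-in. field at about 18 in.
green_anderson = visual_angle_deg(12, 10 * 12)  # 12-in. dimension at 10 ft.

hypothetical_times = [2.0, 4.0, 8.0]            # illustrative values, in sec.
am = arithmetic_mean(hypothetical_times)
gm = geometric_mean(hypothetical_times)
```

On these assumptions the present display subtends roughly 37 deg. against roughly 6 deg. for the 12-in. dimension of the earlier display, and for positively skewed times the geometric mean falls below the arithmetic mean.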
The significant interaction between 
knowledge of target color, and number 
of colors displayed, indicates the 
potential value of color coding when 
the color is relevant to the search 
task. This is illustrated by the sizable 
difference between the curves in 
Fig. 1. If the color coding permitted 
absolute visual separability between 
classes of displayed items, then the 
average search times for these five- 


FIG. 1. Search time as a function of
display density with knowledge of target
color as a parameter. (Five-color displays;
average search time in seconds plotted against
display density, the number of three-digit items.)


color displays would be only one- 
fifth as great when the target color 
was known as when it was unknown. 
In actuality, the difference was not 
quite so great as this, but considerable 
nonetheless. 
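
The one-fifth prediction follows from the linear relation between search time and the number of items that must be scanned. The sketch below is a hedged illustration rather than a fitted model: the slope is a hypothetical value, and the function and its parameter names are introduced here only for the example. The intercept defaults to zero so that the ratio is exactly one-fifth; with a nonzero intercept for the button-pushing response, only the search component is reduced to one-fifth.

```python
def predicted_search_time(n_items, slope, intercept=0.0,
                          n_colors=1, color_known=False):
    """Linear search-time prediction. Under absolute visual separability,
    a known target color restricts the search to the items of that color."""
    effective_items = n_items / n_colors if color_known else n_items
    return intercept + slope * effective_items


HYPOTHETICAL_SLOPE = 0.1    # sec. per item, an assumed value for illustration

unknown = predicted_search_time(100, HYPOTHETICAL_SLOPE, n_colors=5)
known = predicted_search_time(100, HYPOTHETICAL_SLOPE, n_colors=5,
                              color_known=True)
ratio = known / unknown
```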

Comparable curves for single- 
colored displays showed no such dif- 
ference, i.e., knowing the target color 
beforehand did not speed the sub- 
sequent visual search process. In- 
deed, there is no logical reason to 
expect that it would. However, to 
check this, a separate variance anal- 
ysis was conducted, similar to that 
already described, but using only the 
search times for single-colored dis- 
plays. The only significant variable 
in this treatment turned out to be 
display density. Prior knowledge of 
the target color made no difference. 

A comparable variance analysis was 
run comparing one- and five-colored 
displays, but using only the search 
time data obtained when Ss had no 
prior knowledge of target color. 
Again, the only significant variable 
proved to be display density. In 
particular, number of colors displayed 
made no difference to Ss when they 
had no knowledge of the target color 
to guide them. This lack of difference 
is illustrated by the overlapping 
curves in Fig. 2.

This last finding would seem to bear 
directly on the conclusion by Green 
and Anderson that multicolored dis- 
plays retard visual search when target 
color is unknown, and in fact, is 
evidence that no such effect occurred 
in this present study. It should be 
noted, however, that their conclusion 
was based on somewhat different 
evidence. Data obtained under equi- 
valent conditions in this present study 
are summarized in Fig. 3. The dis-
plays considered here are those which
always contained 20 items of the same
color as the target number. For the


FIG. 2. Search time as a function of display
density under conditions of nonrelevant color
coding and of no color coding (single-colored
displays).


displays with just one color, this is 
all they contained. For the two- 
colored displays, there were also 20 
items of the second color present. For 
the three-colored displays, there were 
20 items displayed for each of the two 
other (nontarget) colors—and so on, 
for the four- and five-colored displays. 


FIG. 3. Search time as a function of dis-
play density when target color is known and
unknown and when there are 20 items of
target color in all cases. (Average search time
in seconds plotted against display density, the
number of three-digit items, with the number
of display colors on the upper axis.)


This comparison in Fig. 3, showing the
considerably reduced search times 
when the target color was known to 
Ss, is essentially the comparison made 
by Green and Anderson in their 
report. It is true that the number of 
displayed colors is here confounded 
with differences in display density.
But in the sense that each display 
contains just 20 target numbers amid 
varying amounts of “clutter,” the 
comparison it permits is interesting. 

To permit a direct check of the 
Green and Anderson (1956) results, 
a further analysis of variance was run,
comparing search times for single- 
colored displays, of 40, 60, 80, and 100 
items, with those for two-, three-, 
four-, and five-colored displays, of 
corresponding respective densities, 
under conditions of no prior knowl- 
edge of target color. Since neither 
display background nor target color 
had proved to be significant variables 
in preceding analyses, these were 
eliminated in this analysis by sum- 
ming individual search times across 
these variables. Thus the data used
consisted of the sum of 10 search times 
(5 target colors by 2 display back- 
grounds) under each combination of 
display density and number of colors. 
Actually, this is not entirely true, 
for in the case of the two-colored, 
40-item displays, and the three-
colored, 60-item displays, there were
20 search times available (10 target
color/other color combinations by
2 display backgrounds). In these cases,
half the available data were chosen 
randomly to use in the variance anal- 
ysis. This total data subset, then, 
represents a factorial design of 12
Ss by 4 degrees of display density 
by 2 types of displays—single-colored 
versus multicolored—for a total of 96 
measures. 

The results of this analysis are 


summarized in Table 4. The only 


TABLE 4

ANALYSIS OF VARIANCE OF SUMMED SEARCH
TIMES FOR CONDITIONS INVOLVING SINGLE-
AND MULTICOLORED DISPLAYS WITH
NO PRIOR KNOWLEDGE
OF TARGET COLOR

Source of Variance            df       MS        F
Ss                            11     9,351
Conditions                     7
  Number of colors (N)         1     3,449     0.71
  Display density (D)          3    68,973    14.11*
  N X D                        3     4,397     0.90
Ss X Conditions               77     4,[...]

* P < .[...]


variable with a statistically significant 
effect proves again to be display 
density. There is no reliable differ- 
ence that can be attributed to number 
of displayed colors. If such a differ- 
ence had been confirmed, it would 
have been in favor of the multicolored 
displays, which were searched in an 
average time of 5.9 sec., as compared 
with 6.5 sec. for the single-colored 
displays. Certainly we cannot con- 
clude that multicolored displays were 
distracting in this present study, even 
though the target color was unknown. 


Why did Green and Anderson obtain 
a different result? The most probable 
explanation lies in their choice of projec- 
tion technique: luminous colored num- 
bers displayed in a dark environment 
would provide a stimulus situation very 
conducive to perception of a depth 
illusion based on differences in either 
brightness or wave length. If their dis-
played numbers of different colors ap-
peared to be in slightly different frontal
planes, this might have led their Ss to
scan a multicolored display several times,
a process that on the average would be 
somewhat slower than one systematic 
search of a single-colored display. In 
this present study, with rear projection 
and higher ambient illumination, the 
displayed symbols clearly lay on a single
surface, and no depth effects were 


apparent. 




The slight but regular increase in 
search time, when the target color was 
known, as more and more numbers of 
other colors were added to the display 
(the lower curve in Fig. 3), also proved 
to be statistically reliable. The data 
involved in this demonstration consist of 
the search times for each individual S, 
summed over target color and display 
background as before, for displays of one, 
two, three, four, and five colors, with 
display densities of 20, 40, 60, 80, and 
100 items, respectively, under conditions 
where the target color was known in 
advance. As in the previous analysis, 
only half of the available data for the 
two-colored, 40-item displays and the 
three-colored, 60-item displays was used. 
This represents a factorial combination 
of 12 Ss by 5 display densities (each 
representing a different number of dis-
played colors), for a total of 60 measures.
Analysis of variance confirmed a reliable
difference among the 5 display types at
P < .01 (F = 4.37; df = 4/44). This
confirms the conclusion of Green and 
Anderson, based on a similar analysis, 
that wrong-colored items can be almost 
completely ignored in visual search— 
almost, but not quite. 
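
An analysis of this form, with each S contributing one score under each display type and the Ss X Conditions interaction serving as the error term, can be sketched as follows. This is a generic textbook computation written for illustration, not the program used in the study; the function name is hypothetical, and the small table at the end contains made-up numbers chosen so the result can be checked by hand.

```python
def repeated_measures_f(table):
    """F ratio for conditions in an Ss-by-conditions table (one score per
    cell), tested against the Ss X Conditions interaction mean square.
    Returns (F, df_conditions, df_error)."""
    n = len(table)            # number of Ss
    k = len(table[0])         # number of conditions
    grand = sum(sum(row) for row in table) / (n * k)
    subj_means = [sum(row) / k for row in table]
    cond_means = [sum(row[c] for row in table) / n for c in range(k)]

    ss_total = sum((x - grand) ** 2 for row in table for x in row)
    ss_subj = k * sum((m - grand) ** 2 for m in subj_means)
    ss_cond = n * sum((m - grand) ** 2 for m in cond_means)
    ss_error = ss_total - ss_subj - ss_cond

    df_cond = k - 1
    df_error = (n - 1) * (k - 1)
    f_ratio = (ss_cond / df_cond) / (ss_error / df_error)
    return f_ratio, df_cond, df_error


# Hand-checkable toy table: 2 Ss by 2 conditions (made-up values).
toy_f, toy_df_cond, toy_df_error = repeated_measures_f([[1.0, 2.0],
                                                        [2.0, 4.0]])
```

With 12 Ss and 5 display types the same function yields 4 and 44 degrees of freedom, the values reported above.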


SUMMARY 


Twelve Ss each viewed a series of 300 
displays, which varied in display density, in 
number of colors used, in the particular color
of the target, with either a white or black 
background, under conditions where S either 
knew the color of the target in advance, or 
did not. 

Neither the particular color of the target 
nor the display background had any signi- 
ficant effect on search time. Search time 
increased regularly with increasing display
density. For multicolored displays, when 
the color of the target was known in advance, 
search times were considerably shorter than 
when the target color was unknown. When
the color of the target was unknown, search
times were not significantly different from
those for single-colored displays. 


REFERENCES 


CHAPANIS, A., & HALSEY, R. M. Absolute
judgments of spectrum colors. J. Psychol.,
1956, 42, 99-103.

CHRISTNER, C. A., & RAY, H. W. An evalua-
tion of the effect of selected combinations
of target and background coding on map-
reading performance: Experiment V. Hum.
Factors, 1961, 3, 131-146.

CONOVER, D. W. The amount of information
in the absolute judgment of Munsell hues.
USAF WADC tech. Note, 1959, No. 58-262.

CONOVER, D. W., & KRAFT, C. L. The use
of color in coding displays. USAF WADC
tech. Rep., 1958, No. 55-471.

EDWARDS, A. L. Experimental design in
psychological research. New York: Rine-
hart, 1950.

ERIKSEN, C. W., & HAKE, H. W. Multi-
dimensional stimulus differences and ac-
curacy of discrimination. J. exp. Psychol.,
1955, 50, 153-160.

GREEN, B. F., & ANDERSON, L. K. Color
coding in a visual search task. J. exp.
Psychol., 1956, 51, 19-24.

GREEN, B. F., McGILL, W. J., & JENKINS,
H. M. The time required to search for num-
bers on large visual displays. Technical Re-
port No. 36, 1953, Massachusetts Institute
of Technology, Lincoln Laboratory.

HALSEY, R. M., & CHAPANIS, A. On the
number of absolutely identifiable hues.
J. Opt. Soc. Amer., 1951, 41, 1057-1058.

HITT, W. D. An evaluation of five different
abstract coding methods: Experiment IV.
Hum. Factors, 1961, 3, 120-130.


(Early publication received April 24, 1962) 


Journal of Experimental Psychology
1962, Vol. 64, No. 5, 441-[...]


RESISTANCE TO EXTINCTION WHEN PARTIAL 
REINFORCEMENT IS FOLLOWED BY 
REGULAR REINFORCEMENT 
HERBERT M. JENKINS 
Bell Telephone Laboratories, Incorporated, Murray Hill, New Jersey


The discrimination hypothesis 
(Humphreys, 1940; Mowrer & Jones, 
1945) ascribes the partial reinforce- 
ment effect (PRE) to the relative ease 
of discriminating an abrupt transition 
to extinction following regular rein- 
forcement as compared to the more 
gradual transition following partial 
reinforcement. 

If the recent conditions of rein- 
forcement provide the basis of the 
discrimination, then the PRE should 
be decreased by the interpolation of 
regular reinforcement prior to extinc- 
tion since this would make the transi- 
tion between recent training and ex- 
tinction abrupt. This was examined 
for the case of the free operant re- 
sponse by Keller (1940), by Likely 
(1958), and by Quatermain and 
Vaughan (1961). Although in every 
case the results were that the inter- 
polation of regular reinforcement 
failed to reduce resistance to extinc- 
tion, no firm conclusions can be drawn 


[...] demonstrated a clear [...]
versus regular reinforcement taken
separately. Theios [...]
this deficiency in a runway experi-
ment. The PRE was not significantly
altered by the interpolation [...]
regular reinforcements, [...]
still clearly present, although signifi-
cantly reduced, after 70 regular rein-
forcements.

The present experiments use the
pigeon’s key peck response with a
discrete trial procedure. They were
carried out independently of Theios’
work and afford some of the same
comparisons. In Exp. I, the location
of a period of partial reinforcement
within a longer regime of training
under regular reinforcement was
varied so that the effect of the amount
of regular reinforcement which oc-
curred between partial reinforcement
and extinction could be observed
where the total amount of training
was the same. A baseline was
provided by a control group which
received all regular reinforcement.
The results suggested that the inter- 
polation of regular reinforcement prior 
to extinction may increase resistance 
to extinction over the level obtained 
when extinction occurs directly fol- 
lowing partial reinforcement. Ex- 
periment II was designed to sub- 
stantiate this finding, to further define 
the conditions under which it occurs, 
and to remove certain ambiguities in 
the interpretation of Exp. I. 


METHOD 


Subjects.—The Ss were 5-6 yr. old, male
White Carneaux pigeons [...]
by restricted feeding at 80% of their free-
feeding body weight.

Apparatus.—An automatic key pecking
apparatus of Skinner design was used. De- 
tails of the apparatus have been reported 
previously (Jenkins, 1961). Reinforcement 
was a 4-sec. period of access to a tray of mixed 
grain signaled by lighting the opening to the 
tray. 

Trials.—A trial was begun by lighting the 
translucent plastic response key. A trial
was terminated (key light turned off) by a
single response, or by external control at the
end of 5 sec., whichever occurred first. On 
reinforced trials, the tray operation followed 
the response without delay. During the
interval between trials the key light was off,
but S's compartment remained illuminated.
The time between onsets of successive trials
was equally often 15, 30, or 45 sec. in an
[...] A response made between
[...] training.

Sessions.—The experiments involved pre-
liminary training, training, and extinction.
Preliminary training consisted of four sessions
[...] of partial reinforcement consisted of 40 rein-
forced trials plus a number of nonreinforced
trials programed in a random sequence subject
to the constraint that each half of the se-
quence contained half of the reinforced trials
(20) and half of the nonreinforced trials.
Extinction sessions were run on consecutive
days and each consisted of 40 nonreinforced
trials. By prior decision, all comparisons
among groups were on the first 10
extinction sessions. Additional sessions were
run in the case of most groups in order to
more nearly complete extinction.
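
The constraint on the sequence of trials within a session of partial reinforcement can be sketched in code. This is an assumed implementation for illustration only: the function name and the use of Python's `random` module are hypothetical, and the original programing equipment is not described in the text.

```python
import random


def partial_reinforcement_session(n_nonreinforced, rng, n_reinforced=40):
    """Return a list of booleans (True = reinforced trial) in random order,
    with each half of the sequence holding half of the reinforced trials
    and half of the nonreinforced trials."""
    if n_reinforced % 2 or n_nonreinforced % 2:
        raise ValueError("trial counts must be even to split into halves")
    sequence = []
    for _ in range(2):                       # build each half separately
        half = ([True] * (n_reinforced // 2)
                + [False] * (n_nonreinforced // 2))
        rng.shuffle(half)
        sequence.extend(half)
    return sequence


session = partial_reinforcement_session(60, random.Random(2))
```

Shuffling each half separately satisfies the stated constraint by construction while leaving the order within a half random.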

Treatment groups, Exp. I.—The four
groups of Exp. I each received 13 sessions of
training beyond preliminary training. Group
13R received 13 sessions of regular reinforce-
ment. The remaining three groups received
partial reinforcement for a block of 3 con-
secutive sessions located in different positions
within the sequence of training sessions.
These groups are designated 3P-10R (3 ses-
sions of partial reinforcement followed by 10
sessions of regular reinforcement), 9R-3P-1R,
and 10R-3P. Forty nonreinforced trials were
programed in the first session of partial
reinforcement, 60 in the second, and 80 in the
third.


Five Ss were assigned to each group so as
to match the groups as well as possible on the 
basis of the means and variance of the latency 
of response during preliminary training. 

Treatment groups, Exp. II.—Three groups
received different numbers of training sessions 
under partial reinforcement: Groups 8P 
(N = 9), 20P (N = 7), and 32P (N = 7). 
Two groups received different amounts of 
training under regular reinforcement: Groups 
8R (N = 6) and 20R (N = 6). Finally, two 
groups received different amounts of training 
under partial reinforcement prior to 12 
sessions of regular reinforcement: Groups 8P- 
12R (N = 7) and 20P-12R (N = 6). The 
reasons for selecting these conditions will 
become clearer after the results of Exp. I are 
reported, but certain features may be noted 
here. Partial reinforcement was begun at the 
same point in training (directly following pre- 
liminary training), rather than at different 
points as in Exp. I. The amount of training 
was extended in order to obtain a clear 
separation in resistance to extinction between
groups given partial reinforcement as against
those given regular reinforcement. The de-
sign also allows comparison of the effect on
extinction of adding, after different amounts 
of training under partial reinforcement, either 
more partial reinforcement, or an equal 
number of sessions of regular reinforcement. 

The groups in Exp. II were run in the fol-
lowing order: Groups 8P and 8R concur-
rently, then Groups 20P, 20R, and 8P-12R 
concurrently, then Groups 32P and 20P-12R 
concurrently. Groups run at the same time 
were matched on the basis of performance in 
preliminary training as in Exp. I.

In each of the first two sessions of partial
reinforcement, 40 nonreinforced trials were
programed. Thereafter, each session of
partial reinforcement contained 80 nonrein-
forced trials.

Additional sessions of extinction.—All
groups in Exp. I and Groups 20R, 20P, and
8P-12R of Exp. II received 2 additional
sessions of extinction. Groups 20P-12R and
[...] the groups as [...] determined
[...] was examined and is reported below.

RESULTS


Responses in [...] proportion of trials [...]
training and .985 for training.
There were no systematic differences in this
measure among groups or between
the periods of partial reinforcement
and regular reinforcement within
groups.

Shape of extinction curves.—The
discrete trial method of the present
experiments yields extinction curves
which typically show a sharp drop
from a high to a low level of respond-
ing [...]


FIG. 1. Responses in extinction for individual Ss of Groups 20P and 20R.


FIG. 2. The average slopes of extinction curves for the groups of Exp. II.
(The criterion sessions A and B are explained in the text. The bars represent
one SD of the location of Sessions A and B, or one-half SD to either side of
the mean. The numbers in parentheses are one SD of the number of sessions
between Sessions A and B. Ordinate: mean number of responses; abscissa:
extinction sessions.)

criterion sessions A and B. On the 
ordinate is plotted the mean number 
of responses in the criterion sessions. 
The slope of the lines connecting A 
with B thus represents, on the aver- 
age, the abruptness of the extinction 
curves. The slopes are very similar. 
The SD of the location of the criterion 
sessions increases as the mean number 
of extinction sessions prior to the 
“break point" increases, a trend 
which is correlated with Partial rein- 
forcement in training. On the other 
hand, variability in the number of 
sessions between A and B shows no 
systematic differences among the 
groups. 

Since the shapes of the extinction 
curves are similar for the several 
groups, the major effects of the train- 
ing variables can be represented con- 
veniently by the mean number of 
responses in extinction, 


(The criterion 


The bars represent one SD of the location of 
of the mean. 
ions A and B.) 


The numbers in parentheses are 


Responses in extinction, Exp. I.—
Summary data for responses in 10
sessions of extinction are given for
Exp. I and II in Table 1.¹ On the
assumption that PRE is reduced as a 
function of the amount of regular 
reinforcement which is interpolated 
prior to extinction, the groups of 
Exp. I would be ordered on the basis 
of number of responses in extinction 
as follows: 10R-3P > 9R-3P-1R 
> 3P-10R > 13R. However, the 
groups which received partial rein- 
forcement actually show the reverse 

¹ The inclusion of responses which occurred
in sessions of extinction beyond the tenth
session resulted in only minor increases over
the totals given in Table 1 except in the case
of Groups 20P and 32P, where the totals
increase about 10%. Statistical tests involving
Groups 20P and 32P were recomputed
using the total of responses for all extinction
sessions. The significance levels in each case
remained unchanged from those obtained on
the basis of the first 10 sessions of extinction.


RESISTANCE TO EXTINCTION 


ordering, i.e., 10R-3P < 9R-3P-1R
< 3P-10R. An analysis of variance 
for these groups yielded an F = 3.40 
with 2/12 df, and P < .10. The mean 
difference between the extreme groups 
was 78.4 responses (t = 2.60; df = 8, 
P < .05). Group 13R, which received
no partial reinforcement, had
the lowest mean number of responses 
in extinction. However, meaningful 
statistical comparisons of this group 
with the others are difficult to make 
since 1 S in Group 13R resumed 
responding late in extinction and thus 
made a total number of responses 
exceeded by only 2 other Ss in the 
entire experiment. The remaining 4 
Ss in Group 13R had the four lowest 
totals in the experiment. 

The following hypothesis was for- 
mulated from the outcome of Exp. I 
and tested in Exp. II: The interpolation
of regular reinforcement between
partial reinforcement and extinction 
does not reduce the PRE, but in fact 
increases resistance to extinction over 
that which obtains when extinction 
follows partial reinforcement directly. 
Experiment I falls short of establish- 
ing this point on two counts. First, 
a clear PRE was not obtained when 
extinction followed partial reinforcement
directly (i.e., Groups 10R-3P
and 13R were not well separated).
In order to obtain a clear PRE against 
which to evaluate the effect of inter- 
polated regular reinforcement, the 
amount of training under regular or 
partial reinforcement was extended in 
Exp. II. Second, although the hy- 
pothesis ascribes the greater resistance 
to extinction in Group 3P-10R com- 
pared to Group 10R-3P to the regular 
reinforcement which followed partial 


partial 

partial reinforcement was introduced 
at the same point in training in order 
to avoid this ambiguity. 

Responses in extinction, Exp. II.—
A comparison of Groups 8P, 20P, and
32P of Exp. II shows that resistance
to extinction was a function of the
amount of training under partial
reinforcement (F = 12.18; df = 2/12,
P < .001). The extension of training


P < .001), but no further increase
resulted from the further extension of 
training from 20 to 32 sessions 
(t < 1). 


TABLE 1

MEANS AND SDs OF RESPONSES IN 10 SESSIONS OF EXTINCTION: EXP. I AND II

Experiment I

Group        Mean     SD
9R-3P-1R
3P-10R

Experiment II

Group        Mean     SD
8R           114.5    33.2
20R          103.7    15.1
8P           164.3    63.9
20P          294.9    54.9
32P          298.7    54.9
8P-12R       247.4    34.7
20P-12R      250.2    48.3


A comparison of Groups 8R and 
20R shows, on the other hand, that 
when training was under all regular 
reinforcement, resistance to extinction 
was unchanged by the extension of 
training from 8 to 20 sessions (t < 1). 

A clear PRE emerged only after 
extended training. Groups 8R and 8P 
did not differ significantly (t = 1.64; 
df = 13, P < .20) while Groups 20R 
and 20P were clearly different 
(t = 7.60; df = 11, P < .001). 

Consider next Group 8P-12R which 
was switched from partial to regular 
reinforcement prior to extinction. 
Resistance to extinction for this 
group was significantly greater than 
for Group 8P (t = 2.91; df = 14, 
P < .02) showing that the addition of 
regular reinforcement increased re- 
sistance to extinction. The increase 
obtained from adding 12 sessions of 
regular reinforcement to 8 sessions of 
partial reinforcement was less, but 
not significantly less, than the increase 
resulting from the addition of 12 
sessions of partial reinforcement, i.e., 
the means for the number of responses 
in extinction in Groups 8P-12R and
20P could not be reliably distinguished
(t = 1.79; df = 12, P = .10).

When 20 sessions of partial rein- 
forcement were given, the addition of 
12 sessions of regular reinforcement 
(Group 20P-12R) produced no fur- 
ther increase in resistance to extinc- 
tion. The mean number of responses 
to extinction was in fact less, although 
not significantly less, in Group 20P- 
12R than in either Groups 20P or 32P 
(20P-12R vs. 20P: t = 1.42, df = 11,
P < .20; 20P-12R vs. 32P: t = 1.55,
df = 11, P < .20).
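The t tests reported above can be approximated from the summary data in Table 1. The following is a minimal sketch and not the author's computation. Two things in it are assumptions: the group sizes, which are not tabled and are inferred here from the reported df together with the 9 Ss noted for Group 8P, and the divisor used for the tabled SDs, which is taken to be n rather than n - 1. The name `pooled_t` is illustrative.

```python
import math

# Means and SDs are from Table 1 (Exp. II). Group sizes are NOT tabled; they are
# inferred from the reported df values and the 9 Ss mentioned for Group 8P.
GROUPS = {
    "8R": (114.5, 33.2, 6),
    "8P": (164.3, 63.9, 9),
    "20R": (103.7, 15.1, 6),
    "20P": (294.9, 54.9, 7),
    "8P-12R": (247.4, 34.7, 7),
}


def pooled_t(group_a, group_b):
    """Return (t, df) for two independent groups using a pooled error variance."""
    mean_a, sd_a, n_a = GROUPS[group_a]
    mean_b, sd_b, n_b = GROUPS[group_b]
    # Assumption: each tabled SD was computed with divisor n, so n * SD**2
    # recovers the group's sum of squared deviations.
    sum_sq = n_a * sd_a ** 2 + n_b * sd_b ** 2
    df = n_a + n_b - 2
    std_error = math.sqrt((sum_sq / df) * (1 / n_a + 1 / n_b))
    return (mean_a - mean_b) / std_error, df


for a, b in [("8P", "8R"), ("20P", "20R"), ("8P-12R", "8P"), ("20P", "8P-12R")]:
    t, df = pooled_t(a, b)
    print(f"{a} vs. {b}: t = {t:.2f}, df = {df}")
```

Under these assumptions the computed values fall within rounding error of the t and df values given in the text, which suggests, but does not establish, that the comparisons were ordinary pooled-variance t tests on pairs of groups.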

The results of Exp. I and II are 
brought together in Fig. 3 which plots 
the mean number of responses in ex- 
tinction as a function of the amount of 
training under regular reinforcement, 
the amount of training under partial 




HERBERT M. JENKINS 


reinforcement, and for the mixed 
conditions in which partial reinforce- 
ment is followed by regular reinforce- 
ment. Experiment I has been co- 
ordinated to this plot by ignoring 
differences in the amount of regular 
reinforcement which preceded the 
introduction of partial reinforcement. 
When treated in this way, the out- 
come of Exp. I falls into line quite 
well with that of Exp. II, so that it 
seems reasonable to believe that varia- 
tions in the amount of training under 
regular reinforcement preceding the 
introduction of partial reinforcement 
made little difference. 

The independent variables in Fig. 3 
are the number of training sessions of 
regular or partial reinforcement be- 
yond the sessions of preliminary 
training. Thus, the dotted lines 
extrapolated to zero sessions of train- 
ing represent a guess that preliminary 
training alone would produce a resist- 
ance to extinction equal to that ob- 
tained after eight additional sessions 
of regular reinforcement. 

Relation of latency in training to 
responses in extinction—The amount 
and type of training had only minor 
effects on the mean latency of response 
for groups. There were in fact no 
significant differences in the mean 
latencies of response taken over the 
last three sessions of training in either 
experiment. The overall mean la- 
tency for these sessions was 1.4 sec. 
However, it was possible to detect in 
Exp. II a small effect of partial 
reinforcement when compared to regu- 
lar reinforcement for the first eight 
training sessions. The mean latency 
for partially reinforced Ss (N = 36) 
was .2 sec. longer than for regularly 
reinforced Ss (N = 12). A test of this 
difference using a covariance analysis 
with latency during preliminary training
as the predictor yielded t = 2.34;
df = 45, P < .05. The switch from


partial to regular reinforcement
caused no significant change in latency
in either experiment.

Fig. 3. Mean number of responses in extinction for all groups
(Exp. I and II) as function of the amount and type of training
beyond preliminary training. (Sessions of regular training
preceding partial training in Exp. I have been disregarded and
are shown in parentheses [see text].) Axis labels: sessions of
regular reinforcement; responses in extinction.

From the overall results it is apparent
that the effects of the amount
and type of training on resistance to
extinction were not paralleled by
effects on group mean latencies of
response in training.

The correlation between latencies
for individuals within groups and
responses to extinction is more interesting
since it provides a clue as to


how the addition of training under 
regular reinforcement after a limited 
amount of training under partial 


reinforcement operates to increase 
resistance to extinction. A sub- 


stantial negative correlation (longer 
latencies associated with fewer re- 
sponses in extinction) was found in the 
case of two groups: 10R-3P (Exp. I)
and 8P (Exp. II). These were the
only groups which went directly to 
extinction after a training period 
under partial reinforcement that was 




not sufficiently extensive to produce a 
maximum resistance to extinction. 
In Group 10R-3P the correlation was
-.79, and in Group 8P it was also
-.79. In the first case the correlation
was based on 5 Ss and was not
significant, while in the latter case it
was based on 9 Ss and was significant
(df = 7, P < .02). In all
other groups, the correlations were 
low and did not approach significance. 
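The contrast between the two correlations of -.79 follows from the number of Ss alone. As a sketch, assuming the usual t test of a Pearson correlation against zero (the text does not name the test used), the same r gives quite different t values at n = 5 and n = 9. The critical values in the code are standard two-tailed tabled values; the function name is illustrative.

```python
import math


def t_from_r(r, n):
    """t statistic (df = n - 2) for testing a Pearson correlation against zero."""
    df = n - 2
    return r * math.sqrt(df) / math.sqrt(1 - r ** 2), df


# Standard two-tailed .05 critical values of t for the two df needed here.
CRITICAL_T_05 = {3: 3.182, 7: 2.365}

for group, n in [("10R-3P", 5), ("8P", 9)]:
    t, df = t_from_r(-0.79, n)
    verdict = "significant" if abs(t) > CRITICAL_T_05[df] else "not significant"
    print(f"Group {group}: t = {t:.2f}, df = {df}, {verdict} at .05")
```

With 5 Ss the statistic falls short of the .05 criterion, and with 9 Ss it exceeds it, in agreement with the pattern described above.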

The pattern of correlations reflects 
the following state of affairs. When 
extinction was begun immediately 
after a relatively short period of 
training under partial reinforcement, 
slow responders extinguished more 
rapidly than fast responders. When, 
on the other hand, a period of regular 
reinforcement was added, resistance 
to extinction for slow responders was 
at the same average level as for fast 
responders. It is important to note 
that the effect of adding regular 
reinforcement was not to alter the 
mean latency of response, but rather 
to remove the correlation between 
latency and resistance to extinction. 
The extension of training under 
partial reinforcement also removed 
the correlation in the same way. 


DISCUSSION 


The abruptness of the transition from 
training to extinction is obviously not the 
critical functional difference between 
regular and partial reinforcement. When 
extensive training is first given under 
partial reinforcement, its effect persists 
through a period of training under 
regular reinforcement involving 480 rein- 
forcements. 

Although 480 reinforcements is a large 
number, it is still a small fraction of the 
2,320 training trials under partial rein- 
forcement which were used in Group 
20P in order to obtain a clear PRE. It 
therefore seems possible that the in- 
significantly lower resistance to extinc- 
tion for Group 20P-12R in comparison 




with that of Group 20P or 32P would 
become a reliable difference with more 
extensive regular training. The point is 
important in connection with one inter- 
pretation of how overtraining in a dis- 
crimination task facilitates the learning 
of a reversed discrimination (Birch, Ison, 
& Sperling, 1960; Capaldi & Stevenson, 
1957).
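The two trial counts compared above can be checked against the training schedule. This sketch assumes 40 reinforced trials per session, a figure inferred from 480 reinforcements in 12 sessions of regular training rather than quoted from the Method; the nonreinforced counts (40 in each of the first two partial sessions, 80 thereafter) are as stated earlier.

```python
REINFORCED_PER_SESSION = 40  # assumption: inferred from 480 reinforcements / 12 sessions


def partial_training_trials(n_sessions):
    """Total trials (reinforced plus nonreinforced) in n_sessions of partial training."""
    total = 0
    for session in range(1, n_sessions + 1):
        # 40 nonreinforced trials in each of the first two sessions, 80 thereafter.
        nonreinforced = 40 if session <= 2 else 80
        total += REINFORCED_PER_SESSION + nonreinforced
    return total


regular_reinforcements = 12 * REINFORCED_PER_SESSION
group_20p_trials = partial_training_trials(20)
print(regular_reinforcements, group_20p_trials)
print(round(regular_reinforcements / group_20p_trials, 2))
```

On these figures the interpolated regular training amounts to roughly one-fifth of the trials given to Group 20P.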

Interpolating a period of regular rein- 
forcement between partial reinforcement 
and extinction can increase resistance to 
extinction over the level which obtains 
when extinction follows partial rein- 
forcement directly. The results suggest 
the rule that added training under regular 
reinforcement increases resistance to ex- 
tinction only if added training under 
partial reinforcement would also increase 
resistance to extinction. It appears that 
Ss with relatively long latencies of re- 
sponse during the first segment of partial 
training gain the most in resistance 
to extinction from subsequent rein- 
forcement. 

This unanticipated result requires a 
reformulation. Instead of thinking of 
the partial training as producing a PRE 
which might or might not be attenuated 
by subsequent regular reinforcement, it 
now appears that the PRE can be gener- 
ated by the training conditions in Groups 
3P-10R and 8P-12R. The present interpretation
of the PRE in these groups is that
it arose from an interaction of the effects 
of the nonreinforced trials during the 
partial training with the subsequent 
reinforcement during regular training. 
It is clear that the effect depends upon 
an interaction since without a prior 
exposure to nonreinforced trials, resist- 
ance to extinction was unchanged by the 
extension of training under regular rein- 
forcement. Amsel’s (1958) account of 
the PRE, which holds that the combina- 
tion of conditioned frustration due to 
nonreinforcement and subsequent rein- 
forcement develops a tolerance for non- 
reinforcement, provides one conjecture 
as to how this interaction is mediated.²


² Results on the occurrence of ITRs suggest
that frustration was associated with extinction.
During training, the overall mean




Other results (Weinstock, 1954, 1958)
indicate that some form of interaction
between nonreinforcement and reinforcement
can occur across long time
intervals (at least 24 hr. separated the 
partial from the regular training in the 
present experiments), and even when the 
reinforcements occur consecutively in a 
block (Jensen & Cotton, 1960; Lauer & 
Carterette, 1957) as they did in the 
regular training of the present ex- 
periments. 

The present results agree with those 
of Keller (1940), Likely (1958), Quater- 
main and Vaughan (1961), and more 
specifically with Theios (1962), in demonstrating
the persistence of the PRE
through a period of regularly reinforced 
training. Further, Theios’ finding of a 
small but significant reduction in the 
PRE for the group which received more 
extensive regular reinforcement (Group 
P-70) is paralleled by the present finding 
of a lessened resistance to extinction in 
Group 20P-12R. 

The fact of increased resistance to 
extinction for the partial-regular training 
sequence is a departure, although in- 
dications of it can be found in previous 
results. In particular, Theios obtained a 
small increase in resistance to extinction 
(not significant) when 25 regular rein- 
forcements were added to previous 
partial training. Theios considered the
possibility that it resulted from the
regular reinforcement, but rejected it on 
the grounds that no increase occurred 
when regular reinforcement was added 
to previous regular training. However, 
the present results show that when the 
effect is obtained, it arises from an inter- 
action of partial and regular reinforce- 
ment; a possibility which is overlooked 
in Theios’ argument. 

Two implications of the present results 
for theory may be noted. First, the 
increased resistance to extinction pro- 


frequency of ITRs was .5 per 100 trials. In 
extinction it increased in every group yielding 
an overall mean of 2.2 per 100 trials. The 
mean frequency of ITRs showed a peak value 
in the vicinity of the decline of responses in


extinction. 




duced by the partial-regular sequence 
poses a problem for the dissonance 
theory of the PRE (Lawrence & Fest- 
inger, 1962) since that theory holds that 
the processes producing the increased 
resistance to extinction occur primarily 
on the nonreinforced trial itself. Second, 
the remarkable degree of independence 
between the slope of the extinction curves 
and the total number of responses in ex- 
tinction indicates that something like a 
threshold model of extinction is prefer- 
able, at least for conditioning procedures 
of the type used here, to a decremental 
model in which an initial probability of 
response is reduced on each trial by an 
amount proportional to the number of 
responses remaining. Models of the 
latter type lead to exponential decays in 
which the slope of the curve is correlated 
with the total number of responses in 
the curve. 
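The link between slope and total responses in a decremental model can be illustrated numerically. The sketch below uses a simple geometric version, in which response probability loses a fixed fraction `a` of its current value on each trial; this is an illustrative stand-in and not a model taken from the paper. In such a model the expected number of responses remaining is proportional to the current probability, so the decrement on each trial is proportional to the responses remaining.

```python
def decremental_curve(p0, a, n_trials):
    """Response probability per trial when p loses the fixed fraction `a` each trial."""
    return [p0 * (1 - a) ** t for t in range(n_trials)]


def summarize(p0, a, n_trials=5000):
    """Return (expected total responses, trials taken to fall from 90% to 10% of p0)."""
    curve = decremental_curve(p0, a, n_trials)
    total = sum(curve)
    first_below_90 = next(t for t, p in enumerate(curve) if p <= 0.9 * p0)
    first_below_10 = next(t for t, p in enumerate(curve) if p <= 0.1 * p0)
    return total, first_below_10 - first_below_90


for a in (0.05, 0.01, 0.002):
    total, drop_trials = summarize(p0=1.0, a=a)
    print(f"a = {a}: expected responses = {total:.0f}, 90%-to-10% drop = {drop_trials} trials")
```

With the starting probability held fixed, a larger total number of responses always goes with a more gradual decline. The data reported here show the opposite: the abruptness of the drop was about the same in groups that differed widely in total responses.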


SUMMARY 


A food reinforced, key peck response in the 
pigeon was used with a discrete trial pro- 
cedure to study resistance to extinction 
(number of responses in 10 40-trial sessions) 
following different amounts of training of 
three types: partial reinforcement, regular 
reinforcement, and partial followed by regular 
reinforcement. 

Resistance to extinction showed no sys- 
tematic change as a function of the amount 
of regularly reinforced training. On the other 
hand, it increased as a function of the amount 
of partial training up to 20 sessions and then 
leveled off at a value which clearly demon- 
strated the PRE. Resistance to extinction 
after training under the partial-regular 
sequence was never significantly lower than 
for partial alone. In fact, the addition of 
regular reinforcement increased resistance to 
extinction when the amount of prior training 
under partial was not sufficiently extensive to 
produce a maximum. 

From these results and those of previous 
experiments the conclusion is well established 
that the abruptness of the local transition 
between training and extinction is not 
critically involved in the PRE. 


REFERENCES 


AMSEL, A. The role of frustrative nonreward
in noncontinuous reward situations. Psychol.
Bull., 1958, 55, 102-119.

BIRCH, D., ISON, J. R., & SPERLING, S. E.
Reversal learning under single stimulus
presentation. J. exp. Psychol., 1960, 60,
36-40.

CAPALDI, E. J., & STEVENSON, H. W. Response
reversal following different amounts
of training. J. comp. physiol. Psychol.,
1957, 50, 195-198.

HUMPHREYS, L. G. Extinction of conditioned
psychogalvanic responses following
two conditions of reinforcement. J. exp.
Psychol., 1940, 27, 71-76.

JENKINS, H. M. The effect of discrimination
training on extinction. J. exp. Psychol.,
1961, 61, 111-121.

JENSEN, G. D., & COTTON, J. W. Successive
acquisitions and extinctions as related to
percentage of reinforcement. J. exp.
Psychol., 1960, 60, 41-49.

KELLER, F. S. The effect of sequence of continuous
and periodic reinforcement upon
the "reflex reserve." J. exp. Psychol., 1940,
27, 550-565.

LAUER, D. W., & CARTERETTE, T. S.
Changes in response measures over repeated
acquisitions and extinctions of a
running habit. J. comp. physiol. Psychol.,
1957, 50, 334-338.

LAWRENCE, D. H., & FESTINGER, L. Deterrents
and reinforcements. Stanford: Stanford
Univer. Press, 1962.

LIKELY, F. A. Relative resistance to extinction
of aperiodic and continuous reinforcement
separately and in combination.
J. gen. Psychol., 1958, 58, 165-187.

MOWRER, O. H., & JONES, H. M. Habit
strength as a function of the pattern of
reinforcement. J. exp. Psychol., 1945,
34, 293-311.

QUATERMAIN, D., & VAUGHAN, G. M. Effect
of interpolating continuous reinforcement
between partial training and extinction.
Psychol. Rep., 1961, 8, 235-237.

THEIOS, J. M. The partial reinforcement
effect sustained through blocks of continuous
reinforcement. J. exp. Psychol.,
1962, 64, 1-6.

WEINSTOCK, S. Resistance to extinction of a
running response following partial reinforcement
under widely spaced trials.
J. comp. physiol. Psychol., 1954, 48, 318-322.

WEINSTOCK, S. Acquisition and extinction
for a partially reinforced running response
at a 24-hr. inter-trial interval. J. exp.
Psychol., 1958, 56, 151-158.


(Early publication received May 7, 1962) 




Journal of Experimental Psychology 
1962, Vol. 64, No. 5, 451-459 


PARTIAL REINFORCEMENT, CONTINUOUS REIN- 
FORCEMENT, AND REINFORCEMENT 
SHIFT EFFECTS ¹


STEWART H. HULSE 


Johns Hopkins University 


Hulse (1962) has shown that partial 
reinforcement (PRF) of running in a 
straight alley produces sharp dis- 
crimination of reward stimuli in the 
goal box, but continuous reinforce- 
ment (CRF) does not. Partially rein- 
forced rats learned to lick from a 
drinking tube much faster than con- 
tinuously reinforced rats, and the 
number of licks they emitted on non- 
reinforced running trials rapidly de- 
creased to a very low level. Taken as 
a whole, the data indicated that 
behavior in the goal box was much 
more critically focused on the reward 
and its stimulus properties if PRF as 
compared with CRF were used. 

There is an important implication 
of these experimental results. During 
the course of training with PRF, 
changes in the stimulus properties of 
the reward ought to be sharply discriminated.
For example, on the
assumptions that sweetness is a valid 
stimulus continuum for a reinforcing 
substance (Guttman, 1953) and response
rate is positively correlated
with sweetness, shifts from low to high 
concentrations of a saccharin rein- 
forcer ought to produce prompt and 
direct increases in response rate. 
Discrimination of a stimulus change 
of this sort, and appropriate changes 
in response rate, ought to occur to a 
much lesser degree, if at all, with 
CRF. This would be true since CRF


1 The research reported in this paper was 
supported by National Science Foundation 
Research Grants G8712 and G18125. The 
author is indebted to M. A. Berkley and E. ye 
Appelman for collecting the data. 


does not appear to bring behavior 
under the stimulus control of the 
reinforcer to the extent that PRF 
does. 

The data reported here constitute 
a test of this implication. In one 
experiment, rats were conditioned on 
low and high concentrations of sac- 
charin with PRF and CRF. Then, 
for half the rats, reinforcement con- 
centrations were switched from low to 
high or from high to low. In a second
experiment, essentially the same pro- 
cedure was used, except that rats 
were switched only from low to high 
concentrations, and more extreme 
differences in concentration were used. 
Instrumentally conditioned licking 
was used as a response. This response 
is particularly appropriate, since the 
tongue is intimately involved in the 
sensory dimension of taste, and since 
licking rates vary quite consistently 
with the concentration of a sweet 
reinforcing stimulus (Hulse & Bacon, 
1962). 

EXPERIMENT I 

Method 

Subjects.—The Ss were 64 naive male
albino rats, 85 days of age, of the Sprague- 
Dawley strain obtained from Sprague- 
Dawley, Incorporated, Madison, Wisconsin. 
The Ss had no experience drinking from tubes 
prior to their use in the experiment. 

Apparatus and procedure.—The apparatus 
and the general procedure for treating licking 
as an instrumental response have been de- 
scribed in detail elsewhere (Hulse, 1960; 
Hulse, Snyder, & Bacon, 1960). In brief, the 
apparatus consisted of a 6.5 × 10 × 7 in.
wooden box which was suitably lighted,
ventilated, and sound-shielded. A piece of
Lin, Plexiglas was located in front of a hole 




in one wall of the box. The Plexiglas had a 
1 × 2 cm. vertical slot cut into it. The S had
access through the slot to a drinking tube. 

Each lick on the drinking tube operated an 
electronic relay which operated, in turn, a 
programing circuit. Liquid reinforcements of 
a specified volume were delivered to S through 
the tube by means of an infusion pump 
operated by the programmer. For partial 
schedules of reinforcement, the programmer 
also operated a solenoid-driven lever. The 
lever slightly squeezed the pressure tubing 
leading from the pump to the drinking tube. 
After the reinforcement was delivered, the 
tubing returned to its normal shape and 
produced a slight negative pressure in the 
fluid system. The negative pressure drew the
fluid column about 1 mm. inside the tip of the 
drinking tube. This assured that S could not 
make contact with the fluid on nonreinforced 
licks. 

Two drinking tubes were used in the experiment.
During the first 4 days of training, a
brass tube with a 2-mm. fluid hole was used. 
On Day 1, this tube projected through the 
slot in the experimental box; on Days 2 to 4, 
the tube was gradually withdrawn through 
the slot. On Day 5, a plastic nipple with a 
2-mm. fluid hole was introduced behind the 
slot and used for the rest of the experiment.
This tube had a 7y-in. diameter brass electrical 
contact located just below the fluid hole.

The experimental design that was used 
during initial training incorporated two sched- 
ules of reinforcement (CRF and FR8), and 
two concentrations of reinforcement (1 gm. 
and 10 gm. of saccharin added to 1 1. of tap 
water). Sixteen Ss were randomly assigned 
to each of the four groups called for by this 
design. After initial training, the concentra- 
tions of reinforcement were switched for half 
the Ss trained with each schedule of rein- 
forcement. For both the CRF and FR8 
schedules of reinforcement, the postswitch 
design thus included one group switched from 
the low to the high concentration, one group 
switched from high to low, and two control 
groups which continued on their initial con- 
centrations (N = 8 in each group).
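The layout of this design can be written out as an assignment procedure. The sketch below is hypothetical: the subject numbering, the seed, and the function name are illustrative, and the random choice of which half of each training cell is switched is an assumption, since the text states only that Ss were randomly assigned to the four initial groups.

```python
import random


def assign_subjects(subject_ids, seed=0):
    """Randomly assign 64 Ss to 4 training cells of 16, each split into 8 switched and 8 control."""
    rng = random.Random(seed)
    ids = list(subject_ids)
    rng.shuffle(ids)
    cells = [(schedule, concentration)
             for schedule in ("CRF", "FR8")
             for concentration in ("low", "high")]
    design = {}
    for i, (schedule, concentration) in enumerate(cells):
        cell = ids[i * 16:(i + 1) * 16]
        # Assumption: the half of each cell to be switched is also chosen at random.
        design[(schedule, concentration, "switched")] = cell[:8]
        design[(schedule, concentration, "control")] = cell[8:]
    return design


design = assign_subjects(range(1, 65))
for key, members in design.items():
    print(key, len(members))
```

Each key names the schedule, the initial concentration, and the postswitch treatment, so the eight postswitch groups of 8 fall directly out of the four training cells of 16.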

Upon receipt from the breeder, Ss were put
in colony cages and maintained on ad lib. 
Purina lab chow and water (available from 
cans) for 10 days. Ten days of taming 
followed. On Day 1 of taming, Ss were 
transferred to individual cages and placed on 
a daily deprivation diet of 10 gm. ground 
Purina lab chow mixed with 20 cc of tap
water. They stayed on this deprivation 
schedule throughout the remainder of the 
experiment. Also, on each taming day, Ss 




were taken in groups of approximately 15, 
freely handled, and allowed to explore in a 
large wooden box for 30 min. They received 
their daily ration of wet mash in their home 
cages after this procedure was completed. 

Following taming, all Ss received 22 days 
of initial conditioning of the instrumental 
licking response. On Day 1, each S was 
placed in the apparatus and permitted 300 
licks from the brass tube on CRF. The con- 
centration of reinforcement variable was 
introduced immediately. The size of each
reinforcement was .005 cc of fluid. The 
plastic nipple was introduced on Day 5. 
The number of licks permitted all Ss and the 
ratio of reinforcement for the partial Ss 
gradually increased so that by Day 7, all Ss 
made 1,000 licks, and by Day 18, the partial 
Ss were on their final FR8. This regime was 
continued through Day 22. 

On Day 23, the concentration of saccharin 
for Ss in the switched groups was shifted from 
high to low or from low to high, depending 
upon the concentration that was used during 
initial training. This regime was continued
from Day 23 through Day 29. On Day 30, 
Ss in the switched groups were shifted back 
to their original concentrations and run under 
this regime through Day 36. The Ss in the 
control groups remained on their initial 
concentrations throughout the 36 days of 
training. 

On Day 37, 3 days of extinction began. 
Each day, Ss were placed in the apparatus 
and allowed to lick the dry tube for 3 min. 
All conditions remained the same as those 
prevailing during training, except that the 
pump and solenoid-driven lever were dis- 
connected from the programmer. 

The Ss were fed their daily ration of wet 
mash 20 to 30 min. after they had been run. 
The amount of water added to the mash was 
adjusted to account for the fluid obtained in 
the apparatus.

During training, the total time that each S 
required to emit its 1,000-lick allotment was 
recorded for each day. These times were 
transformed to reciprocals and multiplied by 
1,000 to give a measure of average licking rate 
in licks per second. During extinction, the 
total number of licks that each S emitted 
during the daily 3-min. extinction period was 
recorded. 
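The rate measure described above is the reciprocal of the session time scaled by the lick allotment. A minimal sketch, assuming times were recorded in seconds (the unit is implied by the resulting measure, licks per second); the function name is illustrative.

```python
def licks_per_second(total_seconds, licks=1000):
    """Average licking rate: reciprocal of the total time multiplied by the lick allotment."""
    if total_seconds <= 0:
        raise ValueError("total_seconds must be positive")
    return licks * (1.0 / total_seconds)


# Hypothetical example: an S that needs 250 sec. to emit its 1,000 licks.
print(licks_per_second(250.0))
```

Because every S emits the same number of licks, differences in this measure reflect only differences in the time taken.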


Results 


Preswitch performance.—By Day 
22, the last training day before the 
first switch in saccharin concentration, 




CRF produced faster licking rates 
than PRF, and the high concentration 
of saccharin produced faster licking 
rates than the low concentration of 
saccharin. An analysis of variance 
for Day 22 for Ratio and Concentra- 
tion of Reinforcement yielded an F 
for Ratio of 15.83 (df = 1/60, 
P < .01) and an F for Concentration 
of 59.11 (df = 1/60, P < .01). The
interaction between the two variables 
was not significant (P > .05). 

First switch.—The effect of the first 
switch in concentration, which oc- 
curred on Day 23, was quite different 
for CRF and PRF. Figure 1 shows 
that with PRF an orderly and appro- 
priate change in rate occurred from 
Day 22 to Day 23. If the switch was 
from a high to a low concentration, 
licking rate decreased. If the switch 
was from low to high, licking rate in- 
creased. With CRF, on the other 
hand (Fig. 2), licking rates decreased 
regardless of the direction of the 
switch in concentration. 

Statistical analyses of these results 
were carried out in two ways: First, 
analyses of variance were used on 
means for the groups that were 
switched in concentration (Table 1). 
These analyses were done separately 


Fig. 1. Effect of shifts in concentration of
reinforcement for PRF in Exp. I. (Shifts
occurred between Days 22 and 23, and between
Days 29 and 30.) Curves: High-Control,
Low-Control, High-Switched, Low-Switched.
Axes: days (abscissa), licks per second (ordinate).

Fig. 2. Effect of shifts in concentration
of reinforcement for CRF in Exp. I. (Shifts
occurred between Days 22 and 23, and between
Days 29 and 30.) Axes: days (abscissa),
licks per second (ordinate).

for the CRF and PRF conditions, and 
they included pre- and postswitch 
days as a variable. Second, analyses 
of variance were used on means for 
both the control and switched groups 
(Tables 2 and 3). These were done 
separately for both the CRF and PRF 
conditions and for Days 22 and 23. 
Table 1 shows a highly significant 
Treatments X Days interaction for 
the PRF condition. By t test, the
licking rate of the PRF High-Switched 
group decreases from Day 22 to Day 


TABLE 1

ANALYSES OF VARIANCE OF SWITCHED-GROUP
LICKING RATES DURING TRAINING

                                     MS
                            PRF               CRF
Source              df   Days     Days     Days     Days
                         22-23    29-30    22-23    29-30
Ss                  15
  Treatments (T)     1   0.98     0.21     1.67     0.84
  Error             14   1.09     0.70     1.35     0.75
Days (D)             1   1.62     0.01     9.57**   0.05
Ss × D              15   0.67     0.30     0.56     0.26
  T × D              1   7.22**   2.64**   1.32     2.65**
  Error             14   0.20     0.13     0.51     0.09

** P < .01.


TABLE 2

ANALYSES OF VARIANCE OF LICKING RATES
DURING TRAINING FOR PRF GROUPS

                                        MS
Source              df   Day      Day      Day      Day      Day
                         22       23       29       30       36
Concentration (C)    1   12.50**  7.70**   6.57**   7.22**   5.36**
Switch (Sw)          1   0.01     0.38     0.94     0.07     3.06*
C × Sw               1   0.01     1.17     1.95*    0.36     0.43
Error               28   0.43     0.54     0.34     0.42     0.62

* P < .05.
** P < .01.


23 (t = 6.36, df = 14, P < .01), and 
the licking rate of the PRF Low- 
Switched group increases from Day 22 
to Day 23 (t = 2.27, df = 14, P < .05).
For the CRF condition, Table 1 shows 
that only Days is a significant vari- 
able. Licking rates on Day 23 are 
much lower than on Day 22, and this 
is true regardless of the direction of 
the switch in concentration. Tables 2 
and 3 show that for both PRF and 
CRF on Day 22 only the Concentra- 
tion variable is significant. High con- 
centrations produced faster licking 
rates than low concentrations. On 
Day 23, however, the pattern of 
significant effects is quite different for 
PRF and CRF. For PRF, the Con- 
centration variable once again pro- 
duces the only significant effect. 
Licking rate is correlated with the 


TABLE 3

ANALYSES OF VARIANCE OF LICKING RATES
DURING TRAINING FOR CRF GROUPS

                        MS
Source             | df | Day 22 | Day 23 | Day 29 | Day 30 | Day 36
Concentration (C)  |  1 | 8.92** | 0.92   | 1.62   | 4.97** | 1.56*
Switch (Sw)        |  1 | 0.16   | 7.22** | 0.10   | 0.13   | [illegible]
C X Sw             |  1 | 0.31   | 1.20   | 0.32   | 0.09   | 0.02
Error              | 28 | 0.32   | 0.90   | 0.45   | 0.34   | 0.27

* P < .05.
** P < .01.


STEWART H. HULSE 


concentration used on Day 23 for both 
switched and control groups. For 
CRF, only the Switch variable is 
significant. Licking rates are gen- 
erally lower for the switched groups 
than they are for the control groups. 
By t tests, both the High-Switched
and the Low-Switched groups differ 
from the High-Control group 
(t's = 2.72 and 2.83, df = 28, Ps < .05 
and .01, respectively), but neither 
switched group differs significantly 
from the Low-Control group
(Ps > .05).
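The Treatments X Days entries of Table 1 can be turned into F ratios with a few lines of arithmetic. The sketch below is illustrative only. It assumes that each interaction mean square is tested against the Error mean square listed directly beneath it (df = 1/14), and it takes 8.86 as the tabled critical value of F at the .01 level for those degrees of freedom. The dictionary layout and names are hypothetical.

```python
# Mean squares copied from Table 1: (Treatments X Days MS, Error MS below it).
# Assumption: the interaction is tested against that within-Ss error term.
table1_txd = {
    "PRF, Days 22-23": (7.22, 0.20),
    "PRF, Days 29-30": (2.64, 0.13),
    "CRF, Days 22-23": (1.32, 0.51),
    "CRF, Days 29-30": (2.65, 0.09),
}

# Assumption: tabled critical F for df = 1/14 at the .01 level.
F_CRIT_01 = 8.86


def f_ratio(ms_effect, ms_error):
    """Return the F ratio of an effect mean square to its error mean square."""
    return ms_effect / ms_error


f_values = {name: f_ratio(*ms) for name, ms in table1_txd.items()}

for name, f in f_values.items():
    verdict = "P < .01" if f > F_CRIT_01 else "not significant at .01"
    print(f"{name}: F = {f:.2f} ({verdict})")
```

Under these assumptions only the CRF interaction for Days 22-23 falls short of the .01 level, which agrees with the pattern of significance marks in the table.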

Terminal performance on the con- 
centrations of the first switch was 
quite different for PRF and CRF.²
For PRF on Day 29 (Fig. 1), licking 
rates for the Low-Switched group 
reached about the same level as those 
of the High-Control group. Licking 
rates for the High-Switched group, 
however, showed an increase following 
Day 23 and reached an asymptote 
that was much higher than that of the
Low-Control group. Table 2 shows 
significant effects for both Concen- 
tration and the interaction between 
Concentration and Switch in Concen-
tration. By t test, the Low-Switched
group does not differ from the High- 
Control group (P > .05), but the 
High-Switched group licks signifi- 
cantly faster than the Low-Control 
group (t = 2.90, df = 28, P < .01). 
The results for CRF (Fig. 2) show a
pattern of differences among the
groups which is similar to that for
PRF, but the magnitudes of the
differences are much smaller. Table 3
shows that for CRF none of the ex-
perimental variables produced signifi-
cant effects on Day 29.

² Analyses of variance for the last 3 days
of the first and second switches revealed no 
significant effects due to Days or the inter- 
action of Days with any of the other variables. 
The data thus provide no evidence that the 
groups had not reached asymptotic perform- 
ance by Day 29 or by Day 36. 




REINFORCEMENT SHIFT EFFECTS 


Second switch.—With one major
exception, the effect of the second 
switch in concentration on Day 30 
was quite similar for both PRF and 
CRF. Figures 1 and 2 show that the 
licking rates of the switched groups 
changed in the appropriate direction 
when these groups returned to their 
original training concentrations. This 
was true regardless of ratio of rein-
forcement. The extent of the change
in rate, however, was quite different
for PRF and CRF. With CRF,
Fig. 2 shows that the switched groups 
reached licking rates which were es- 
sentially the same as the control 
groups. With PRF, however, Fig. 1 
shows that the Low-Switched group 
changed to a performance level which 
was considerably above that of the 
Low-Control group. By Day 36, the 
PRF High-Switched group also re- 
sponded at a faster rate than the 
High-Control group. 

Table 1 shows a significant Treat- 
ments X Days interaction for the 
switched groups for Days 29 and 30 
for both PRF and CRF. Tables 2 and 
3 show a significant Concentration 
effect on Day 30 for both PRF and 
CRF, but none of the other variables
are significant. On Day 36, however, 
there is a significant Switch effect for 
PRF in addition to significant Con-
centration effects for both PRF and
CRF. Although both PRF switched 
groups performed above their controls 
on Day 36, the difference between the 
Low-Switched and the Low-Control 
group is the only one that reaches 
significance (t = 2.16, df = 28,
P < .05). 

Extinction.—Figure 3 shows that,
in general, the PRF groups emitted 
more licks on the dry drinking tube 
than the CRF groups. Moreover, 
with PRF, the switched groups were 
far more resistant to extinction than 
the nonswitched groups. With CRF, 


[Figure 3. Vertical axis: number of licks. Horizontal axis: days.]

Fig. 3. Resistance to extinction in Exp.
I for PRF and CRF Switched and Control
groups.


however, the data suggest that the 
switched groups were less resistant 
to extinction than the nonswitched 
groups. 

An analysis of variance on mean 
number of licks for all groups across 
the 3 days of extinction shows signifi-
cant effects for Ratio of Reinforce-
ment (F = 70.48, df = 1/56, P < .01),
Days of Extinction (F = 102.93,
df = 2/126, P < .01), Switch in Con-
centration (F = 5.78, df = 1/56,
P < .05), Ratio X Switch (F = 9.10,
df = 1/56, P < .01), and Ratio
X Days (F = 41.46, df = 2/112,
P < .01). For PRF, the mean differ-
ence between the switched and non-
switched groups is significant (t = 6.61,
df = 56, P < .01). For CRF, this
difference is not significant (t = 0.76,
df = 56, P > .05).


EXPERIMENT II 


Experiment II was run to check on 
the generality of some of the phe- 
nomena which resulted from the first 
switch of Exp. I. Experiment II
incorporated more days of training 
before the switch took place and used 
more extreme differences in the con- 
centration of the saccharin rein- 
forcement. 




Method 


Subjects.—The Ss were 30 naive male
albino rats, approximately 75 days of age, of 
the Sprague-Dawley strain. 

Apparatus and procedure.—The apparatus 
was identical to that used in Exp. I. Simi- 
larly, the details of housing, taming, and 
feeding of Ss were essentially the same as 
those of Exp. I. A somewhat more severe 
daily deprivation regime was adopted: 9 gm. 
of Purina chow mixed with 18 cc of tap water. 

The experimental design that was used was 
identical to that of Exp. I except that no 
High-Switched groups were run. The post- 
switch design thus included six groups with 
an N of 5 in each group. Reinforcement 
concentrations were either .50 gm. or 10 gm.
of saccharin added to 1 l. of tap water. As
before, ratio of reinforcement was either CRF
or FR8. The size of each reinforcement was
.0043 cc of fluid.

Initial training in Exp. II was identical 
to that of Exp. I with the following excep- 
tions. Only 600 licks were permitted each 
day. The PRF Ss were on their final FR8 
schedule by Day 16, and all Ss continued 
initial training through Day 32. The PRF 
Ss thus received 17 days of training on FR8 
instead of the 5 days that the PRF Ss received 
in Exp. I. Following initial training, concen- 
tration of reinforcement was changed from 
low to high on Day 33 for the PRF and CRF 
Switched groups, and the Ss continued on the 
new regime through Day 37. 


Results 


The immediate effects of the switch 
from a low to a high concentration are
qualitatively exactly the same as 


[Figure 4. Licks per second (vertical axis) by days, 32 to 37 (horizontal axis), for the PRF High-Control, Low-Control, and Switched groups.]

Fig. 4. Effect of shift in concentration of
reinforcement for PRF in Exp. II. (The
shift occurred between Days 32 and 33.)

[Figure 5. Licks per second (vertical axis) by days, 32 to 37 (horizontal axis), for the CRF High-Control, Low-Control, and Switched groups.]

Fig. 5. Effect of shift in concentration of
reinforcement for CRF in Exp. II. (The
shift occurred between Days 32 and 33.)


those obtained in Exp. I. Figure 4 
shows that with PRF, the licking 
rates of the Switched group increased 
markedly from Day 32 to Day 33. 
Comparison of the data shown in 
Fig. 4 with those shown in Fig. 1 
suggests that the increase in rate was 
considerably greater under the condi- 
tions of Exp. II than under those of 
Exp. I. Figure 5 shows that with 
CRF, licking rates of the Switched 
group declined, as they did in Exp. I. 

An analysis of variance used on the 
group means of Days 32 and 33 for
PRF shows the following effects to be 
significant: Treatments (F = 14.76, 
df = 2/12, P < .01), Days (F = 4.66, 
df = 1/14, P < .05), and Treatments 
X Days (F = 7.49, df = 2/12, 
P < .01). The significant Treat-
ments effect occurs primarily because
of the large and consistent difference 
in performance between the High- 
Control and the Low-Control groups, 
while the significant Treatments 
X Days effect reflects the sharp 
change in performance of the Switched 
group from Day 32 to Day 33. An 
analysis of variance for CRF shows 
Treatments to be the only significant 
effect (F = 12.61, df = 2/12, P < .01).
A third analysis of variance, which 
compared the performance of the 




CRF and PRF Switched groups on
Days 32 and 33, shows a significant
Days X Ratio of Reinforcement inter-
action (F = 9.14, df = 1/8, P < .05).
This corroborates the results shown by
the separate analyses for PRF and
CRF regarding the effects of the
switch in concentration.

Following the first day of the
switch in concentration, the perform-
ance of the CRF and PRF Switched
groups was quite similar. Both
groups reached a licking rate which
was between the rates of the High-
Control and Low-Control groups.
An analysis of variance of means
obtained by averaging rates on Days
36 and 37 shows a significant effect
due to Treatments (F = 18.86,
df = 2/22, P < .01) and to Ratio of
Reinforcement (F = 15.57, df = 1/22,
P < .01). The interaction between
these two variables is not significant.
By t test, the Switched groups re-
sponded significantly faster than the
Low-Control groups (t = 3.62, df = 22,
P < .01) and significantly slower than
the High-Control groups (t = 2.50,
df = 22, P < .05).


Discussion 


The licking behavior that occurs on the
day of a sudden change in the concentra-
tion of the reinforcer indicates that rats
who are trained with PRF are condi-
tioned to respond to the reinforcing
stimulus in a way which is quite different
from that of rats who are trained with CRF.
With PRF, the sweetness and other
[properties] of the reinforcer are [...]
which occurs when [responses are]
sometimes followed by a drop of sac-
charin and sometimes followed [by noth]ing.
If the concentration of the saccharin
suddenly changes, the change is rela-
tively easy to detect, and response rate
immediately increases or decreases in the
direction of the change. In effect, be-
havior comes under critical and orderly
control of the stimulus properties of
the reinforcer through discrimination
training.

A different situation prevails when 
CRF is used. Here, each successive 
response is followed by the same sweet 
stimulus, and there is no stimulus con- 
trast from one response to the next. 
With CRF, S learns to attack the drink- 
ing tube in a way which is vigorous, but 
relatively speaking, blind. If the con- 
centration of the saccharin reinforcer 
changes, stimulus generalization decre- 
ment prevails, and behavior is markedly 
disrupted. Pure generalization decre- 
ment will hold the first time that S meets 
a sudden change in concentration, but 
not, of course, on later occasions. With 
the first change, S receives new informa- 
tion about reinforcement conditions and 
is, in fact, trained through contrast to 
discriminate somewhat more about the 
stimulus properties of the reinforcer. 
A second change in concentration comes 
as less of a surprise to S, and as Exp. I
shows, response rates immediately in- 
crease or decrease appropriately. 

The behavior that appears on the days
which follow a change in reinforcement
concentration, and the behavior that
appears during extinction, indicate that
experience with a particular set of rein-
forcement conditions on PRF produces
residual effects which have a long-lasting
influence on response strength. The dis-
crimination training that PRF provides
apparently yields a learned connection
between licking behavior and a particular
stimulus property of the reinforcer, such
as its sweetness.

First, licking rates for the PRF 
switched groups reach postshift asymp- 
totes which often lie between the licking 
rates that are characteristic of the high 
and low concentrations of saccharin. 
The Ss behave much as if they were 
averaging the intensities of the pre- and 
postswitch reinforcing stimuli. Stimulus 
averaging of this sort seems to be most 
easily demonstrated when concentrations 
are changed from high to low (cf. Fig. 1). 
On the first postswitch day of the second
change in concentration of Exp. I, for
example, the response rate of the PRF 
Low-Switched group falls sharply. But 
it remains well above the response rate 
of the Low-Control group. In Exp. II, 
a compromise in rate is also obtained 
when concentrations are changed from 
low to high. The PRF Switched group 
increases its licking rate on the first 
postswitch day, but then the rate falls 
to an asymptote which is again about 
half way between the rates of the control 
groups. Sometimes, stimulus averaging 
and compromises in response rate also ap-
pear with CRF. But when they do, they
are far less easily obtained than they are 
with PRF. In Exp. I, there is no indica- 
tion of a consistent compromise in 
response rate for CRF. However, in 
Exp. II, which involved more training 
days and larger differences in reinforce- 
ment concentration, a compromise in 
response rate is clearly obtained for CRF 
as well as for PRF. Apparently the more 
extreme conditions of Exp. II are re- 
quired, however, before a compromise in 
rate will appear with CRF and instru- 
mentally conditioned licking. In this 
connection, Premack and Hillix (1962) 
trained rats to lick 4% and 16% sucrose 
solutions from a conventional drinking 
tube, a procedure analogous to CRF 
conditioning as defined here, and then 
shifted the concentration of sucrose for 
these rats to 32%. The licking rates on 
the new concentration were consistently 
lower than the rate of rats maintained on 
32% sucrose throughout the experiment. 
Apparently, experience with a particular 
reinforcing stimulus on CRF can make 
that stimulus have long-lasting effects on 
behavior under some conditions. The 
important point is that, for the present 
at least, these conditions appear less 
well defined for CRF than they do for 
PRF. 


Second, if experience with a particular 
set of reinforcement conditions on PRF 
produces residual effects which per- 
manently influence behavior, extinction 
should provide a sensitive test for these 
effects. This would be true since ex-
tinction removes S from direct exposure
to the primary reinforcing stimulus.
The data suggest that the control of 
behavior by the reinforcing stimulus is 
indeed permanent and that it does trans- 
fer to extinction. First, the extinction 
data of Exp. I show that a past history 
of shifts in the concentration of the 
reinforcer serves to markedly increase 
resistance to extinction when PRF is 
used. Further, Hulse (1958), Wagner 
(1961), and Hulse and Bacon (1962)
have shown that with PRF resistance to 
extinction increases as amount of re- 
inforcement increases. This is true 
whether amount of reinforcement is 
defined in terms of weight of food or 
sweetness of saccharin solutions. 

None of these things are true when 
CRF is used. In Exp. I the CRF groups 
that had a past history of shifts in 
concentration of the reinforcer were, if 
anything, less resistant to extinction 
than the control groups. In other ex- 
periments which have used CRF and 
varied amount of reinforcement, the ex- 
tinction results are at best unpredictable. 
Sometimes resistance to extinction in- 
creases as amount of reinforcement 
increases (Zeaman, 1949), sometimes it 
decreases (Armus, 1959; Hulse, 1958), 
and sometimes it does not vary with 
amount of reinforcement at all (Hulse & 
Bacon, 1962). Since S receives no
discrimination training for the reinforcer 
with CRF, it is perhaps not surprising 
that these inconsistent results should be 
obtained. The data suggest that with 
CRF, S is stimulus bound. Response 
strength will be consistently correlated 
with some stimulus property of the 
reinforcer as long as S is directly exposed
to it, but change the reinforcer (or
remove it) and the correlation vanishes.

Finally, none of the data in either 
experiment reveals the slightest trace 
of elation, depression, positive contrast, 
or negative contrast (Pubols, 1960). 
The only time that a switched group 
licks either faster or slower than a 
control group, in a way which might 
suggest a contrast effect, occurs after the
initial change in concentration of Exp. I. 
Here the rate of the CRF High-Switched
group drops below that of the Low-
Control group, but it immediately
bounces back to a point which is higher 
than the Low-Control group. As we 
have seen, it seems most meaningful in 
the context of the present experiments to 
view changes in rate of this sort as due 
to stimulus generalization decrement. 
In the same connection, it is interesting 
to note that most of the significant con- 
trast effects that are reported in the 
literature are negative contrast effects 
(Spence, 1956; Pubols, 1960). Since all 
the earlier experiments have used CRF, 
perhaps generalization decrement was at 
least as important as some emotional- 
motivational factor in determining the 
contrast effects that were obtained. 


SUMMARY


Instrumentally conditioned licking was
studied in two experiments as a function of
[...] With partial reinforcement (PRF), a shift in con-
centration produces an immediate change in
licking rate in the direction [of the shift].
With continuous reinforcement (CRF), the
immediate reaction to the shift is always a
decrease in response rate. With PRF,
asymptotic response rates reached several
days after a shift are often compromises be-
tween the rates of control groups maintained
on high and low concentrations. [With CRF,]
a compromise of this sort appears only when
relatively large amounts of training and
relatively extreme differences in concentration
of reinforcement are [used].

The data suggest that, with PRF, behavior
is more critically and permanently under the
control of reinforcement stimuli than with
CRF. This happens because PRF provides
discrimination training for reinforcement
stimuli, but CRF does not.


REFERENCES 


Armus, H. L. Effect of magnitude of rein-
forcement on acquisition and extinction of
a running response. J. exp. Psychol., 1959,
58, 61-63.

Guttman, N. Operant conditioning, extinc-
tion, and periodic reinforcement in relation
[...] used as a rein[...]. J. exp.
Psychol., 1953, 46, 213-224.

Hulse, S. H. Amount and percentage of
reinforcement and duration of goal confine-
ment in conditioning and extinction. J.
exp. Psychol., 1958, [...].

Hulse, S. H. [...] system con[...].
J. exp. Anal. Behav., 1960, 3, 1-3.

Hulse, S. H. Discriminati[...]
in learning with partial and continuous re-
inforcement. J. exp. Psychol., 1962, 64,
227-233.

Hulse, S. H., & Bacon, W. E. Supplemen[...]
amount of reinforcement as determinants
of instrumental licking rates. J. exp.
Psychol., 1962, 63, 214-215.

Hulse, S. H., Snyder, H. L., & Bacon,
W. E. Instrumental licking behavior as a
function of schedule, volume, and con-
centration of a saccharin reinforcer. J. exp.
Psychol., 1960, 60, 359-364.

Premack, D., & Hillix, W. A. Evidence for
shift effects in the consummatory response.
J. exp. Psychol., 1962, 63, 284-288.

Pubols, B. H., Jr. Incentive magnitude,
learning, and performance in animals.
Psychol. Bull., 1960, 57, 89-115.

Spence, K. W. Behavior theory and condition-
ing. New Haven: Yale Univer. Press, 1956.

Wagner, A. R. Effects of amount and per-
centage of reinforcement and number of
acquisition trials on conditioning and ex-
tinction. J. exp. Psychol., 1961, 62, 234-
242.

Zeaman, D. Response latency as a function
of amount of reinforcement. J. exp.
Psychol., 1949, 39, 466-483.


(Early publication received June 13, 1962) 


Journal of Experimental Psychology
1962, Vol. 64, No. 5, 460-466 


INFERENTIAL BEHAVIOR IN CHILDREN AS A FUNCTION 
OF AGE AND SUBGOAL CONSTANCY ¹


TRACY S. KENDLER AND HOWARD H. KENDLER 
Barnard College New York University 
Problem solution sometimes re-
quires the integration of units of be-
havior that are already in the reper-
toire of S but have not previously been
used in conjunction with one another.
When the efficiency of integration
increases gradually as a function of
the number of trials the process is
variously described as trial-and-error,
instrumental, or selective learning.
When an efficient solution occurs on
the first trial, without any preceding
trial-and-error behavior, the process is
variously designated as reasoning, in-
sight, or inference. The present ex-
periment is one of a series that seeks to
explore the relationship between these
two processes in children (Kendler &
Kendler, 1956; Kendler, Kendler,
Pliskoff, & D'Amato, 1958; Kendler &
Kendler, 1961). These studies have
used an experimental paradigm, de-
rived from Hull (1935, 1952), in which
S is trained on three discrete behavior
segments (A-B, X-Y, and B-G) and
then presented with a test situation in
which S is instructed to get G when
only A and X are available. Problem
solution requires the assembly of A-B
and B-G.

The purposes of the present study
were (a) to determine whether the
ability to infer, as measured by these
particular operations, increases with
age and (b) to analyze the role of the
integrating B stimulus in the infer-
ential process.


1 This research was sponsored by the
National Science Foundation. The authors
wish to express their appreciation for the
cooperation extended by Orville Sipe, the
principal of the Le Conte Public School, and
his staff throughout the conduct of this
research.
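The logic of the paradigm, in which G must be reached by assembling A-B with B-G when only A and X are available, can be sketched as a simple chaining procedure. The sketch is illustrative and is not a model proposed by the authors. The dictionary encoding of the segments and the function name chain_to_goal are hypothetical.

```python
# Hypothetical encoding of the three trained segments: each key is a
# response or object, each value is the outcome that segment produces.
segments = {"A": "B", "X": "Y", "B": "G"}


def chain_to_goal(start, goal, segments):
    """Follow trained segments from start; return the path if it reaches goal, else None."""
    path = [start]
    seen = {start}
    while path[-1] != goal:
        nxt = segments.get(path[-1])
        if nxt is None or nxt in seen:
            return None  # dead end (or a loop): this choice never yields the goal
        path.append(nxt)
        seen.add(nxt)
    return path


# Only A and X are available at the start of the test trial.
available = ["A", "X"]
solutions = {choice: chain_to_goal(choice, "G", segments) for choice in available}
print(solutions)
```

In this encoding the A choice leads through B to G, whereas the X choice ends at Y, which is why A is treated as the inferential choice.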


METHOD 


Experimental design.—There were four ex- 
perimental groups, 32 Ss in each, arranged in 
a 2 X 2 factorial design. The main effects 
were age, kindergartners (K) vs. third-graders 
(3), and the constancy of training and test
subgoals, constant (C) vs. switched (S). 

Subjects.—The Ss were 137 children drawn 
from the Le Conte Public School in Berkeley, 
California. Nine Ss were eliminated, 5 of
them because they made simultaneous A and
X choices, and 4 due to inadvertent errors in
experimental procedure. The data to be re-
ported are based on the remaining 128
children, of whom 63 were boys and 65 girls.
The kindergartners' mean age was 68.7 mo.,
range from 61 to 74. The third-graders'
mean age was 103.9 mo., range from 96 to 115.
Within each age level Ss were randomly 
assigned to C and S groups without regard 
to age or sex. 

Apparatus.—The portable aluminum ap-
paratus used consisted of three distinct square
panels 17.5 cm. on a side. Each panel, which
could be exposed to S's view singly or in
combination with the others, corresponded to
one habit segment. The center panel pro-
vided for the B-G segment. On its blue
anodized surface was a circular opening 2.4
cm. in diameter into which S could drop the
objects that served as subgoals in this experi-
ment, namely, a glass marble with a 1.8-cm.
diameter and a steel ball bearing with a 1.3-
cm. diameter. If S dropped the correct
subgoal into the circular opening, a small,
shiny, gold fairy-tale or nursery-rhyme charm
was propelled to a trough near the bottom of
the panel. The incorrect subgoal delivered
no reward. The charms, e.g., “Little Bo
Peep,” “the cow that jumped over the moon,”
“the gingerbread man,” etc., which were
presented in a set sequence, served as the
major goal (G).

The two side panels corresponded to the
A-B and X-Y segments. The left panel was
anodized pink and the right anodized yellow.
Each panel was equipped with a button which,
when pressed, closed a circuit that led to the
delivery of the appropriate subgoal (glass
marble or ball bearing) to a trough near the
bottom of that panel.

Procedure.—All Ss were run individually 
and completed in one experimental session. 
The session began with the administration of 
the Peabody Picture Vocabulary Test (Dunn, 
1959), which took from 5 to 15 min. and was 
followed immediately by training on the 
inference apparatus. One of the side panels 
was opened and S was told, “Press this button 
and see what happens.” When the subgoal 
was delivered, S was instructed to pick it up, 
look at it, then return it to E so that he might 
have another turn. This side panel was 
closed, the other side panel opened, and the 
procedure repeated. After S had one forced 
trial on each panel, the procedure was re- 
peated with the order of the sequences 
reversed. Thus at this point in training S
had made two responses on the A-B and on 
the X-Y segments in an ABBA order. After 
these four forced trials, the doors of both side 
panels were opened and S was shown one of 
the subgoals and directed to, “Press the 
button that will get one like this.” The 
procedure was repeated with each of the sub- 
goals presented in an ABBA-BAAB order 
until the criterion of six successive correct
[responses was reached].

The next step of the training started with
E opening the middle panel (after closing the
two side ones) and directing the attention of S
[...] would soon get an opportunity [...]
closely. The aperture was pointed out and
S was informed that if he dropped “the right
thing” into that hole, the charm would
[...] into the tray. [...] the right thing. [...]
it in the hole to see if it makes the charm come
out.” On the next trial S was again in-
structed to drop in the one that would make
the charm come out. [...]
proceeded until a criterion of four successive
[...] bearing (i.e., left and right) were varied in a
random order after each correct response
throughout the training on the B-G segment.


After preliminary training the test trial
was introduced with the following instruc-
tions: “Would you like to see another charm?
Very well, this time I won't put out any little
things, but I will open all the doors. If you
do what you are supposed to, you can make
the charm come out. Go ahead.” The S
was allowed 60 sec. in which to make any
[response] that he chose. If he did not press
the A or X button during this time, E said,
“Which button should you press to help you
[...] Go ahead.” [After S pressed]
either button, he was allowed another 60 sec.
to complete the sequence by dropping either
subgoal into the B-G aperture before the trial
was terminated. Thus all Ss made either an
A or X response and the trial was terminated
either when S made a major goal response or
when 60 sec. had elapsed since the subgoal had
appeared.

For half of the children at each age level
the subgoals were switched (S) between the
preliminary training and the test trial so that
if S made an A choice, he obtained the Y sub-
goal while if he made an X choice he obtained
[the B subgoal]. For the remainder the sub-
goals [...] position (C). [...]
of the Ss in each experimental group had the
right panel serve as the A-B segment while the
left panel served as the X-Y segment. The
[oppos]ite arrangement applied to the remain-
[der]. (b) For half of the Ss in each experi-
[mental group] [...] until [...]
emphasized that at no point in the training
or test did E describe the subgoal by name.


RESULTS AND DISCUSSION 


IQ scores.—The mean PPVT IQ
scores for the four experimental groups
were as follows: K-C (Kindergarten-
Constant), 105.1; K-S, 106.8; 3-C
(Third Grade-Constant), 110.5; and
3-S, 108.6. A 2 X 2 factorial analysis
of variance applied to these data
yielded no significant Fs. The mean
IQ for all Ss was 107.8.
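As a check on the arithmetic, the overall mean can be recovered from the four group means. The sketch assumes equal group sizes (32 Ss per group, as stated in the experimental design), so that the unweighted mean of the group means equals the grand mean; the variable names are illustrative.

```python
# Group means as reported; equal n per group is assumed, so a simple
# average of the four means gives the grand mean.
group_means = {"K-C": 105.1, "K-S": 106.8, "3-C": 110.5, "3-S": 108.6}

grand_mean = sum(group_means.values()) / len(group_means)
print(f"{grand_mean:.2f}")
```

The result, about 107.75, agrees with the reported 107.8 to within rounding of the group means.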

Preliminary training.—All groups
quickly acquired the subgoal seg-
ments, i.e., they learned which panel
yielded the marble and which yielded 
the ball bearing. Eighty-four percent 
of all Ss attained criterion with no 
errors and 91% made no more than 
one error. No child made more than 
seven errors. The percentages of Ss 
who attained criterion with no errors 
were as follows, by groups: K-C, 84%;
K-S, 94%; 3-C, 75%; and 3-S, 78%. 
A χ² analysis of the corresponding
frequencies revealed no statistically 
significant differences among the 
groups. 
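A χ² test of this kind can be sketched as follows. The frequencies are not given in the text, so the counts below are back-computed from the rounded percentages on the assumption of 32 Ss per group, and they may differ slightly from the frequencies actually analyzed. The critical value 7.815 is the tabled χ² at the .05 level for 3 df. The function and variable names are hypothetical.

```python
def chi_square(observed):
    """Pearson chi-square statistic for a table given as rows of counts."""
    row_totals = [sum(row) for row in observed]
    col_totals = [sum(col) for col in zip(*observed)]
    total = sum(row_totals)
    stat = 0.0
    for i, row in enumerate(observed):
        for j, obs in enumerate(row):
            expected = row_totals[i] * col_totals[j] / total
            stat += (obs - expected) ** 2 / expected
    return stat


N_PER_GROUP = 32  # from the experimental design
pct_no_errors = {"K-C": 84, "K-S": 94, "3-C": 75, "3-S": 78}

# Assumption: counts reconstructed from rounded percentages.
counts = {g: round(N_PER_GROUP * p / 100) for g, p in pct_no_errors.items()}
observed = [[c, N_PER_GROUP - c] for c in counts.values()]

stat = chi_square(observed)
CHI2_CRIT_05_DF3 = 7.815  # tabled value, 3 df, .05 level
print(f"chi-square = {stat:.2f}, df = 3")
```

With the reconstructed counts the statistic is well below the critical value, which is consistent with the report of no significant differences among the groups.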

The major goal segment was also 
easily learned, i.e., whether the marble 
or the ball bearing yielded the charm. 
Except for 1 S who made nine errors 
and another who made four, all chil- 
dren reached the criterion after no 
more than three errors. The per- 
centages of Ss who attained criterion 
with no more than one error were as 
follows: K-C, 88%; K-S, 88%; 3-C, 
69%; 3-S, 84%. A χ² analysis of
the corresponding frequencies again 
yielded no significant differences 
among the groups. It should be 
noted, however, that in the acquisition 
of both the subgoal and major goal 
segments the older Ss made more 
errors than the younger. This may 
be another manifestation of the re- 
sponse-shift-tendency reported by 
Harlow (1959). 

Test trial: initial choice.—The first
component of inferential behavior is
the initial choice between A and X, in
which A is the inferential choice.
The combined results, which appear
in Table 1, show a statistically
significant age difference in the ex-
pected direction (P < .01). In fact,
on this measure, the kindergarten 
children perform precisely at chance 
level. 


TABLE 1

PERCENTAGE OF Ss IN EACH EXPERIMENTAL
GROUP WHOSE INITIAL CHOICE WAS A

                        Chronological Age Level
Experimental Group  | 5-6 Yr.: % | 5-6 Yr.: σp | 8-10 Yr.: % | 8-10 Yr.: σp
Subgoals switched   | 50.0       | .088        | 71.9        | .079
Subgoals constant   | 50.0       | .088        | 75.0        | .077
Combined            | 50.0       | .063        | 73.4        | .056


This part of the results appears to be
in conflict with two previous studies
(Kendler & Kendler, 1956; Kendler et
al., 1958) which showed that in groups
of children between 34 and 60 mo. of age 
there were significantly more A than X 
choices. The studies, however, differed 
from the present one in two ways. The 
task used was simpler and the Ss were 
from higher socioeconomic levels. A 
more recent study (Kendler & Kendler, 
1961) used a procedure and socio- 
economic sample comparable to the 
present ones and obtained similar results. 
In that study, which had as Ss children 
between 30 and 65 mo. of age, the number 
of A and X choices was almost exactly 
equal. It is therefore suggested that the
apparent conflict in results is due to the 
roles played by the difficulty of the task 
and the intelligence of the Ss. To some 
extent this explanation of the discrep- 
ancy between the studies is supported by 
the findings of the present study (see 
Table 2) that the selection of A is related 
to mental age. It is evident that there
was, in general, an increase in the per- 
centage of A choices with increasing MA. 
This is, however, an ad hoc analysis, not 
tested for statistical significance; con- 
sequently the generality of the results is 
limited. They do, nevertheless, support 
the implication of the CA results, namely 
that inferential behavior is symptomatic 
of an important developmental process. 
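The column printed beside each percentage in Table 1 appears to be the standard error of a proportion. The sketch below recomputes it under that assumption, using the formula sqrt(pq/n) with n = 32 for each experimental group and n = 64 for the combined rows. Neither the formula nor the ns are stated alongside the table, so both are assumptions, as are the names used.

```python
import math


def se_proportion(p, n):
    """Standard error of a proportion p based on n independent observations."""
    return math.sqrt(p * (1.0 - p) / n)


# (label, proportion choosing A, assumed n, tabled value)
rows = [
    ("switched, 5-6 yr.", 0.500, 32, 0.088),
    ("switched, 8-10 yr.", 0.719, 32, 0.079),
    ("constant, 8-10 yr.", 0.750, 32, 0.077),
    ("combined, 5-6 yr.", 0.500, 64, 0.063),
    ("combined, 8-10 yr.", 0.734, 64, 0.056),
]

computed = {label: se_proportion(p, n) for label, p, n, _ in rows}

for label, p, n, tabled in rows:
    print(f"{label}: computed {computed[label]:.4f}, tabled {tabled:.3f}")
```

Under these assumptions every computed value falls within .001 of the tabled entry.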


TABLE 2

PERCENTAGE OF A CHOICES AS A
FUNCTION OF MA

                 Chronological Age Levels
MA        | 5-6 Yr.: % A | 5-6 Yr.: σp | 8-10 Yr.: % A | 8-10 Yr.: σp | Both: % A | Both: σp
36-71     | 40.0         | .098        | 100.0         | —            | 42.3      | .097
72-95     | 48.3         | .093        | 50.0          | .145         | 48.8      | .078
96-119    | 66.7         | .192        | 75.0          | .097         | 73.1      | .087
120-143   | 100.0        | —           | 71.4          | .111         | 77.8      | .098
144-167+  | —            | —           | 88.2          | .079         | 88.2      | .079


Test trial: integration response.—If
after his initial choice, S inserted a
subgoal into the G aperture, he was
considered to have made an integra-
tion response. If the subgoal thus
utilized was B, the integration re-
sponse was correct. If it was Y, the
integration response was incorrect.
Each of these integration responses 
could be further subdivided into those 
that occurred with no unnecessary 
responses intervening between initial 
and goal responses (direct) and those 
that occurred after one or more un- 
necessary responses (indirect). The 
intervening responses ranged from 
making only one unnecessary response, 
e.g., pressing the X button, to repeat- 
ing almost the entire training se- 
quence, e.g., pressing A but leaving B 
in trough, then pressing X and leaving 
Y in trough, then taking both sub- 
goals out and setting them into the 
same position they occupied during 
training of B-G, then finally taking up 
B to drop into the G aperture. (The 
number of unnecessary responses in
indirect solutions may be a fruitful 
subject for analysis in the future, but 
for the present the entire gamut was 
treated as an entity.) Finally, S had 
the option of making no integration 
response at all. 
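
The five response categories described above can be summarized as a small classification rule. This is only an illustrative sketch: the function name, its arguments, and the category labels are hypothetical conveniences, not part of the original scoring procedure.

```python
from typing import Optional


def classify_integration(subgoal: Optional[str], unnecessary_responses: int) -> str:
    """Assign a test trial to one of the five integration categories.

    subgoal: "B" or "Y" if S dropped that subgoal into the G aperture,
             None if S made no integration response.
    unnecessary_responses: number of responses intervening between the
             initial choice and the goal response.
    """
    if subgoal is None:
        return "no integration"
    if subgoal not in ("B", "Y"):
        raise ValueError("subgoal must be 'B', 'Y', or None")
    if unnecessary_responses < 0:
        raise ValueError("unnecessary_responses cannot be negative")
    # B-G was the trained segment, so B is correct and Y is incorrect.
    correctness = "correct" if subgoal == "B" else "incorrect"
    # Any intervening response makes the solution indirect.
    directness = "direct" if unnecessary_responses == 0 else "indirect"
    return f"{directness} {correctness}"
```

As in the text, every indirect solution falls into one category regardless of how many unnecessary responses it contained.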

Table 3 presents the results for all 
Ss in the C groups, divided into these 
five categories. Since it might appear 
that the superiority of the older chil-
dren on this measure is merely a re-
flection of the greater proportion of
correct initial responses, the results
were also analyzed using as a base only 
Ss who made an initial A choice, with 
the following effect. Twelve percent 
of the kindergartners and 67% of the 
third-graders who made an initial A 
choice made direct correct integration 
responses. This difference is signifi- 
cant (P < .001). 

There are several conclusions to be 
drawn from the data in Table 3. One 
is that, as was the case with the initial 
choice measure, the integration meas- 
ure yields marked differences in the 
performance of the two age levels. 
Note that the method used by those 
kindergartners who do attain a correct 
solution is primarily indirect. De- 
scriptively speaking, such inference 
was more like trial-and-error than 
insightful behavior. On the other 
hand, a very large majority of the 
older Ss made a correct integration 
response, and their solutions tended 
to be more direct than indirect. 

TABLE 3

PERCENTAGE OF Ss AT EACH AGE LEVEL IN
CONSTANT SUBGOALS GROUPS WHO
MADE VARIOUS INTEGRATION
RESPONSES

                            Kind of Integration
Chronol.      A-B-G+ (Correct)      X-Y-G− (Incorrect)      No Integration
Age (Yr.)     Direct   Indirect     Direct   Indirect       Response
5-6             6.2      43.8         6.2       6.2           37.
8-10           50.0      37.5         3.1       3.1            6.


The other conclusions deal with the
relations between the first and second
components of the inferential se-
quence. Since many Ss, particularly
among the kindergartners, who made
an A choice either made no integration
response at all or interposed other
responses before completing the se-
quence, it may be concluded that the
stimulus presented by the “B subgoal
in its trough” does not necessarily
produce the B-G behavior segment.
On the other hand, it is also clear that 
when an integration response does 
occur it is much more likely to be a 
B-G response than a Y-G response. 
This result could be due to the pres- 
ence of the B stimulus since it was to 
this stimulus that the integration re- 
sponse was trained. However, in the 
Constant Subgoals groups the appear- 
ance of the B stimulus depends on an 
A response. The data have shown
that, at least among the older Ss, the
A choice is not a matter of chance. It
must be due to some problem solving
activity, probably some covert system 
of responses. It is therefore equally 
possible that the B-G integration may 
be attributed to the same response 
mechanism that led to the correct 
initial choice. 

In order to sort out whether the 
salient influence on the integration 
response is related to the correctness 
of the initial choice or to the correct- 
ness of the subgoal stimulus, the 
percentages of direct integration re- 
sponses under four different conditions 
are presented in Table 4. Two of 
these conditions are drawn from the 
data of the Constant Subgoals groups 
reported above. The other two con- 
ditions are drawn from the Switched 



TRACY S. KENDLER AND HOWARD H. KENDLER 


Subgoals groups whose second com- 
ponent results have not yet been 
reported. In this table R refers to the 
initial response and S to the subgoal 
stimulus. 

When the proportions of direct 
integration responses, correct (+) and 
incorrect (—), under the various 
conditions are compared for the two 
age levels, it is apparent that there are 
too few Ss in the kindergarten groups 
to yield any trends. The older Ss’ 
behavior, however, varies in an inter- 
esting and statistically significant 
way. A χ² analysis of the correspond-
ing frequencies yields a P between
.02 and .05. Third graders are most
likely to integrate the two components 
when both their initial choice and the 
subgoal are correct. When one of 
these elements is incorrect, the prob- 
ability of an integration response 
decreases; however, either element by 
itself does lead more than one-third 
of the older Ss to integrate. When 
neither element is correct, the prob- 
ability of integration is minimal. 

In order to interpret these results 
it is necessary to examine more closely 
the nature of these elements. It is to 
be expected that S+ is more likely to 
produce integration than S—. But 
from Table 4 it can be seen that S+ 
by itself is not sufficient to explain 


TABLE 4 


PERCENTAGE OF Ss AT EACH AGE LEVEL WHO MADE DIRECT INTEGRATION RESPONSES
AS A FUNCTION OF THE CORRECTNESS OF THE INITIAL RESPONSE AND SUBGOAL


Subgoals      Response      Type of Integration
Constant      R+S+          A-B-G+
Switched      R−S+          X-B-G+          +4 2 24 4
Switched      R+S−          A-Y-G−          ie 2 7 35
Constant      R−S−                          16 2: 2



integration at either age level. Among 
the younger Ss very little integration 
occurs even under S+ conditions. 
Among the older Ss, S+ is important 
but is not as adequate by itself as 
when in combination with R+. In 
fact R+ is so important that it can 
effect some integration even when it is 
combined with S—. 

To understand why S+ by itself is 
not the adequate stimulus for integra- 
tion, it must be realized that it does 
not stand for the B subgoal in isola- 
tion. It actually symbolizes a total 
stimulus complex of “B in a trough 
of one of the side panels.” During 
training this particular stimulus com- 
pound was associated with picking up 
B and then returning it to E. There 
are quite a few Ss, particularly among 
the kindergartners, who, during the 
test situation, do just that. It is 
another stimulus compound, namely 
B and Y in front of the B-G panel, 
that is associated with picking up B 
and dropping it into the aperture. 
The data indicate that the general- 
ization from the former stimulus 
compound to the latter is more likely 
to occur in 8-10 yr. olds than in 5-6 
yr. olds. In the older age group it is 
more likely to occur after a correct 
initial choice than after an incorrect
one.


When these results are represented in 
terms of S-R associations (Kendler & 
Kendler, 1962), it becomes possible to 
describe the psychological difference in 
the behavior of the two age groups as well 
as to suggest the relevant mechanisms 
for the inferential behavior reported in 
this study. There appear to be three 
important characteristics of this be- 
havior. One is that making the correct 
initial choice in an inferential solution 
depends on the ability of S to generate a 
covert anticipatory response to the major 
goal and to respond appropriately to its 
cues. Such ability, according to this 
study, is positively related to age (within
the limits tested). A second character- 
istic is that inferential solution depends 
on a “short-cutting” process in which 
some overt S-R sequences previously 
learned must drop out. The younger Ss 
had difficulty in this respect. Repeating 
some of the responses practiced during 
training prevented many from exhibiting 
direct inferential solutions. The third 
characteristic of inferential behavior is 
that it is not governed completely by 
external events. If it were, then it 
would be expected that the availability of 
B following a response to X should yield 
as much integration as when it follows a 
response to A. Since it does not, one can
conclude that the integration response
must be influenced by some internal 
stimulus component associated with the 
correct initial choice. This source of 
stimulation is sufficiently potent to com- 
pete successfully, in some cases, with 
external stimulation when there is a 
conflict between the two. This three- 
part characterization of inference is not 
offered as a theory. Its function is to 
analyze inferential behavior into more 
fundamental processes so that each may 
be investigated independently. 


SUMMARY


Children of two age levels, namely 5-6 and 
8-10 yr., were presented with a task that 
required the linkage of two out of three 
discretely acquired segments of behavior. 
The solution consisted of making an initial 
choice between two of the segments (one 
correct and the other incorrect) and then 
integrating the product of that choice with 
the third segment. The solutions could be 
direct (inferential), i.e., the goal achieved
without any unnecessary responses. They 
could be indirect, i.e., the goal achieved after 
the repetition of previously acquired but 
presently irrelevant behavior segments. The 
findings were: (a) Older Ss made significantly 
more correct initial choices than younger Ss. 
(b) About half of the younger Ss ultimately 
reached the goal, but their method was 
primarily indirect. Almost all of the older Ss 
achieved solution and a majority of them by 
direct inferential means. (c) In inferential 
solutions the integration of the subgoal and 
major segments was a joint function of the 
relevance of the external stimulation (as
determined by preliminary training) and the 
correctness of the initial choice. (d) When 
these two contributing factors were experi- 
mentally balanced against each other, it was 
found that the internal stimulation associated 
with the correct initial response was about as 
important a determiner of integration as the 
relevance of the external stimulus. 


REFERENCES 


DUNN, L. D. Peabody Picture Vocabulary
Test (PPVT). Nashville, Tenn.: Ameri-
can Guidance Service, 1959.

HARLOW, H. F. Learning set and error
factor theory. In S. Koch (Ed.), Psy-
chology: A study of a science. Vol. 2. New
York: McGraw-Hill, 1959.

HULL, C. L. The mechanism of the assembly
of behavior segments in novel combinations
suitable for problem solutions. Psychol.
Rev., 1935, 42, 219-245.

HULL, C. L. A behavior system. New
Haven: Yale Univer. Press, 1952.

KENDLER, H. H., & KENDLER, T. S. In-
ferential behavior in preschool children.
J. exp. Psychol., 1956, 51, 311-314.

KENDLER, H. H., & KENDLER, T. S. Vertical
and horizontal processes in problem-solving.
Psychol. Rev., 1962, 69, 1-16.

KENDLER, H. H., KENDLER, T. S., PLISKOFF,
S. S., & D'AMATO, M. F. Inferential
behavior in children: I. The influence of
reinforcement on incentive motivation.
J. exp. Psychol., 1958, 55, 207-212.

KENDLER, T. S., & KENDLER, H. H. Infer-
ential behavior in children: II. The influence
of order of presentation. J. exp. Psychol.,
1961, 61, 442-448.


(Received September 7, 1961) 


Journal of Experimental Psychology
1962, Vol. 64, No. 5


THE SEMANTIC MEDIATION OF EVALUATIVE MEANING ¹


FRANCIS J. DI VESTA AND DONALD O. STOVER


Syracuse University 


Within the theoretical framework 
of Osgood’s (Osgood, Suci, & Tannen- 
baum, 1957) theory of meaning the 
sign of an object is a primary symbol 
assumed to evoke a representational 
mediating response that is some part 
of the total behavior emitted by the 
organism when stimulated by the 
object itself. This response produces
distinctive cues, mediating behavior 
that would otherwise not have oc- 
curred in the absence of previous 
association of the object with the 
word. The representational elements 
(Pm-Sm) correspond to the meaning of 
the sign, and because of the cue or 
stimulus components the sign can be 
conditioned to other stimuli initially 
lacking in meaning. Upon condition- 
ing such symbolic stimuli are desig- 
nated assigns. On the basis of the 
congruity principle distinctive re- 
sponses may be produced by novel 
stimuli if the assign, within appro-
priate contextual arrangements, is
used to label other novel stimuli
(Osgood et al., 1957).


1 This research was supported by Research
Grant M-2900 from the National Institute of
Mental Health, National Institutes of Health,
United States Public Health Service. Thanks
are due to Donald L. Meyer for statistical
assistance. Randall Martin aided in collect-
ing and analyzing the data for Exp. II. The
authors are grea
and David F. Sine of the Syracuse Board of
Education; to Frank Liss, Principal of the
Charles Andrews School; Elsie Platto,
Principal of the Edward Smith School; Ruth
O'Brien, Principal of the Sumner School; and
to the teachers of the fifth grades for providing
us with facilities and friendly assistance in
working with the children involved in the
study.


Typical studies of this process are those in
which the meanings of verbal stimuli have been
changed by simple conditioning (Staats,
Staats, & Biggs, 1958) and shown to generalize 
to synonyms of those stimuli (Staats, Staats, 
& Heard, 1959). Rhine and Silun (1958) 
demonstrated that development and strength 
of a concept-attitude were affected by the 
amount of reinforcement. Eisman (1955) 
provided reinforcements for a word and then 
associated that word with various colored 
objects, thereby strengthening the probability 
that the object would be chosen by the S. 
Using a Treatments × Ss design, Osipow
(1960) provided evidence that a color-name 
associated with either a positive or negative 
evaluative word resulted in corresponding 
changes in preference for a nonsense figure 
when the color-name was associated with the 
figure. Since significant changes also occurred 
in the control group his results were incon- 
clusive. Di Vesta (1962) compared the 
effects of reinforcing a neutral color-name 
with the effects of attaching the neutral color- 
name to a positive evaluative sign in the first 
stage of the mediation process. It was 
demonstrated that both procedures were 
effective in changing preferences for a non- 
sense figure when the color-name was sub- 
sequently associated with the figure. 


The primary concern in the present 
study was to compare the effect of 
two different experimental designs 
commonly used in the studies sum- 
marized above. These experiments 
were intended to extend the findings 
of the previous investigations by 
testing two hypotheses: (a) that 
association of a neutral symbol with 
several signs, all of which represent 
similar polarities of the evaluative 
dimension of meaning, will result in 
movement in semantic space of the 
assign corresponding to the connota- 
tive meaning of the signs associated 
with it; and (b) that labeling a neutral 
stimulus object (nonsense figure) with 
the conditioned assign will result in an 
evaluation of that stimulus object 
corresponding to the acquired meaning
of the assign, independent of
whatever effects may occur from 
acquired distinctiveness of cues result- 
ing from labeling alone. 

Several distinctions between the 
designs used in the present study and 
those used in previous investigations
may be summarized: A Treatments
× Ss design with repeated measure-
ments on each S was employed in Exp.
I, and a two-factor design with re- 
peated measurements on each S was 
employed in Exp. II. Both experi- 
ments provide for control of learning- 
how-to-learn and warm-up effects. 
The effect of meaning was controlled 
by using experimental treatments in 
which neutral evaluative words were 
conditioned to the assign. In the 
Treatments × Ss experiment two ad-
ditional controls were used, one for the
use of a label without experimentally 
acquired meanings, and another for 
the evaluation of the stimulus object 
without the use of labels. And, 
finally, in both experiments the assign 
was associated with a nonsense figure 
as the stimulus object rather than 
with another verbal symbol. 


EXPERIMENT I 
Method 


Design.—The general procedure used in the 
experiment was a modification of the mediated 
generalization paradigm. In Phase 1, the 
three sets of nonsense syllables were, respec-
tively, conditioned to words with positive,
neutral, and negative evaluative meanings.
In Phase 2, the Ss learned to name each of 
three different nonsense figures with one of the 
three conditioned nonsense syllables, a fourth 
figure with an unconditioned nonsense syl- 
lable, and Ss experienced a fifth figure without 
naming it. Thus, one of the figures was 
associated with a positive-conditioned non- 
sense syllable (positive treatment), a second 
with a neutral-conditioned nonsense syllable 
(neutral treatment), a third with a negative- 
conditioned nonsense syllable (negative treat- 
ment), a fourth with an unconditioned 
nonsense syllable (control-UNS), and a fifth
with no name attached to it (control). In 
Phase 3, the Ss rated each of the figures on 
three semantic differential scales. 

Materials.—Three decks of 5 × 8 in. cards,
with 24 cards in each deck, were used in the
verbal conditioning phase. One positive,
neutral, or negative evaluative word was
printed in block letters 1½ in. high on each
card. Examples of the eight positive evalua- 
tive words are RIGHT, CLEAN, BRAVE, and 
SMART; examples of the eight neutral evalua- 
tive words are MIDDLE, USUAL, AVERAGE, and 
MEDIUM; and examples of the eight negative 
words are WRONG, WICKED, FILTHY, and 
STUPID. The negative and positive evaluative 
words were antonyms. Eight different words 
were used for each of the polar positions. The 
order in which the cards were arranged in the 
deck was determined at random with the 
restriction that each polar position was 
represented twice in every block of six words 
and that no more than one meaning occurred 
twice in succession in each block of six words. 
The same words were used in the second and 
third decks and differed from the first deck 
only in the order in which the words appeared. 
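
The ordering restrictions described above can be sketched as a constrained shuffle. This is an illustration under stated assumptions, not the authors' procedure: the restriction on succession is read here as "at most one adjacent repetition of a polarity within a block of six," the function names are hypothetical, and the word lists passed in are placeholders because only four of the eight words per polarity are given in the text.

```python
import random

POLARITIES = ("positive", "neutral", "negative")


def adjacent_repeats(block):
    """Count places where the same polarity appears on consecutive cards."""
    return sum(1 for a, b in zip(block, block[1:]) if a == b)


def make_block(rng):
    """Return six polarities, each appearing twice, with at most one adjacent repeat."""
    block = list(POLARITIES) * 2
    while True:
        rng.shuffle(block)
        if adjacent_repeats(block) <= 1:
            return list(block)


def make_deck(words_by_polarity, seed=None):
    """Build a 24-card deck of (polarity, word) pairs in four blocks of six.

    words_by_polarity maps each polarity to its eight words; every word
    is used exactly once per deck.
    """
    rng = random.Random(seed)
    # Shuffle each word list once so words are drawn in random order.
    pools = {p: rng.sample(list(words), len(words))
             for p, words in words_by_polarity.items()}
    deck = []
    for _ in range(4):
        for polarity in make_block(rng):
            deck.append((polarity, pools[polarity].pop()))
    return deck
```

Calling `make_deck` three times with different seeds gives three decks that contain the same words and differ only in order, which matches the description of the second and third decks.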

Three sets of semantic differential rating 
scales were used. All scales were measures 
of evaluative meaning. The first was a 
practice set to help Ss understand the 
procedure. The Pretty-Ugly and the Wise-
Foolish scales were used. The second set,
composed of the Cruel-Kind and Wise-
Foolish scales, was used to determine whether
Ss’ meanings for the nonsense syllables were 
changed in Phase 1. The third set, made up 
of Good-Bad, Pretty-Ugly, and Like-Dislike 
scales, was the measure used in Phase 3 to 
determine the evaluation of the figures. It 
should be noted that none of the scales dupli- 
cated any of the specific words used in the 
verbal conditioning procedure. All scales 
were five-point scales, with points 4 cm. 
apart. The S was permitted to check any 
place on the scale that he felt best represented 
his judgment. The intensity of S's response 
was obtained by measuring the distance, from 
the extreme end of the scale to his check mark, 
in centimeters: 1 cm. represented extreme
negative ratings, 9 cm. marked the midpoint 
of the scale, and 17 cm. represented extreme 
positive ratings. 
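
The scoring rule just described can be written out as a short sketch. The constant and function names are hypothetical; the only values taken from the text are the five scale points spaced 4 cm. apart and the 1, 9, and 17 cm. anchors.

```python
SCALE_START_CM = 1.0    # extreme negative rating
POINT_SPACING_CM = 4.0  # distance between adjacent scale points
N_POINTS = 5


def scale_points():
    """Positions (in cm.) of the five printed scale points."""
    return [SCALE_START_CM + i * POINT_SPACING_CM for i in range(N_POINTS)]


def score(distance_cm):
    """Return the rating for a check mark; the score is the measured distance itself."""
    points = scale_points()
    if not points[0] <= distance_cm <= points[-1]:
        raise ValueError("check mark lies outside the scale")
    return float(distance_cm)


def deviation_from_midpoint(distance_cm):
    """Signed distance from the neutral midpoint: negative is unfavorable, positive favorable."""
    midpoint = scale_points()[N_POINTS // 2]
    return score(distance_cm) - midpoint
```

Because S could check any place on the line, the score is continuous rather than restricted to the five printed points.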

Five nonsense figures were used, con- 
structed according to the procedure described 
by Attneave and Arnoult (1956) as Method I 
for angular shapes with closed contours. 
Each figure was approximately 3½ × 4 in.
and was made of heavy white oaktag paper 
painted gray. These figures were mounted on
a white oaktag square with sides 5½ in. long.




Procedure.—The Ss were first introduced 
to the task and to the method of using the 
semantic differential scales by having them 
rate three sets of two pictures (similar to 
those in the Stanford-Binet Intelligence 
Scales) on the Pretty-Ugly scale to determine 
their ability to make these evaluations. In 
addition, S rated examples of behavior such 
as “a boy crossing the street without looking,” 
and fictitious characters such as “Donald 
Duck” on the Wise-Foolish scales. All Ss 
proceeded successfully through this task.

A procedure similar to that described 
earlier by Di Vesta (1961) was used in Phase 1. 
The Ss were instructed that they were to 
learn the meanings of three words (nonsense 
syllables) that they had not heard before. 
Nine nonsense syllables were used throughout 
the experiment, each S receiving any three at
random from among these; for example, PID,
LOM, and SUP might have been used for one S
and CIR, POU, and LIM for another S. The
polar position of evaluative meaning (posi- 
tive, neutral, or negative) associated with a 
particular nonsense syllable was randomly 
arranged within the total experiment in order 
to balance evaluations that might have been 
based on associative characteristics of the 
syllables. (In Exp. II the syllables were 
found to have no more than chance effects.)
The S was then given the first deck of cards
and instructed to look at the top card and 
indicate which of the nonsense syllables it 
defined. If S thought, for example, that 
“plain” was a definition for LOM, he was to say 
“LOM is plain.” The correction procedure
was used. After S responded, E gave the
correct response. If S was correct, he went 
on to the next card; if incorrect, he repeated 
the correct association before proceeding to 
the next card. This procedure was continued 
until S reached the criterion of responding 
correctly to two blocks of six words in each. 
This criterion was used since it signified that 
S responded correctly to each polarity at least 
four times with different words. If S did not 
reach this criterion after proceeding through 
the three decks twice, he was eliminated from 
the experiment. At the conclusion of this 
phase, S rated each of the nonsense syllables 
(assigns) on the Cruel-Kind and the Wise- 
Foolish rating scales. 
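
The Phase 1 learning criterion can be sketched as a check over S's sequence of responses. The sketch assumes that the two perfect blocks of six had to be consecutive, which the text does not state explicitly; the function name and the list-of-booleans representation are likewise hypothetical.

```python
BLOCK_SIZE = 6
CARDS_PER_DECK = 24
MAX_TRIALS = 3 * CARDS_PER_DECK * 2  # three decks, each gone through twice


def trials_to_criterion(responses, consecutive_blocks=2):
    """Return the trial count at which criterion is reached, or None.

    responses: list of booleans, True where S's first response to a card
    was correct. None means S would have been eliminated (or has not yet
    reached criterion within the responses supplied).
    """
    run = 0
    limit = min(len(responses), MAX_TRIALS)
    for end in range(BLOCK_SIZE, limit + 1, BLOCK_SIZE):
        block = responses[end - BLOCK_SIZE:end]
        # A single error anywhere in the block resets the run of perfect blocks.
        run = run + 1 if all(block) else 0
        if run == consecutive_blocks:
            return end
    return None
```

Two perfect blocks contain each polarity four times, which is the property the authors cite as the reason for this criterion.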

In Phase 2 S was told that the syllables 
he had just learned about were also the names 
of certain figures. The S was shown five 
nonsense figures and told that since they were 
unlike any other figures that he may have 
seen, they could not be called by names like 
rectangles, triangles, and the like. The S was 
further instructed that four of the figures did 


have names which he was to learn, and that 
the names corresponded in three cases to the 
syllables he had just learned, while the fourth 
figure also had a name (the unconditioned 
nonsense syllable) but that he had not learned 
it, and that the fifth figure had no name. The 
S was then shown the figures, one at a time, 
in random order until he could name all 
labeled ones correctly in four successive 
presentations without error. A procedure 
similar to that used in the first stage was used 
in reinforcing correct responses. In order to 
randomize any effects of initial preference for 
the figure, pairing of assigns and figures was 
varied among Ss. 

In the third phase, S was asked to rate 
the figures, one at a time. The order in which
figures were presented to a particular S was 
determined at random. Each figure was rated 
on three semantic differential scales, Good- 
Bad, Pretty-Ugly, and Like-Dislike. 

In order to increase the level of motivation 
S was told that he would earn a toy if he 
participated in the experiment. At the 
conclusion of the session he was allowed to 
select his choice of trinkets from among 
crayons, baseball picture cards, balloons, and 
the like. 

Subjects.—The Ss were 24 children from
one fifth-grade class in an elementary school. 
However, 4 Ss were eliminated from the 
analysis because they failed to meet the 
criterion for learning in Phase 1 and 3 other 
Ss were eliminated because they failed to use 
the rating scales appropriately. Thus, all 
analyses were based on an N of 17. The Ss’ 
CAs were 9 and 10 yr. 


Results 


An average of 42.70 (SD = 21.79) 
pairings, excluding criterial trials, of 
nonsense syllables with signs was 
required to learn the evaluative mean- 
ings of the assigns and an average of 
28.25 (SD = 17.63) pairings was re- 
quired to achieve the criterion in 
learning to associate the assign with 
the nonsense figures. The means and 
SDs for ratings of the nonsense syl- 
lables immediately after conditioning, 
on the Cruel-Kind and Wise-Foolish 
scales were compared.² Since no more
than two scores overlapped in any one
of the comparisons of the distributions
of ratings for the three treatments, it
was obvious that the differences were
significant and no formal tests of sig-
nificance were made.

Each S had been administered all
treatments and used three scales in
the final ratings of the figures within
each treatment. Accordingly, a split
split-plots analysis of variance was
used.³ A summary of the Treatments
× Scales × Ss analysis is presented
in Table 1. As is evident from the
table, the Fs for the main effects of
both Treatments and Scales were
significant (P < .01). A test of the
significance of the differences between
the means of the scales indicated that
the ratings made on the Pretty-Ugly
scale tended to be generally more
negative (P < .01) than the ratings
made on either of the other two scales.
(Similar results were found in Exp.
II.) The ratings made on the Good-
Bad and the Like-Dislike scales were
not significantly different (P > .05).
The difference in the Pretty-Ugly
scale did not interact with treatments
as indicated by the F (< 1.00) for
Treatments × Scales interaction.
Thus only treatment means obtained
by combining the ratings of the three
scales to obtain a total score were
compared. The overall comparison
of treatments using combined scores
is represented by the F of 5.06
(P < .01).


² Two tables in which are presented the
means and SDs of assign ratings for each
group in Exp. I and II, one table summarizing
the learning data for Exp. II, and one table
summarizing the analysis of variance testing
the effects of nonsense syllables within each
group in Exp. II, have been deposited with the
American Documentation Institute. Order
Document No. 7261 from ADI Auxiliary
Publications Project, Photoduplication Serv-
ice, Library of Congress; Washington 25,
D. C., remitting in advance $1.25 for micro-
film or $1.25 for photocopies. Make checks
payable to: Chief, Photoduplication Service,
Library of Congress.

³ The assumption of homogeneity of
variance was found to be tenable, via Bart-
lett’s test, for all analyses of variance except
where otherwise indicated.


TABLE 1

ANALYSIS OF VARIANCE OF FINAL RATINGS
OF FIGURES: EXP. I

Source                    df       MS        F
Between Ss                16     24.18
Within Ss
  Treatments (T)           4    195.62     5.06*
  T × Ss (Error 1)        64     38.70
  Scales (S)               2     63.83     7.24*
  T × S                    8      5.92    <1.00
  S × Ss (Error 2)       160      8.82
Total                    254

* P < .01.
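
The F ratios in Table 1 can be checked against the printed mean squares. The sketch below assumes that Treatments was tested against the T × Ss error term and that Scales and the T × S interaction were tested against the S × Ss error term; that pairing is inferred from the table layout rather than stated in the text. The dictionary and function names are hypothetical.

```python
# Degrees of freedom and mean squares as printed in Table 1.
TABLE_1 = {
    "Between Ss": (16, 24.18),
    "Treatments (T)": (4, 195.62),
    "T x Ss (Error 1)": (64, 38.70),
    "Scales (S)": (2, 63.83),
    "T x S": (8, 5.92),
    "S x Ss (Error 2)": (160, 8.82),
}


def f_ratio(effect, error):
    """F is the effect mean square divided by its error mean square."""
    return TABLE_1[effect][1] / TABLE_1[error][1]


def total_df():
    """Sum of the component degrees of freedom."""
    return sum(df for df, _ in TABLE_1.values())


for effect, error in [
    ("Treatments (T)", "T x Ss (Error 1)"),
    ("Scales (S)", "S x Ss (Error 2)"),
    ("T x S", "S x Ss (Error 2)"),
]:
    print(effect, round(f_ratio(effect, error), 2))
print("total df:", total_df())
```

Under that pairing the recomputed ratios agree with the printed Fs to within rounding of the mean squares, and the component degrees of freedom sum to the printed total.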

TABLE 2

MEANS AND SDs OF RATINGS OF FIGURES ON EACH SCALE AND TOTAL SCALES
FOR EACH TREATMENT: EXP. I

                                        Scales
Treatment        Good-Bad        Pretty-Ugly      Like-Dislike        Total
                Mean    SD       Mean    SD       Mean    SD       Mean    SD
Positive       12.71   3.10     10.65   4.95     12.59   3.26     11.98   3.97
Neutral         9.76   4.24      7.59   3.57      9.29   4.17      8.88   4.11
Negative        7.00   5.01      6.00   4.78      7.06   5.42      6.69   5.10
Control-UNS     8.82   2.29      7.53   3.65      7.41   3.97      8.12   3.60
Control         8.94   3.84      7.59   3.73     10.06   3.90      8.86   3.96


The means and SDs of individual
scale scores and total scores for Treat-
ments are presented in Table 2.
Duncan’s multiple range test (Ed- 
wards, 1960) was used in testing the 
multiple comparisons between means. 
The Treatments × Ss error term was
used in making the comparisons. The 
comparisons of the positive evaluative 
treatment with each of the four other 
treatments indicated that all differ- 
ences were significant (P < .001). 
The comparisons of the negative 
evaluative treatment with the neutral 
evaluative treatment and the control 
without labeling were also significant 
(P < .01). The comparison of the 
negative evaluative treatment with 
the control in which only labeling 
was used was not significant (.10 > P
> .05). None of the other compari-
sons among controls and the neutral
treatment were significant (P > .10).


EXPERIMENT II


Method 


Design.—The primary purpose of this
experiment was to replicate a part of Exp. I
while excluding carry-over effects from one
treatment to the next, a consideration often
neglected by previous investigators. Nine
groups, in a two-factor design with three
repeated measurements in each group, were
used. One factor was based on the three
polarities (positive, neutral, and negative) of
evaluative meaning; the second was based
on the use of three different figures.
figures were used to increase
of the results.
between figures, the resu
of meaning. In Phase 1 the nonsense syllable
was associated with signs having positive-
evaluative meanings for the positive group;
with neutral-evaluative meanings for the neu-
tral group; and with negative-evaluative
meanings for the negative group. In Phase 2
the conditioned nonsense syllable was associ-
ated with Fig. 1 by one-third of each group,
Fig. 2 by another third, and Fig. 3 by the
last third of each group. In Phase 3 Ss used
the three scales, as in the previous experiment,
for rating the figure labeled by the assign.

Procedure.—The following modifications
were made in the procedure used in Exp. I.

In Phase 1, S learned the meaning for only 
one nonsense syllable. The same list of words 
was used as in Exp. I and in exactly the same
form. However, in Exp. II S indicated
whether or not each of the 24 words was the
meaning for the nonsense syllable to be condi-
tioned. Thus, for example, if he was in the
neutral group he would say, “LOM is plain” or
“LOM is not brave.” The E then recited the
correct association. If S was correct he
turned to the next card; if incorrect he re-
peated the correct combination and then
turned to the next card.
words, depending upon the experimental con-
dition, were used
nonsense syllable in each group.

In Phase 2 S then identified and named only three
figures; one of the figures was labeled by the 
conditioned nonsense syllable and the other 
two figures were named with unconditioned 
nonsense syllables. The three figures were 
selected on the basis of pretest data with over 
150 children in which these figures had been 
ranked as the most neutral of six figures. 


used in each group. Different syllables were 
used between groups. The fact that the three 
syllables might have comprised a fourth 
factor in the total design was inadvertently 
neglected. However, since equal numbers of 
Ss were assigned each of the syllables within 
any one condition, the data were analyzed by 
separate analyses of variance to determine 
whether the syllables used within each treat- 
ment had other than chance effects on the 
results. In the three analyses thus made, 
figures and syllables were fixed factors and 
scales were repeated measures. In every 
case, the main effect of syllables and the 
interaction term in which syllables appeared 
were not significant (P > .20). All but 2 
of the 12 Fs so calculated were < 1.00.²
Accordingly, the conclusion was that the 
selected syllables had no effect on the final 
ratings made. 

In Phase 3, the S rated the figure labeled 
with the conditioned syllable. The ratings 
were made with the same three scales used in 
Exp. I. The figures labeled with the uncon-
ditioned nonsense syllables were not rated
since these data were not relevant to the 
hypotheses. 

Subjects. — The Ss were 81 children from 
the fifth grade classes of a different elementary 
school from that used in Exp. I. The Ss were 
9 and 10 yr. of age, and were randomly 
assigned to the nine experimental groups. 
Twelve Ss had been eliminated; 6 Ss failed to 
learn the task in Phase 1, and 1 S in the 
positive group, 4 Ss in the neutral group, and 
1 in the negative group failed to use the rating 
scales properly at the end of the experiment. 


Results 


The means and SDs of the learning 
data as measured by errors and trials 
to reach the criterion for the three 
main experimental groups in Phases 
1 and 2 were compared. The variances 
of the learning data in Phase 
1 were heterogeneous requiring the 
use of the Kruskal-Wallis analysis of 
variance to test differences among the 
groups. Significant (P < .01) differ- 
ences among groups were found for 
both measures. Multiple comparisons 
were tested by the Mann-Whitney U 
test. The difference in errors between 
the positive and negative group was 
not significant (P > .05) while the 
comparisons of each of those groups 
with the neutral group were significant 
(P < .01). All comparisons on 
the “trials” criterion were significant 
(P < .01). However, the order of 
difficulty in learning was not cor- 
related with the order of the final 
preference rating; therefore, the as- 
sumption was made that trials to learn 
or errors made in learning were not 
related to changes in preferences, and 
that any differences in final preference 
could be attributed to the mediation 
of meaning. No significant differences 
were found in the comparisons based 
on either of the learning measures in 
Phase 2. 
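
The Kruskal-Wallis test used above for the Phase 1 learning data replaces raw scores with ranks, so it does not require homogeneous variances. A minimal Python sketch of the H statistic follows. The function name and the example scores are hypothetical (the article does not report individual scores), and the sketch omits the correction for ties.

```python
def kruskal_wallis_h(groups):
    """Return the Kruskal-Wallis H statistic (no tie correction)."""
    pooled = sorted(x for g in groups for x in g)
    n_total = len(pooled)

    # Assign each distinct value the mean of the ranks it occupies.
    rank_of = {}
    i = 0
    while i < n_total:
        j = i
        while j < n_total and pooled[j] == pooled[i]:
            j += 1
        rank_of[pooled[i]] = (i + 1 + j) / 2  # mean of ranks i+1 .. j
        i = j

    # H = 12 / (N(N+1)) * sum(R_k^2 / n_k) - 3(N+1)
    weighted = sum(sum(rank_of[x] for x in g) ** 2 / len(g) for g in groups)
    return 12 / (n_total * (n_total + 1)) * weighted - 3 * (n_total + 1)


# Hypothetical error counts for three groups (illustration only).
positive = [1, 2, 3]
neutral = [4, 5, 6]
negative = [7, 8, 9]
h = kruskal_wallis_h([positive, neutral, negative])
```

H is referred to the chi-square distribution with (number of groups − 1) degrees of freedom; pairwise follow-ups, as in the text, would use the Mann-Whitney U test.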

The means and SDs of the assign 
ratings following the verbal conditioning 
procedure in Phase 1 were summarized.³ 
These data compared favorably 
with those obtained for Exp. I. 

The Treatments × Figures × Scales 
analysis of variance of the ratings 
made of the figures in Phase 3 is 
summarized in Table 3. While the 
overall Treatment effects were sig- 
nificant (P < .001) the differences in 
Scales (P < .001) and the interaction 


FRANCIS J. DI VESTA AND DONALD O. STOVER 


TABLE 3 

ANALYSIS OF VARIANCE OF FINAL RATINGS 

Source             df      MS        F 
Between Ss         80 
  Treatment (T)     2    382.86    10.05** 
  Figures (F)       2      6.33    <1.00 
  T × F             4     12.53    <1.00 
  Error (1)        72     38.10 
Within Ss         162 
  Scales (S)        2     78.48     8.79** 
  T × S             4     32.98     3.69* 
  F × S             4     18.30     2.05 
  T × F × S         8     12.66     1.42 
  Error (2)       144      8.93 
Total             242 

 *P < .01 
**P < .001 


of Treatments and Scales (P < .01) 
were also significant. The means and 
SDs of the ratings of the figures on 
each of the three scales are presented 
in Table 4. The overall ratings are 
also summarized to permit comparison 
with the data from Exp. I. Data for 
the specific figures have been com- 
bined since the F for differences 
between figures was < 1.00. In the 
multiple comparisons⁴ all scales significantly 
(P < .01) differentiated the 
positive from the negative group; the 
Good-Bad and the Like-Dislike scales 
significantly (P < .05) differentiated 
the neutral from the negative group; 
and the Pretty-Ugly scale significantly 
(P < .05) differentiated the neutral 
from the positive group. None of the 
other comparisons was significant. 
When the data from the three scales 
were combined the differentiation 
among the three groups was more 
clear-cut and the trends clearly cor- 
responded to the conditioning of the 
syllables as well as to the data for 


⁴ The modification of Duncan’s multiple 
comparison test described by Collier (1958) 
was used in testing the differences between 
treatment means on each scale. 


SEMANTIC MEDIATION OF EVALUATIVE MEANING 


TABLE 4 


FINAL RATINGS OF FIGURES ON EACH SCALE AND TOTAL SCALES 
BY EXPERIMENTAL GROUPS: EXP. II 


Group | Good-Bad | Pretty-Ugly | Like-Dislike | Total 

Mean | SD | Mean | SD | Mean | SD 

Positive ara | ass (mo | ao | teas | 328 

Neutral 30 6.96 | 440 9.59 410 | 89 427 

Negative | 4.52 6.19 4.00 ess | 480 | S88) S87 
Exp. I. The difference between the 
neutral and positive groups was significant 
(P < .05). The differences 
in the remaining two comparisons 
were also significant (P < .05). 
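
The F ratios in Table 3 can be checked from its mean squares: in this repeated-measures layout the between-Ss effects are tested against Error (1) and the within-Ss effects against Error (2). The Python sketch below does that arithmetic; the dictionary keys and the function name are illustrative choices, not notation from the article.

```python
# Mean squares copied from Table 3.
MEAN_SQUARES = {
    "T": 382.86, "F": 6.33, "TxF": 12.53, "Error1": 38.10,
    "S": 78.48, "TxS": 32.98, "FxS": 18.30, "TxFxS": 12.66, "Error2": 8.93,
}

# Between-Ss effects use Error (1); within-Ss effects use Error (2).
ERROR_TERM = {
    "T": "Error1", "F": "Error1", "TxF": "Error1",
    "S": "Error2", "TxS": "Error2", "FxS": "Error2", "TxFxS": "Error2",
}


def f_ratio(effect):
    """Return MS(effect) / MS(appropriate error term)."""
    return MEAN_SQUARES[effect] / MEAN_SQUARES[ERROR_TERM[effect]]


f_table = {effect: round(f_ratio(effect), 2) for effect in ERROR_TERM}
```

The recomputed values agree with the tabled Fs for Treatment, Scales, and the three interactions involving Scales, and fall below 1.00 for Figures and T × F.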


Discussion 


The results of the two experiments 
demonstrate that (a) assigns acquire 
evaluative meaning through association 
with several signs having specific polarity 
on this dimension of meaning and that 
(b) these assigns transfer 
ratings of figures associated with them. 
The changes in preferences for the 
figures corresponded to the meanings, 
rather than the specific words, with which 
the nonsense syllable had been asso- 
ciated. Where such associations had not 
been made, as in the two control groups 
was a neutral 


These findings 
f the mediation 
chain of figure, nonsense 
syllable, associations, and connotative 
meaning. However, it is evident that the 
procedures used impose a severe limita- 


tion on this ; 
alternative explanation based on simple 


association principles an equally likely 


assumption. € 
in the first learning phase it 


been formed in 
became a “synonym” for the associated 


words. These words, in turn, may have 
been directly or implicitly associated 
with the figure during the second phase. 
A more convincing demonstration of the 
mediation process would necessitate a 
comparison of the present experiments with 
those in which the two learning phases 
are reversed, that is, a comparison of the 
present experiments with one in which 
the standard mediation paradigm is employed. 
These studies are currently being 
planned and are to be conducted in 
exactly the same settings in which the 
present studies were made. 

Since the main concern, however, was 
in a comparison of the two designs, their 
respective merits and limitations warrant 
attention. Basically, they yielded the 
same results and one is, in a real sense, 
a replication of the other. The intra- 
individual design appears to be more 
sensitive in the gross demonstration of 
the effect than is the individual treat- 
ments design. Among the assumptions 
of the intraindividual design are that 
there are no order effects; and that there 
are no carry-over effects from one treat- 
ment to another. The first is tenable on 
the premise that treatments were as- 
signed in random order to Ss. The 
second, however, is a very likely occur- 
rence in the major phases of Exp. I where 
Ss learned the three meanings concomi- 
tantly. The comparisons between the 
stimuli conceivably facilitate the distinc- 
tions in evaluative response. Although 
the order of the figure was presented at 
random, once one figure had been rated 
a comparison with the others was in- 
evitable; and, since the same three scales 
were employed in each rating it would 
also be expected that carry-over effects 
would occur in the use of the scale. 




In both experiments, there were sig- 
nificant differences among the results 
obtained from the different scales. In 
all but one instance, that is, in the use of 
the Pretty-Ugly scale in Exp. II, the 
preferences for all figures based on each 
of the scales were in the predicted direc- 
tion. The total score provided more 
adequate discrimination between groups 
in both experiments. It appears that the 
primary factor in the slight differences 
among scales may be accounted for in 
terms of lowered reliability when only 
one scale is used. The fact that other 
investigators, e.g., Staats, Staats, & 
Biggs (1958), have used a single scale 
with reliable results may merely reflect 
the ability of adults to use the scales with 
greater accuracy than children. 

Within the limitations discussed above, 
the present study tentatively suggests 
the applicability of the representational 
mediation process to the development of 
attitudes (Dodge, 1955; Osgood et al., 
1957). In terms of the operations of 
measurement with the semantic differen- 
tial the meaning of a stimulus object is 
its allocation in multidimensional space, 
with attitude defined as the projection of 
this point onto the evaluative dimension 
of that space (Osgood et al., 1957). The 
principal concern is with the common 
elements of the mediating response of the 
meanings of the signs. When the assign 
is associated with evaluative signs, the 
meaning of the assign will depend upon 
the mediating response evoked by these 
signs. The assign may be attached 
to other neutral stimulus objects to 
influence the responses evoked by these 
stimuli. Attitudes are thus characterized 
as implicitly learned processes with po- 
tentially bipolar evaluative properties. 
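
The definition of attitude as the projection of a point in semantic space onto the evaluative dimension can be illustrated numerically. The Python sketch below is only an illustration: the three axis names follow the factors usually reported for the semantic differential (evaluation, potency, activity), and the coordinates are made up rather than taken from the present data.

```python
import math


def scalar_projection(point, axis):
    """Length of the projection of `point` onto the direction of `axis`."""
    dot = sum(p * a for p, a in zip(point, axis))
    axis_length = math.sqrt(sum(a * a for a in axis))
    return dot / axis_length


# Assumed axis order: (evaluation, potency, activity).
EVALUATIVE_AXIS = (1.0, 0.0, 0.0)

# Hypothetical location of a figure's meaning in the space.
figure_meaning = (2.0, -1.0, 0.5)

attitude = scalar_projection(figure_meaning, EVALUATIVE_AXIS)
```

With the evaluative dimension taken as a coordinate axis, the projection is simply the point's evaluative coordinate; the other coordinates do not enter the attitude score.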


SUMMARY 


This study compared two experimental 
designs using procedures for the study of the 
mediation process. In both experiments the 
first phase consisted of assign development by 
conditioning signs with evaluative meaning 
to neutral nonsense syllables. Phase 2 in- 
volved associating the assign with a neutral 
nonsense figure. In the third phase Ss rated 
the nonsense figure on three semantic differential 
evaluative scales. 

There were 17 fifth-grade children in Exp. 
I. Each S received five treatments. The 
dependent variable was the rating of five 
different nonsense figures. Three of the 
figures were labeled by assigns previously 
conditioned in Phase 1 to signs having nega- 
tive, neutral, or positive evaluative meanings, 
respectively. A fourth figure was labeled by 
an unconditioned nonsense syllable and the 
fifth figure was experienced equally often with 
the others but was not labeled. Ratings, 
made on each of the scales, of the figures 
labeled with conditioned assigns corresponded 
with the evaluative meaning of signs associ- 
ated with the assigns. No significant differ- 
ences were found between ratings made of the 
figure labeled by the neutral assigns, that 
labeled by the unconditioned nonsense syl- 
lable, or the figure not labeled. 

In Exp. II there were 81 fifth-grade pupils. 
In a two-factor design with repeated measures, 
independent groups were required to associate 
signs, varying, respectively, on the negative, 
neutral, and positive polarities of evaluative 
meaning, to assigns. Within each group the 
conditioned assigns were used to label one of 
three different nonsense figures. The remain- 
ing two figures were labeled by nonsense 
syllables. The results of this experiment 
corroborated those of the first. The overall 
ratings, using the combined scores of the three 
scales, differentiated significantly between the 
three groups. There was, however, a sig- 
nificant interaction between treatments and 
scales, although in all but one comparison the 
differences were in the predicted direction. 
No significant differences were found between 
figures. Nor were there significant differences 
between the nonsense syllables used with any 
one of the treatment groups. 


REFERENCES 


Attneave, F., & Arnoult, M. D. The 
quantitative study of shape and pattern 
perception. Psychol. Bull., 1956, 53, 452-471. 

Collier, R. O. Analysis of variance for 
correlated observations. Psychometrika, 
1958, 23, 223-236. 

Di Vesta, F. J. Contrast effects in the verbal 
conditioning of meaning. J. exp. Psychol., 
1961, 62, 535-544. 

Di Vesta, F. J. The effects of mediated 
generalization on the development of 
children’s preferences for figures. Child 
Develpm., 1962, 33, 209-222. 

Dodge, J. S. A quantitative investigation of 
the relation between meaning development 
and context. Unpublished doctoral dissertation, 
University of Illinois, 1955. 

Edwards, A. L. Experimental design in 
psychological research. N. Y.: Rinehart, 
1960. 

Eisman, B. S. Attitude formation: The 
development of a color-preference response 
through mediated generalization. J. abnorm. 
soc. Psychol., 1955, 50, 321-326. 

Osgood, C. E., Suci, G. J., & Tannenbaum, 
P. H. The measurement of meaning. 
Urbana: Univer. Illinois Press, 1957. 

Ostrow, S. H. The effects of verbal mediation 
on the modification of children's 
attitudes. J. educ. Psychol., 1960, 51, 
199-207. 

Rhine, R. J., & Silun, B. A. Acquisition and 
change of a concept-attitude as a function 
of consistency of reinforcement. J. exp. 
Psychol., 1958, 55, 524-529. 

Staats, A. W., Staats, C. K., & Biggs, D. A. 
Meaning of verbal stimuli changed by 
conditioning. Amer. J. Psychol., 1958, 71, 
429-431. 

Staats, A. W., Staats, C. K., & Heard, 
W. G. Language conditioning of meaning 
using a semantic generalization paradigm. 
J. exp. Psychol., 1959, 57, 187-192. 


(Received September 13, 1961) 


Journal of Experimental Psychology 
1962, Vol. 64, No. 5, 476-481 


REVERSAL AND NONREVERSAL SHIFTS IN CONCEPT FORMATION 
USING CONSISTENT AND INCONSISTENT RESPONSES 


MARTIN HARROW¹ AND ALEXANDER M. BUCHWALD 


Indiana University 


Several previous card-sorting and 
block-sorting experiments have com- 
pared the speed of making reversal 
and nonreversal shifts (Buss, 1953, 
1956; Harrow & Friedman, 1958; 
Kendler & D'Amato, 1955; Kendler 
& Mayzner, 1956; Kendler & Kendler, 
1959). Typically, a reversal shift has 
involved learning two successive sort- 
ing tasks, with correct responses for 
Task 2 being based on the same 
dimension of the stimuli as on Task 1, 
but with S being required, literally, to 
reverse his previous sorting responses. 
For example, in Task 1, red cards 
must be placed in Sorting Category A 
and green cards in Category B, and in 
Task 2 red cards belong in Category B 
and green cards in Category A. Non- 
reversal shifts have involved learning 
one sorting task and then learning a 
second sorting task based on some 
dimension of the stimuli which was 
irrelevant in Task 1. Previous sorting 
experiments with college students 
have found that reversal shifts are 
learned quicker than nonreversal 
shifts. 

In a number of the experiments in 
this area (Buss, 1956; Harrow & 
Friedman, 1958; Kendler & D'Amato, 
1955; Kendler & Kendler, 1959) it has 
been hypothesized that reversal shifts 
are learned more quickly because Ss 
respond to the same dimension of the 
stimuli as was used previously in 
Task 1 learning, whereas in a non- 
reversal shift they are required to 
respond to a new dimension of the 

¹ Now at Yale University School of Medicine. 


stimuli. According to this analysis, 
reversal shifts are learned quickly 
because the same cues that were used 
during Task 1 learning are again 
relevant during the learning of Task 2. 
The S must merely learn to make 
different responses to the previously 
used cues. On the basis of this hy- 
pothesis the results in the above ex- 
periments have been interpreted as 
supporting a mediational S-R frame- 
work. However, reversal groups in 
the above experiments, besides using 
the same dimension of the stimuli that 
was previously relevant, have also 
been required to make literal reversals, 
in which the exact opposite sorting 
response was required. 

If the analysis in terms of the ad- 
vantages of using the same dimension 
of the stimuli is correct, then this 
condition alone should be sufficient to 
produce facilitating effects. Accord- 
ing to the above analysis supporting 
the mediational S-R framework, it 
would be expected that a shift which 
allows S to respond to previously 
relevant cues (for purposes of simplification 
and to maintain the traditional 
terminology used previously, 
this type of shift will also be called 
a reversal shift) would be learned 
quickly even if the new responses that 
are required are not the exact opposite 
of the previously learned responses. 
Thus, after learning a concept based 
on the number of stimulus elements, a 
reversal group switching to a second 
number concept should learn more 
quickly than a nonreversal group 
which shifts to a concept based on the 
position of the stimuli, regardless of 
whether or not exactly opposite sorting 
responses are required. The 
present experiment attempted to test 
this analysis. 

An interesting factor which has been 
involved, in an incidental manner, in 
some of the previous experiments con- 
cerning reversal and nonreversal shifts 
(Kendler & D'Amato, 1955; Kendler 
& Mayzner, 1956) was also investigated. 
This variable is the use of 
consistent and inconsistent responses. 
These responses occur in experiments 


labeled with 
stimulus cards. Consistent responses 
occur when cards are 

with stimulus cards which are similar 
to them in some respect (e.g., when S 
is required to sort response 

according to their color and must 


between the stimulus 

appropriate response e.g., when 
S is required to sort cards 
according to their color, but must put 
red cards with a 
stimulus card). 


While consistent and inconsistent 
responses have appeared in some of 
the experiments comparing reversal 
and nonreversal shifts, the influence 
iable on the ease of making 


investigated. 
relative difficulty of 


to investigate the influence of this 
variable. This was done in a card-sorting 
situation which varied consistent 
and inconsistent responses in a 
systematic fashion while comparing 
reversal and nonreversal shifts. 


METHOD 


the stimulus card which had its lines located 
in the same corner, Corner X. For example, 
response cards having stars in the upper-left-hand 
corner were placed with Stimulus Card 
UL1, which also had its lines located in the 
upper-left-hand corner. Likewise response 
cards having stars in the lower-left-hand 
corner belonged with Stimulus Card LL2, 
response cards having stars in the lower-right- 
hand corner went with Stimulus Card LR3, 
and response cards having stars in the upper- 
right-hand corner belonged with Stimulus 
Card UR4. One of the inconsistent position 
concepts (the clockwise concept) required 
that response cards which had stars in 
Corner X be sorted with the stimulus card 
which had its lines located in the corner 
which is one position clockwise to Corner X. 
Thus, response cards having their stars 
located in the lower-left-hand corner had to be 
sorted with Stimulus Card UL1, which had its 
lines located in the upper-left-hand corner 
(one position clockwise). Likewise, response 
cards having stars in the lower-right-hand 
corner belonged with Stimulus Card LL2, 
response cards having stars in the upper- 
right-hand corner went with Stimulus Card 
LR3, and cards having stars in the upper- 
left-hand corner belonged with Stimulus Card 
UR4. The other inconsistent position con- 
cept (the counterclockwise concept) required 
that response cards which had stars in Corner 
X be sorted with the stimulus card which had 
its lines located in the corner which was one 
position counterclockwise to Corner X. 

The three “number” concepts were a 
consistent number concept and two inconsistent 
number concepts: an (n + 1) 
concept and an (n − 1) concept. The 
consistent number concept required that response 
cards with n stars on them be sorted 
with the stimulus card which also had n lines 
on it. Thus, response cards which had one 
star had to be sorted with the stimulus card 
with one line (Card UL1). Likewise, response 
cards having two stars belonged with Stimulus 
Card LL2, response cards having three stars 
went with Stimulus Card LR3, and response 
cards having four stars went with Card UR4. 
One of the inconsistent number concepts, the 
(n + 1) concept, required that response 
cards which had n stars be sorted with the 
stimulus card which had (n + 1) lines. 
Hence, response cards which had one star 
had to be sorted with the stimulus card which 
had two lines (Card LL2). Similarly, response 
cards having two stars belonged with 
Stimulus Card LR3, response cards having 
three stars went with Stimulus Card UR4, 
and response cards having four stars belonged 
with Stimulus Card UL1. The other inconsistent 
number concept, the (n − 1) 
concept, required that Ss sort response cards 
which had n stars with the stimulus card 
which had (n − 1) lines on it. 
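
The six sorting rules described above can be summarized as shifts on a ring of four stimulus cards. The Python sketch below is an illustration under stated assumptions: the function and constant names are invented here, and the wrap-around for the (n − 1) concept (one star going to the four-line card) is assumed by analogy with the (n + 1) concept, since the text does not spell that case out.

```python
# Corners in clockwise order, starting at the upper left.
CLOCKWISE = ["UL", "UR", "LR", "LL"]

# Number of lines on the stimulus card at each corner (UL1, LL2, LR3, UR4).
LINES_AT = {"UL": 1, "LL": 2, "LR": 3, "UR": 4}
CORNER_WITH = {lines: corner for corner, lines in LINES_AT.items()}


def card_name(corner):
    """Stimulus card label, e.g. 'UL1'."""
    return f"{corner}{LINES_AT[corner]}"


def position_concept(star_corner, step=0):
    """step=0 consistent, +1 clockwise, -1 counterclockwise."""
    idx = CLOCKWISE.index(star_corner)
    return card_name(CLOCKWISE[(idx + step) % 4])


def number_concept(n_stars, step=0):
    """step=0 consistent, +1 the (n + 1) concept, -1 the (n - 1) concept."""
    lines = (n_stars - 1 + step) % 4 + 1  # wraps 4 -> 1 (and, assumed, 1 -> 4)
    return card_name(CORNER_WITH[lines])
```

For example, `position_concept("LL", 1)` gives the card named in the text for the clockwise concept, and `number_concept(4, 1)` wraps around to the one-line card.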

Design.—In the present experiment an 
attempt was made to test the previous 
analysis that the facilitative effects of reversal 
shifts are due to the advantages of using the 
same dimension of the stimuli when shifting 
concepts. An attempt also was made to 
examine the influence of consistent and inconsistent 
responses on reversal and nonreversal 
shifts. This was achieved in a three-dimensional 
factorial design, in which Ss 
learned two successive card-sorting tasks. 
During Task 1 all correct responses were 
inconsistent, with half of the Ss learning 
an inconsistent position concept and the 
other half learning an inconsistent number 
concept. 

For Task 2 Ss were divided into eight 
subgroups with half of the Ss learning a 
concept involving the same dimension of the 
stimuli as was previously relevant (reversal 
shifts) and half learning a concept involving 
a different dimension of the stimuli than was 
previously relevant (nonreversal shifts). Half 
of the Ss learned a concept involving consistent 
responses and half learned a concept 
involving inconsistent responses. Similarly, 
half of the Ss learned position concepts and 
half learned number concepts. This permitted 
a 2 × 2 × 2 factorial design. 
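
The eight Task 2 subgroups follow mechanically from crossing the three two-level factors, with the Task 1 dimension fixed by the shift type. The Python sketch below enumerates them; the field names are illustrative, and the figure of 8 Ss per subgroup is taken from Table 1.

```python
from itertools import product

SHIFTS = ("reversal", "nonreversal")
RESPONSES = ("consistent", "inconsistent")
DIMENSIONS = ("position", "number")


def task1_dimension(task2_dimension, shift):
    """A reversal shift keeps the Task 1 dimension; a nonreversal shift changes it."""
    if shift == "reversal":
        return task2_dimension
    return "number" if task2_dimension == "position" else "position"


groups = [
    {
        # Task 1 always required inconsistent responses.
        "task1": ("inconsistent", task1_dimension(dimension, shift)),
        "task2": (responses, dimension),
        "shift": shift,
        "n": 8,  # Ss per subgroup, as listed in Table 1
    }
    for shift, responses, dimension in product(SHIFTS, RESPONSES, DIMENSIONS)
]

total_ss = sum(g["n"] for g in groups)
```

Under this enumeration half of the subgroups begin with a position concept and half with a number concept, matching the Task 1 arrangement described above.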
Procedure.—The Ss were tested individually. 
The instructions read to S indicated 
that as each response card was shown to him 
he should point to the stimulus card he 
thought it belonged with, and that by E telling 
him whether he was right or wrong he would 
gradually find out where each response card 
really belonged. 

The response cards were presented randomly 
and individually to each S by being 
placed in the slot which was at the top of the 
card holder. The criterion of learning for 
both Tasks 1 and 2 was 12 successive correct 
responses. All Ss who met the criterion on 
Task 1 within 160 trials were required to 
learn Task 2. 

The change in the pattern of reinforcement 
for the second concept was made without informing 
Ss. The Ss who had not learned the 
second concept within 500 trials were arbitrarily 
assigned a score of 500 for Task 2 
learning. 

In order to avoid partial reinforcement of 
the first concept during learning of the second 
concept (Buss, 1956; Gormezano & Grant, 
1958; Harrow & Friedman, 1958) 4 response 
cards were eliminated from each set of 16 
cards, for each group of nonreversal Ss. This 
left 12 response cards in each deck. The same 

TABLE 1 

NUMBER OF TRIALS TO LEARN FIRST CONCEPT (TASK 1) 
AND SECOND CONCEPT (TASK 2) 

Condition                                                   | Trials to Learn 
Task 1               | Task 2               |         |     | Task 1                  | Task 2 
Responses(a) Concept | Responses(a) Concept | Shift(b) | N  | Mean | Mdn. | Range    | Mean | Mdn.  | Range 
Incon.    Position   | Incon.    Position   | Rev.    | 8   |      | 84.5 | 13-133   |  11  |   5.0 | 2-36 
Incon.    Position   | Incon.    Number     | NR      | 8   |      | 44.5 | 6-136    |  98  |  46.0 | 20-231 
Incon.    Position   | Consist.  Position   | Rev.    | 8   |      | 60.0 | 10-127   |   9  |   9.0 | 3-18 
Incon.    Position   | Consist.  Number     | NR      | 8   |      | 65.5 | 40-133   |  18  |  14.0 | 2-50 
Incon.    Number     | Incon.    Number     | Rev.    | 8   |      |      | 25-91    |  14  |  14.5 | 2-30 
Incon.    Number     | Incon.    Position   | NR      | 8   |      |      | 3-44     | 155  | 107.0 | 8-488 
Incon.    Number     | Consist.  Number     | Rev.    | 8   |      |      | 3-110    |   6  |   4.5 | 1-17 
Incon.    Number     | Consist.  Position   | NR      | 8   |      |      | 2-100    |  38  |  15.0 | 4-178 

a Incon. = inconsistent; Consist. = consistent. 
b Rev. = reversal; NR = nonreversal. 


4 response cards were also eliminated for the 
corresponding reversal groups. 
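
The learning score used throughout is the number of trials before the run of 12 successive correct responses, with a ceiling score for Ss who never reach criterion. The Python sketch below shows one way to compute such a score from a trial-by-trial record. The function name and the example record are hypothetical, and returning the ceiling value for nonlearners mirrors the arbitrary score of 500 described for Task 2.

```python
def trials_to_criterion(outcomes, run_length=12, ceiling=500):
    """Trials to learn, excluding the criterion run itself.

    `outcomes` is a sequence of booleans (True = correct response).
    If the criterion run never occurs, the ceiling score is returned.
    """
    streak = 0
    for trial, correct in enumerate(outcomes, start=1):
        streak = streak + 1 if correct else 0
        if streak == run_length:
            return trial - run_length
    return ceiling


# Hypothetical record: three early trials, then 12 correct in a row.
record = [False, True, False] + [True] * 12
score = trials_to_criterion(record)
```

A subject who is correct on every trial from the start receives a score of 0 under this scoring.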


RESULTS 


The data concerned with the learning 
of the first concept are reported in 
Table 1. Both these results and those 
for the second concept represent the 
number of trials to learn the task, 
excluding the 12 criterion trials. To 
test for differences in speed of learning 
between Ss of different groups learning 
the same first task, two separate 
analyses of variance were computed. 
One analysis of variance compared 
the four groups initially learning an 
inconsistent position concept and one 
compared the four groups initially 
learning the inconsistent number concept. 
Using 3 and 28 df in each case 
for the position and number concepts, 
the overall Fs were, respectively, 0.48 
(P > .05), and 1.36 (P > .05). Thus, 
the data indicate that the four groups 
of Ss learning each type of task were 
equated with each other initially. 

The results for learning the second 
concept are also presented in Table 1. 
In order to determine whether there 
were any differences in the speed of 


learning the concepts among the eight 
experimental groups, a 2 × 2 × 2 
analysis of variance was carried out. 
Due to the skewness of the data, a 
logarithmic transformation was used 
in place of the raw scores to obtain 
homogeneity of variance (Edwards, 
1950). The results, as can be seen in 
Table 2, show that the reversal groups 
learned significantly faster than the 
nonreversal groups (P < .001). Like- 
wise, concepts requiring consistent 
responses were learned significantly 
faster than those requiring inconsistent 
responses (P < .001). The significant 
interaction (P < .01) between 
the reversal-nonreversal and consistent-inconsistent 
groups appears to 
be due to the much slower learning 
of the two nonreversal-inconsistent 
groups, as can be seen in Table 1. 


TABLE 2 

ANALYSIS OF VARIANCE OF LOG TRIALS 
TO LEARN SECOND CONCEPT 

Source                            df     F 
Reversal-nonreversal (R-NR)        1    41.32** 
Consistent-inconsistent (C-IC)     1    17.34** 
Number-position (N-P)              1     0.95 
C-IC × R-NR                        1     7.16* 
N-P × R-NR                         1     0.36 
N-P × C-IC                         1 
N-P × R-NR × C-IC                  1     1.53 
Within groups (MS)                56    (0.187) 

 *P < .01 
**P < .001 

Further breakdown of the reversal-nonreversal 
comparison was done by 
means of individual t tests. The 
transformed scores were used and the 
mean square within groups, obtained 
from the analysis of variance, was 
used as the basis for the error term. 
Two-tailed t tests showed that both 
kinds of reversal groups learned significantly 
faster than the comparable 
nonreversal groups. The reversal 
groups which shifted concepts within 
the same dimension of the stimuli, 
without making a literal reversal of 
their previous responses (nonliteral 
reversals), learned significantly faster 
than the comparable nonreversal 
groups (t = 2.65, df = 56, P < .02). 
The reversal groups which both 
shifted concepts within the same dimension 
of the stimuli and also made 
responses which were literal reversals 
of their previous responses, learned 
significantly faster than the comparable 
nonreversal groups (t = 6.43, 
df = 56, P < .001). The two types 
of reversal groups did not differ significantly 
from each other (t = 1.05, 
df = 56, P > .05). There was a significant 
difference between the two 
kinds of nonreversal groups (t = 4.83, 
df = 56, P < .001). 
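
A t test that borrows its error term from the analysis of variance divides the difference between two means of transformed scores by a standard error built from the within-groups mean square. The Python sketch below shows that computation. The within-groups mean square (0.187, df = 56) comes from Table 2; the group size of 16 (two subgroups of 8 pooled) and the two mean log scores are assumptions for illustration, since the article does not report the transformed means.

```python
import math

MS_WITHIN = 0.187  # within-groups mean square from Table 2 (df = 56)


def standard_error(n_a, n_b, ms_within=MS_WITHIN):
    """Standard error of a difference between two group means."""
    return math.sqrt(ms_within * (1 / n_a + 1 / n_b))


def t_statistic(mean_a, mean_b, n_a, n_b, ms_within=MS_WITHIN):
    """t for the difference between two means, using the pooled ANOVA error."""
    return (mean_a - mean_b) / standard_error(n_a, n_b, ms_within)


# Assumed: each comparison pools two subgroups of 8 Ss, giving n = 16 per side.
se = standard_error(16, 16)

# Hypothetical mean log trials for a nonreversal and a reversal group.
t = t_statistic(1.50, 1.00, 16, 16)
```

The resulting t is evaluated with the degrees of freedom of the error term (56), not with the degrees of freedom of the two groups alone.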


Discussion 


The quicker learning of reversal shifts 
as opposed to nonreversal shifts was 
again found in this experiment. It 
should be remembered that in the present 
experiment the label, reversal group, was 
extended to all groups that learned a 
second concept which required discrimi- 
nation according to a dimension of the 
stimuli that was previously relevant 
during Task 1 learning. It was found, in 




all cases, that a second sorting task 
which was based on the same dimension 
of the stimuli as the first task was learned 
quicker than comparable concepts which 
were not based on the same dimension of 
the stimuli. This occurred even when 
the reversal task did not require an exact 
literal reversal of previous sorting re- 
sponses. Thus, the data support the 
previous analysis of Buss (1956), Harrow 
and Friedman (1958), and Kendler and 
D’Amato (1955), who hypothesized that 
a reversal shift is learned quickly because 
it has the advantage of using a dimension 
of the stimuli which was previously 
relevant. In a similar manner the results 
fit in with a mediational S-R approach. 

The data also indicate that concepts 
requiring consistent responses are learned 
more quickly than concepts requiring 
inconsistent responses. These results are 
not surprising. It seems probable that Ss 
have frequently made other similar consistent 
responses before, in their daily 
lives, and that in the present experiment 
they quickly generalized to this particular 
situation. 

The significant interaction between 
reversal-nonreversal and consistent-in- 
consistent groups suggests that the 
relative difficulty of reversal as opposed 
to nonreversal shifts is affected by 
whether the concepts used require con- 
sistent or inconsistent responses. Due 
to this, when both consistent and 
inconsistent responses are used in experi- 
ments of this type they should be con- 
trolled systematically, whether the in- 
terest is in the consistent and inconsistent 
responses (which in themselves have wide 
general applicability) or whether they 
are just used incidentally. In the present 
experiment it seems appropriate to 
analyze the significant interaction term 
with respect to the differences (P < .001) 
between the consistent and inconsistent 
nonreversal groups. Although the literal 
reversals involved inconsistent responses 
and the nonliteral reversals did not, the 
significant interaction appears to reflect 
the significant differences between the 
two types of nonreversal groups. Fitting 
this interpretation, the means of the two 
kinds of reversal groups did not differ 




significantly from each other. It should 
also be noted, concerning inconsistent 
responses, that the groups required to 
make inconsistent responses were more 
sensitive to the experimental conditions. 
Thus, in similar card-sorting experiments 
it may be advisable, when practical, to 
use groups making inconsistent responses, 
due to their greater sensitivity. 


SUMMARY 


The present experiment tested the notion 
that reversal shifts are learned more quickly 
than nonreversal shifts because they involve 
responding to a dimension of the stimuli which 
was used previously. This was accomplished 
in a four-category card-sorting situation in 
which both number and position concepts 
were used. Some concepts required consistent 
responses and other concepts required 
inconsistent responses. Sixty-four Ss learned 
two successive card-sorting tasks. During 
Task 1 all concepts required inconsistent 
responses. During Task 2 half of the Ss 
learned reversal tasks and half learned nonreversal 
tasks. Also, half the concepts used 
required consistent responses and half required 
inconsistent responses. 

The results indicated that: (a) All types of 
reversal tasks (tasks requiring the use of a 
dimension of the stimuli which was previously 
relevant) were learned in fewer trials than 
comparable nonreversal tasks (tasks requiring 
attention to a different dimension of the 
stimuli). Thus the previously reported hypotheses 
were supported. (b) Concepts 
requiring consistent responses were learned 
in fewer trials than concepts requiring inconsistent 
responses. (c) The relative difficulty 
of reversal shifts as opposed to nonreversal 
shifts is affected by whether consistent 
or inconsistent responses are used, i.e., 
there was a significant Reversal-Nonreversal 
× Consistent-Inconsistent interaction. 


REFERENCES 


Berg, E. A. A simple objective technique 
for measuring flexibility in thinking. J. gen. 
Psychol., 1948, 39, 15-22. 

Buss, A. H. Rigidity as a function of reversal 
and nonreversal shifts in the learning of 
successive discriminations. J. exp. Psychol., 
1953, 45, 75-81. 

Buss, A. H. Reversal and nonreversal shifts 
in concept formation with partial reinforcement 
eliminated. J. exp. Psychol., 1956, 
52, 162-166. 

Edwards, A. L. Experimental design in 
psychological research. New York: Rinehart, 
1950. 

Gormezano, I., & Grant, D. A. Progressive 
ambiguity in the attainment of concepts on 
the Wisconsin Card Sorting Test. J. exp. 
Psychol., 1958, 55, 621-627. 

Harrow, M., & Friedman, G. B. Comparing 
reversal and nonreversal shifts in concept 
formation with partial reinforcement controlled. 
J. exp. Psychol., 1958, 55, 592-598. 

Kendler, H. H., & D'Amato, M. J. A comparison 
of reversal shifts and nonreversal 
shifts in human concept formation behavior. 
J. exp. Psychol., 1955, 49, 165-174. 

Kendler, H. H., & Mayzner, M. S., Jr. 
Reversal and nonreversal shifts in card-sorting 
tests with two or four sorting 
categories. J. exp. Psychol., 1956, 51, 
244-248. 

Kendler, T. S., & Kendler, H. H. Reversal 
and nonreversal shifts in kindergarten 
children. J. exp. Psychol., 1959, 58, 56-60. 


(Received September 25, 1961) 


Journal of Experimental Psychology
1962, Vol. 64, No. 5, 482-488


THE SERIAL POSITION EFFECT OF FREE RECALL ¹


BENNET B. MURDOCK, Jr. 


University of Vermont 


Recently Murdock (1960) has
shown that in free recall R1, the total
number of words recalled after one
presentation, is a linear function of t,
total presentation time. Nothing was
said about the serial position effect,
though this is a well-known phenomenon
of free recall (e.g., Deese &
Kaufman, 1957). However, given
that there is a serial position effect,
the simple linear relationship between
R1 and t is rather surprising.

In the customary serial position
curve of free recall, probability of
recall is plotted as a function of serial
position. This means, then, that the
area under the serial position curve is
equal to R1, the number of words
recalled after one presentation. If R1
is a linear function of t then it must
follow that the area under the serial
position curve is also a linear function
of t. However, it is not immediately
apparent how the serial position curve
varies with t in such a way as to
maintain this simple linear relationship.

The present experiment was de- 
signed as an attempt to determine 
how the serial position curve varied 
with list length and presentation rate 
while still maintaining this linear 
relationship. Unfortunately, at the 
end of the experiment it was still not 
clear how this relationship came about 
or, for that matter, whether the rela- 
tionship was even linear after all. 
The basic reason for this failure was
that the trends which did show up
were not consistent enough to justify
any clear-cut conclusions. However,
a rather definite picture of the serial
position curve itself did emerge from
the data. Therefore, the present
article will be restricted to a quantitative
description and attempted explanation
of the serial position curve
of free recall.


1 This study was supported by a research
grant, M-3330, from the National Institutes
of Health. The author would like to thank
Ellen Lissner, Cynthia Marvin, and Frank
Warhurst for analyzing the serial position
data.


PROCEDURE 


Six groups each had a different combination
of list length and presentation rate.
These six combinations were 10-2, 20-1, 15-2,
30-1, 20-2, and 40-1; the first number indicates
list length and the second number indicates
presentation time (in sec.) per item. Thus,
10-2 means a list of 10 words presented at
a rate of 2 sec/item. Notice that the first
two, middle two, and last two groups were
matched for t, total presentation time (20,
30, and 40 sec., respectively).
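
The matching of groups on total presentation time can be checked with a few lines of arithmetic. The following is only an illustrative sketch in Python; the names GROUPS and total_presentation_time are introduced here for illustration and are not part of the original procedure.

```python
# (list length, seconds per item) for the six groups named in the text
GROUPS = [(10, 2), (20, 1), (15, 2), (30, 1), (20, 2), (40, 1)]


def total_presentation_time(list_length, sec_per_item):
    """Total presentation time t, in seconds, for one list."""
    return list_length * sec_per_item


# Map each group label (e.g. "10-2") to its total presentation time
totals = {f"{n}-{rate}": total_presentation_time(n, rate) for n, rate in GROUPS}

for label, t in totals.items():
    print(label, t)
```

Adjacent pairs of groups share the same t, which is what allows list length and presentation rate to be varied while total presentation time is held constant.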

For each group there were 80 different lists. 

The lists were constructed by randomly 
selecting words from the (approximately) 
4000 most common English words (Thorn- 
dike-Lorge, 1944, G count of 20 and up), 
except that homonyms, contractions, and 
archaic words were excluded. 

Group testing was used. Lists were read 
to Ss either at every beat (presentation rate 
of 1 sec/item) or at every other beat (pres- 
entation rate of 2 sec/item) of an electric 
metronome set at a rate of 60 beats/min. 

After each list there was a recall period of 
1.5 min. The Ss wrote down as many words
as they could remember in any order that 
they wished. Each recall period was ter- 
minated by a verbal “Ready” signal which 
preceded the start of the next list by 5-10 sec. 
All groups were given 20 lists per session and 
four sessions; successive sessions were spaced 
2-7 days apart. Nothing was said about 
rehearsing while the lists were being presented. 

In all there were 103 Ss, students of both 
sexes from the introductory psychology course 
who were fulfilling a course requirement. 
Exact Ns by group are shown in Table 1. 




RESULTS 


The data were first analyzed to 
determine if practice effects occurred 
over the four sessions. Analyses of 
variance showed that there was a 
significant (P < .01) improvement 
over the four sessions for Groups 10-2, 
15-2, and 20-2; whereas the effect was 
significant at only the .05 level for 
Group 30-1 and was not significant 
(P > .05) for Groups 20-1 and 40-1. 
However, the largest difference ob- 
tained between the best and the worst 
session for any one group was 1.13 
words, and all other intersession dif- 
ferences were less than 1.0 words. 
Therefore, when this practice effect is
spread over four sessions and anywhere
from 10 to 40 serial positions,
its effect on the serial position curves
is negligible.

Table 1 shows the means and SDs of the number of words recalled per
list (R1). Each mean is based on 80 lists per S and from 15 to 19 Ss per
group. As predicted, groups with the same total presentation time did not
differ significantly in mean number of words recalled. That is, no
significant differences were found between Groups 10-2 and 20-1 (t = 1.39),
between Groups 15-2 and 30-1 (t = 1.00), or between Groups 20-2 and 40-1
(t = 0.48).


TABLE 1
MEAN NUMBER OF WORDS RECALLED

Group     N     Mean    SD
10-2      18    6.39    0.76
20-1      16    6.87    1.16
15-2      19    8.25    1.40
30-1      19    8.82    1.98
20-2      15    8.53    2.08
40-1      16    8.24    1.08


The serial position curves are shown in Fig. 1. Probability of recall
is plotted as a function of serial position. For greater generality, we
would also like to use the data from studies by Murdock and Babick (1961)
and Deese and Kaufman (1957). In the Murdock-Babick study there were
18 Ss each tested on 80 different 25-1 lists. In the Deese-Kaufman study
there were two groups of 16 Ss each; one group was tested on 10 different
10-1 lists and the other group was tested on 10 different 32-1 lists. The
serial position data were presented in the original article as Fig. 1
(p. 182) and we read the points from the two curves as accurately as
possible. These three serial position curves are shown here as Fig. 2.


Fig. 1. Serial position curves for the six groups. (Probability of recall
plotted against serial position.)

We have, then, nine different serial
position curves. The curves seem to
share certain general
characteristics: a marked recency
effect, a flat middle section, and a
primacy effect which is more precipitous
though smaller in magnitude
than the recency effect. The presence
of a flat middle section, or asymptote, 
is clearest in the 40-1 list (Fig. 1), but 
becomes less and less obvious as list 
length decreases. Actually, in the two 
10-word lists the primacy and recency 
curves may have intersected each 
other before an asymptote has been 
reached. 


More specifically, the recency effect can adequately be described by the
Gompertz double-exponential function. As given by Lewis (1960, p. 81) the
equation is $y = v g^{h^{x}}$. Probability of nonrecall (y) was plotted as
a function of list length minus serial position (x). Thus, the last word in
a list would have an x value of 0, the next to last word an x value of 1,
etc. Both v and g were fractional and positive. The asymptote v was
determined from the mean recall probabilities averaged over the flat part
of each serial position curve. The constants g and h were obtained by a
least squares method described by Lewis (1960, pp. 82-88) using the last
eight points of each serial position curve (except for the two 10-word
lists where only the last four or five points could be used).


Fig. 2. Serial position curves for 10-1 and 32-1 lists (Deese & Kaufman,
1957) and 25-1 lists (Murdock & Babick, 1961).


The evidence for this conclusion is
shown in Table 2 under the r² column.
In all cases the Gompertz equation
accounted for more than 95% of the
variance, and the mean coefficient of
determination (r²) was 97.79%.

Since in all nine cases g < 1/e the
recency effect is consistently an S shaped
curve. This characteristic can be seen
in the serial position curves of Fig. 1 and 
2. Starting from the last serial position, 
each curve is initially positively de- 
celerated and then soon becomes nega- 
tively decelerated. 


TABLE 2
VALUES FOR GOMPERTZ DOUBLE-EXPONENTIAL FUNCTION TO FIT
SERIAL POSITION CURVES IN FREE RECALL

Group     v       g       y_0     h       r²       x_i    x_.95
10-2      .548    .100    .055    .574    97.3%    1.5    6.9
20-1      .852    .050    .043    .596    98.8%    2.1    7.9
15-2      .622    .026    .016    .518    97.8%    2.0    6.5
30-1      .814    .032    .026    .546    99.3%    2.0    7.0
20-2      .730    .048    .035    .552    95.9%    1.9    6.9
40-1      .885    .036    .032    .557    98.7%    2.0    7.1
25-1*     .851    .134    .114    .634    98.3%    1.5    8.1
10-1†     .566    .206    .117    .431    98.3%    0.5    4.1
32-1†     .840    .270    .227    .644    95.7%    0.6    7.4

* From Murdock and Babick (1961).
† From Deese and Kaufman (1957).



The y_0 column gives the value of y
when x = 0. If y_0 is subtracted from
1.00 this gives the probability that the
word in the last serial position will be
correctly recalled. The results for the
six groups of the present experiment were
very similar to each other, and an
analysis of variance of the number correctly
recalled showed that the groups
did not differ significantly (F = 1.61,
df = 5/97, P > .05). The recall probabilities
were rather high but they were
not 1.00 (and had they been the Gompertz
would not be applicable); the
corresponding recall probabilities for the
Murdock-Babick and Deese-Kaufman
data were clearly lower.

The inflection point occurs between
the second and third words from the end
of the list and appears to be essentially
independent of list length and presentation
rate. The evidence for this conclusion
is given under the x_i column of
Table 2, where $x_i = -\ln(-\ln g)/\ln h$ (ln
is log base e). That is, x_i is the inflection
point, that x value at which the deceleration
changes from positive to negative.
The x_i values range from 0.5 words to 2.1
words with a mean of 1.57 words. Since
the last word in any list has an x value
of 0, a mean of 1.57 words places the inflection
point midway between the second
and third words from the end of the list.
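
The expression for the inflection point, and the condition g < 1/e for an S shaped curve, both follow from differentiating the Gompertz function twice. The steps below are a brief sketch added for the reader, using the symbols y, v, g, h, and x of the text and assuming 0 < g < 1 and 0 < h < 1.

```latex
\begin{align*}
y   &= v\,g^{h^{x}} = v\,\exp\!\left(h^{x}\ln g\right) \\
y'  &= y\,h^{x}\,\ln g\,\ln h \\
y'' &= y\,h^{x}\,\ln g\,(\ln h)^{2}\left(h^{x}\ln g + 1\right) \\
y'' &= 0 \;\Longrightarrow\; h^{x_i} = -\frac{1}{\ln g}
      \;\Longrightarrow\; x_i = -\frac{\ln(-\ln g)}{\ln h}
\end{align*}
% Because 0 < h < 1, the inflection point x_i is positive only when
% -ln g > 1, that is, when g < 1/e.
```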

Actually, both Deese-Kaufman curves
appear to have inflection points nearer
the end of the list than any of the other
curves. Otherwise, however, the inflection
points cluster rather closely in the
range of 1.5-2.1 words.

The recency effect extends over the
last eight serial positions and appears to
be essentially independent of list length.
The evidence for this conclusion is given
under the x_.95 column, where x_.95 is the
x value at which the curve is 95% down.
That is, at this point forgetting is 95%
of its asymptotic value. The 95% level serves
as a convenient criterion to mark the
end of the recency effect.

The mean of the x_.95 column is 6.88
words or, rounded off to the nearest whole
number, 7 words. Except for the Deese-
Kaufman 10-1 list all the values seem to
be very close to 7 words. Since the x
value is 7 words, the recency effect extends
over the last eight serial positions.
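
The derived columns of Table 2 can be recomputed from the fitted constants. The sketch below, in Python, is offered only as an illustration: the function names are introduced here, and the formula used for the 95% point rests on the assumption that "95% down" means the Gompertz factor g^(h^x) has reached .95, so that y is 95% of v. Under that assumption the computed values agree with the tabled ones to within rounding.

```python
import math


def gompertz_nonrecall(x, v, g, h):
    """Probability of nonrecall y = v * g**(h**x), x counted back from the last word."""
    return v * g ** (h ** x)


def inflection_point(g, h):
    """x_i = -ln(-ln g) / ln h, as given in the text."""
    return -math.log(-math.log(g)) / math.log(h)


def x_at_level(g, h, level=0.95):
    """x at which g**(h**x) equals `level` (assumed reading of '95% down')."""
    return math.log(math.log(level) / math.log(g)) / math.log(h)


# (v, g, h) for two groups, taken from Table 2
CONSTANTS = {"10-2": (0.548, 0.100, 0.574), "20-1": (0.852, 0.050, 0.596)}

for group, (v, g, h) in CONSTANTS.items():
    y0 = gompertz_nonrecall(0, v, g, h)  # equals v * g
    print(group, round(y0, 3), round(inflection_point(g, h), 1),
          round(x_at_level(g, h), 1))
```

Because y at x = 0 reduces to the product of v and g, the y_0 column serves as a simple internal check on the other tabled constants.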

Another way of indicating the similarity
among different lists is by the h
column of Table 2. In the Gompertz the
constant h determines the rate of change.
Since the values of h are all rather
similar this indicates that all curves have
a similar rate of change, and if they have
a similar rate of change all curves should
level out at about the same x value if the
numerical values of g do not differ too
greatly.

The primacy effect appears to extend 
over the first three or four serial posi- 
tions. This can be seen in the serial 
position curves of Fig. 1 and 2, as all of 
the curves seem to level out at about the 
third or fourth serial position. The 
primacy effect is so short-lived that the 
curve is difficult to describe mathe- 
matically. Actually, it may well be 
exponential. Semilog plots of the first 
three or four points of the nine curves 
(using 1.00 - v as the asymptote for each
curve) gave reasonable approximations 
to straight lines and the slopes were 
rather similar to one another. A group 
curve based on the mean (y—c) values of 
the individual curves was an excellent 
fit; the rate constant was 0.77 and the 
intercept was .27 (see Murdock & Cook, 
1960). However, the fact that this group 
curve was based on only three points 
should make one hesitant about placing 
too much confidence in it. 

Finally, the primacy and recency ef- 
fects are spanned by a horizontal asymp- 
tote. The asymptote is considered to 
extend from Serial Position 5 up to the 
last eight serial positions. That is, in a 
20-word list the asymptote would extend 
from Serial Position 5 through Serial 
Position 12, in a 30-word list from Serial
Position 5 through Serial Position 22, etc. 
That the asymptote is essentially hori- 
zontal is suggested by the middle parts 
of the serial position curves of Fig. 1 
and 2. 


A close examination of the serial position curves suggests that the
trend line may have a small positive slope rather than a zero slope.
However, this positive slope could be due to the fact that the recency
effect is only 95% down; i.e., 5% of the effect remains to exert an effect
on the (allegedly) horizontal asymptote. The proper test of this
conclusion, then, is to determine whether the obtained increment (if any)
is greater than the increment attributable to the 5% remaining from the
recency effect.


TABLE 3
PREDICTED AND OBTAINED INCREMENTS FOR ASYMPTOTE

Group     Δx     Pred.    Obt.     Diff.
20-1       8     .039     .100     .061
30-1      18     .022    -.004    -.026
20-2       8     .019     .015    -.004
40-1      28     .027     .038     .011
25-1      13     .044     .036    -.008
32-1      20     .032     .046     .014

The following analysis deals only with lists of 20 words or more; the 10
and 15 word lists could not be used because there were too few points.
For each of the six lists the obtained increment was found by fitting a
least squares regression line to the asymptote, determining its slope, then
multiplying the slope by Δx, where Δx is the difference between Serial
Position 5 and the seventh-from-last serial position (Δx = 8 for the
20-word list, Δx = 13 for the 25-word list, etc.). The expected increment
was found by obtaining the predicted y value from the Gompertz equation for
the two values of x (Serial Position 5 and seventh-from-last serial
position), then subtracting. For each list the constants shown in Table 2
were used. The predicted and obtained increments are shown in Table 3; the
difference between the predicted and obtained increments was not
statistically significant (t = 0.72, df = 5). Thus, the asymptote does
appear to be horizontal, and the slight positive slope to the curve is no
greater than would be expected from the tail end of the recency effect.
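
The arithmetic behind Table 3 can be retraced as follows. This Python sketch is illustrative only: the helper delta_x and the variable names are introduced here, and the one-sample t on the differences is one plausible reading of the test reported in the text (it has df = 5). Since the sketch works from the rounded values printed in Table 3, the t it yields need not match the reported value exactly.

```python
import math
import statistics

# (group, list length, predicted increment, obtained increment) from Table 3
TABLE3 = [
    ("20-1", 20, 0.039, 0.100),
    ("30-1", 30, 0.022, -0.004),
    ("20-2", 20, 0.019, 0.015),
    ("40-1", 40, 0.027, 0.038),
    ("25-1", 25, 0.044, 0.036),
    ("32-1", 32, 0.032, 0.046),
]


def delta_x(list_length):
    """Distance from Serial Position 5 to the seventh-from-last position."""
    return (list_length - 7) - 5


spans = [delta_x(length) for _, length, _, _ in TABLE3]

# Obtained minus predicted, rounded to the three places used in the table
diffs = [round(obt - pred, 3) for _, _, pred, obt in TABLE3]

# One-sample t of the differences against zero (df = n - 1 = 5)
mean_diff = statistics.mean(diffs)
std_err = statistics.stdev(diffs) / math.sqrt(len(diffs))
t_value = mean_diff / std_err

print(spans)
print(diffs)
print(round(t_value, 2))
```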


DISCUSSION 


We have presented data to show that 
the serial position curve of free recall is 
characterized by a rather steep (possibly 
exponential) primacy effect, an S shaped 
recency effect, and a horizontal asymp- 
tote extending between the primacy and 
recency effect. An idealized curve for a 
24-word list is shown in Fig. 3. Its equa- 
tion is 


$$P = 1.00 + .27\,e^{-.77(x-1)} - .772\,(.042)^{.558^{(L-x)}}$$


where L is list length and x is Serial
Position 1, 2, 3, ... L. The constants
for the primacy effect were those of the
group curve discussed above while the
constants for the asymptote and the
recency effect were the mean values of
the constants given in Table 2 for the six
lists of the present experiment.
The curve of Fig. 3 is an empirical curve, not a rational curve. It is an
attempt to describe the serial position effect of free recall
quantitatively, not explain it. Not only does this empirical curve
represent the nine curves of Fig. 1 and 2 quite well, but also it is
consistent with several other sets of data. For one, it agrees with serial
position curves for 20-1 lists reported by Deese (1957, Fig. 1, p. 580).
For another, it agrees well with some unpublished curves culled from
several experiments recently reported by Murdock (1960). Finally, the
exact same trends are present in some memory-span data reported by Waugh
(1960, Fig. 3, p. 75).


Fig. 3. Idealized serial position curve for 24-word list.


However, the empirical curve of Fig. 3
is not in agreement with results reported
by Bousfield, Whitmarsh, and Esterson
(1958). These authors used 5-, 10-, 20-,
and 40-word lists all presented at a rate
of 2.5 sec/word, and consistently found
the primacy effect more marked than the
recency effect. Both Bousfield et al.
(1958, pp. 260-261) and Deese (1957, pp.
581-582) suggest that the relatively slow
presentation rate may have encouraged
rehearsal and thus led to the greater
primacy effect. To investigate this
possibility, we conducted an additional
experiment with 35 Ss using 10 20-2.5
lists. The 20-word length was selected
because the curves of Bousfield et al.
(1958, Fig. 1, p. 258) seemed to show the
most pronounced primacy effect for this
length list. As Bousfield et al. (1958)
apparently used a somewhat longer
recall period we used a 4-min. recall
period in this additional experiment;
otherwise the procedure was identical
with that of the other experiments
reported here.

The results of the experiment are shown in Fig. 4. As can be seen, in
general the results are quite consistent with the empirical curve of
Fig. 1, and in particular the recency effect is more pronounced than the
primacy effect. This experiment clearly shows that the results of
Bousfield et al. (1958) are not due to the slower presentation rate
per se.


Fig. 4. Serial position curve for 20-2.5 lists.

Why did Bousfield et al. (1958) find 
primacy more pronounced than recency? 
One possibility is their instructions. 
Twice in their instructions they told Ss 
that the words were to be recalled, 
“in the order in which they occur 
in your memory.” The stress on order 
may have given Ss a set to recall the 
words in the order presented, and Deese 
(1957) has shown that instructions to Ss 
are an important variable in determining 
the shape of the curve. A second possi- 
bility is the design used. Bousfield et al. 
(1958) used a counterbalanced design 
such that each S had only one list at each 
length. Thus, in effect each list was 
(to S) of unknown length, and this fact 
may have encouraged rehearsal in the 
order of presentation. 

In any event, under the conditions of
the present experiment there seems little
doubt that the serial position effect of
free recall is essentially as depicted in
Fig. 3. Of course, as Deese (1957, p. 581)
has noted, the serial position curve is
sensitive to the introduction of experimental
variables. However, it has been
found that more items are recalled with
free recall than with ordered recall
(Deese, 1957; Waugh, 1961), so evidently
free recall is the preferred, perhaps even
the more basic, method of recalling a list
of unrelated words.


Finally, why does the serial position 
curve of free recall take the shape it 
does? One possible explanation is in 
terms of short-term proactive and retro- 
active inhibition. That is, each word 
in a list is both preceded by anywhere 
from 0 to (L — 1) other words and fol- 
lowed by anywhere from (L — 1) to 0 
other words. Up to a point, the more
preceding words the more short-term PI 
and the more succeeding words the more 
short-term RI. The PI and RI effects 
presumably summate to determine the 
total inhibitory effects. 

If this explanation is correct, recent
studies of the short-term retention of
individual items should provide an
indication of the course of PI and RI to be
expected. It has been found that, in
short-term memory, PI effects appear to
be greatest after about three prior words 
(Murdock, 1961). This agrees well with 
the finding that the primacy effect levels 
out after the first three or four serial 
positions. In short-term memory RI 
effects appear to approach an asymptotic 
value greater than zero (Murdock, 1961; 
Peterson & Peterson, 1959). This agrees 
well with the finding of a horizontal 
asymptote in the serial position curve. 
Finally, an examination of the RI curve 
of short-term memory even suggests an 
S shaped curve (see proportion of correct 
recalls over different retention intervals, 
Tables 1 and 3, Murdock, 1961, pp. 619- 
620). This agrees well with the Gom- 
pertz recency effect suggested here. 
Thus, it would appear that all the main 
characteristics of the idealized serial posi- 
tion curve shown in Fig. 3 are compatible 
with the results obtained from the short- 
term retention of individual items, and 
these findings lend support to the idea 
that the serial position curve of free 
recall is essentially a manifestation of 
short-term PI and RI effects. 


SUMMARY 


This experiment was a study of the serial
position effect of free recall. Curves were
obtained for 10-2, 20-1, 15-2, 30-1, 20-2, and
40-1 lists, where the first number indicates
list length and the second number indicates
presentation time per word. On the basis of
the available evidence it was concluded that,
under the conditions of the present experiment,
the serial position curve is characterized
by a steep, possibly exponential, primacy
effect extending over the first three or four
words in the list, an S shaped recency effect
extending over the last eight words in the list,
and a horizontal asymptote spanning the
primacy and recency effect. Finally, it was
suggested that the shape of the curve may
well result from proactive and retroactive
inhibition effects occurring within the list
itself.


REFERENCES 


Bousfield, W. A., Whitmarsh, G. A., &
Esterson, J. Serial position effects and
the “Marbe effect” in the free recall of
meaningful words. J. gen. Psychol., 1958,
59, 255-262.

Deese, J. Serial organization in the recall of
disconnected items. Psychol. Rep., 1957, 3,
577-582.

Deese, J., & Kaufman, R. A. Serial effects
in recall of unorganized and sequentially
organized verbal material. J. exp. Psychol.,
1957, 54, 180-187.

Lewis, D. Quantitative methods in psychology.
New York: McGraw-Hill, 1960.

Murdock, B. B., Jr. The immediate retention
of unrelated words. J. exp. Psychol.,
1960, 60, 222-234.

Murdock, B. B., Jr. The retention of
individual items. J. exp. Psychol., 1961, 62,
618-625.

Murdock, B. B., Jr., & Babick, A. J. The
effect of repetition on the retention of
individual words. Amer. J. Psychol., 1961,
74, 596-601.

Murdock, B. B., Jr., & Cook, C. D. On
fitting the exponential. Psychol. Rep.,
1960, 6, 63-69.

Peterson, L. R., & Peterson, M. J. Short-
term retention of individual verbal items.
J. exp. Psychol., 1959, 58, 193-198.

Thorndike, E. L., & Lorge, I. The teacher's
word book of 30,000 words. New York:
Teachers College, Columbia University,
1944.

Waugh, N. C. Serial position and the
memory-span. Amer. J. Psychol., 1960,
73, 68-79.

Waugh, N. C. Free versus serial recall.
J. exp. Psychol., 1961, 62, 496-502.


(Received October 6, 1961) 


Journal of Experimental Psychology
1962, Vol. 64, No. 5, 489-494


THE SCALING OF SUBJECTIVE ROUGHNESS 
AND SMOOTHNESS ¹


S. S. STEVENS AND JUDITH RICH HARRIS


Harvard University 


These experiments began as an at- 
tempt to apply the method of magni- 
tude estimation to a continuum for 
which the stimulus seemed to have no 
metric scale, only an ordinal scale of 
grades of sandpaper, or emery cloth. 
Unexpected discoveries led on to more 
engaging inquiries. At the outset, 
both a ratio scale and a category scale 
of apparent tactual roughness were 
determined with 12 grades (grits) of 
emery cloth. The relation between 
the ratio scale and the category scale 
was typical of the relation found on 
prothetic continua. (For these first
results, see Stevens, 1961a; for an 
earlier related study, see Dudek & 
Baker, 1956.) 


The next study was an exercise in which 
two students, C. S. Harris and J. P. McMa- 
hon, asked 12 Os to judge the smoothness of 
the stimuli instead of the roughness. The 
ratio scale of smoothness approximated the 
inverse, or reciprocal, of the ratio scale found 
for roughness, and the category scale of 
smoothness was the reverse, or the comple- 
ment, of the category scale of roughness. 
These results resemble Torgerson’s (1960)
findings when he scaled both the apparent 
lightness and the apparent darkness of gray 
papers. 

In terms of a linear scale of apparent
roughness, it turned out that the stimuli used
were bunched rather tightly at the low
(smooth) end of the continuum, so much so
that the two category scales were almost
logarithmic functions of the respective ratio
scales. Other studies of category scaling
suggest that the nearly logarithmic form
found here is an accident of the stimulus spacing,
and that if an iterative procedure were
used to arrive at a “pure” category scale the
curve for roughness would be less curved than
a logarithmic function (Stevens & Galanter,
1957).


1 This research was supported by Grant
G-10716 from the National Science Foundation
(Report PAR-266).

Since the available number of emery cloth 
grits is limited, it is difficult to determine a
pure category scale, but when a sample of 
grits more uniformly spaced in subjective 
roughness was used, the form of the category 
scale changed as predicted: it became much 
less curved than a logarithmic function when 
plotted against the ratio scale of subjective 
roughness. Ten Os judged grits 320, 120, 
80, 50, 40, 30, and 24 twice each on a seven- 
point scale. The average judgments were 
1.17, 2.17, 3.04, 4.12, 5.17, 6.08, and 6.62. 
These values determine a line that is straighter 
than a logarithmic function—a line that is not 
far from the pure form of the category scale, 
as evidenced by the tendency of Os to use each 
category number approximately equally often 
(Stevens & Galanter, 1957). It appears, 
therefore, that roughness behaves like a 
prothetic continuum, and that the pure 
category scale is not a logarithmic function 
of the magnitude scale (Eisler, 1962). 


From these preliminary studies a 
surprising fact emerged: magnitude 
estimations of roughness, and of 
smoothness, turned out to give fairly 
straight lines when plotted in log-log 
coordinates against the grit number of 
the emery cloths. Another instance, 
it seems, of the psychophysical power 
law. (Grit number refers to the 
number of openings per inch in the 
screen employed to sift the abrasive 
particles.) If apparent roughness and 
its reciprocal, apparent smoothness, 
are power functions of particle size, it 
becomes a challenging task to deter- 
mine more accurately the exponents 
involved. In the preliminary experiments,
each involving 12 Os, the
approximate exponents were -1.5 for
roughness and +1.2 for smoothness
when measured against grit number.
The next problem was to determine
which of these two exponents, if
either, is more nearly correct in
absolute value.
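
The argument rests on the fact that a power function plots as a straight line in log-log coordinates, with the exponent as its slope, and that a reciprocal function has the same slope with opposite sign. The following Python sketch illustrates this with synthetic magnitudes only: the exponent of -1.5 and the scale factor of 1000 are assumed for the illustration and are not the judgments obtained here, and the function name loglog_slope is introduced for the purpose. The grit numbers are the twelve used in the present experiments.

```python
import math

# The twelve grit numbers used as stimuli
GRITS = [320, 240, 220, 180, 120, 100, 80, 60, 50, 40, 30, 24]


def loglog_slope(xs, ys):
    """Least squares slope of log10(y) regressed on log10(x)."""
    lx = [math.log10(x) for x in xs]
    ly = [math.log10(y) for y in ys]
    mean_x = sum(lx) / len(lx)
    mean_y = sum(ly) / len(ly)
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(lx, ly))
    sxx = sum((a - mean_x) ** 2 for a in lx)
    return sxy / sxx


# Synthetic magnitudes following an assumed power law (illustration only)
roughness = [1000.0 * grit ** -1.5 for grit in GRITS]
# Smoothness taken as the exact reciprocal of roughness
smoothness = [1.0 / r for r in roughness]

rough_slope = loglog_slope(GRITS, roughness)
smooth_slope = loglog_slope(GRITS, smoothness)
print(round(rough_slope, 3), round(smooth_slope, 3))
```

If the two judged attributes were exact reciprocals, the fitted exponents would therefore agree in absolute value, which is why the discrepancy between the preliminary exponents calls for further measurement.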


APPARATUS AND PROCEDURE 


The stimuli were the twelve grits, 320, 240, 
220, 180, 120, 100, 80, 60, 50, 40, 30, and 24. 
It was assumed that these grits, Tri-M-ite 
brand, met the published standards for 
abrasives (Horton, 1957) which allow the 
grain size to vary with a standard deviation of 
approximately 20% around the nominal size. 
Two different sets of cloths were used. To a
sufficient approximation for the present pur- 
pose, the grit number can be regarded as 
proportional to the reciprocal of the grain 
diameter. 

The stimuli were presented one at a time 
to O, who placed his hand through a cloth- 
screened opening. He stroked the emery 
cloth twice with his index and middle fingers. 
Two samples of each of the 12 grits were 
presented in a different irregular order to 
each O. 

Two experiments used the method of 
magnitude estimation, and one used cross- 
modality matching, as follows. 

Experiment 1 (with assigned modulus).—
Grit 100 was presented first and O was told
to call it 10. Of the 20 Os, 10 first judged
roughness and on a later date judged smoothness.
The other 10 reversed the order.

Written instructions were given to each O.
When roughness was being judged, the
instructions were:


I am going to present a series of surfaces
that vary in roughness. Your task is to tell
me how rough they feel by assigning numbers
to them. The first will be the standard
roughness, which we will call 10. Your task
is to assign numbers proportional to your
subjective impression. Use whatever numbers
seem appropriate: fractions, decimals,
or whole numbers. For example, if a surface
feels 3 times as rough as the standard,
say 30; if it feels 1/5 as rough, say 2, etc.
Try not to worry about being consistent;
try to give the appropriate number to each
surface regardless of what you might have
called some previous surface. In feeling the
surfaces, draw your index and middle
fingers twice across each surface as it is
presented.


When the task was to judge smoothness, 
the words “smooth” and “smoothness” were 
substituted for “rough” and “roughness” in
the preceding instructions. 




Experiment 2 (no assigned modulus).—The 
first stimulus presented was different for 
each O, and, instead of there being a modulus 
called 10, the instruction was to call the first 
stimulus “any number you think appro- 
priate.” Roughness and smoothness were 
judged on different days by each of 10 Os. 

Experiment 3 (cross-modality matching).— 
The O adjusted the intensity of a band of 
noise (500 to 5000 cps) until its subjective 
magnitude appeared to match the subjective 
magnitude of the roughness (or smoothness) 
of each grade of emery cloth. Two matches 
were made to each of the 12 cloths in an 
irregular order by 10 Os. One group of 10 Os 
matched loudness to roughness ; another group 
of 10 Os matched loudness to smoothness. 

The loudness was controlled by a “sone 
potentiometer” (two 2000-ohm potentiome- 
ters, ganged and cascaded). An additional 
attenuator in series enabled E to keep O's
adjustments of the sone potentiometer more 
or less centered in the usable range. The 
voltage across the PDR-8 earphones was 
measured with a vacuum-tube voltmeter. 


RESULTS 


The geometric means of the results 
of the 20 Os who judged roughness 
and smoothness in terms of an as- 
signed modulus (Grit 100 called 10) 
are shown in Fig. 1. The slopes of the 
two straight lines are equal but of 
opposite sign (+1.4 and -1.4).
This is what is called for by the
reciprocal relation between roughness 
and smoothness. On the other hand, 
the points do not always fall close to 
the lines. In general, there is a 
tendency for both functions to be 
slightly concave downward. 

It is also of interest that the stand- 
ard called 10 at the beginning of each 
run was judged less rough when it was 
presented again as a stimulus to be 
judged (geometric mean = 6.43). 
When the task was to judge smooth- 
ness, the standard called 10 was later 
judged more smooth (geometric mean 
= 13.03). A similar adaptation—if 
that is what it should be called—was 
also noted in the preliminary experiments.


SCALING OF SUBJECTIVE ROUGHNESS AND SMOOTHNESS 


[Figure 1 plot omitted: ordinate, magnitude estimation; abscissa, grit number.]

Fig. 1. The geometric means of the
estimations of roughness (triangles) and
smoothness (circles) are plotted against grit
number in log-log coordinates. (Each point
is based on 40 judgments from 20 Os.)


Experiment 2, with no assigned
modulus, gave the results shown in
Fig. 2. The points (geometric means)
lie closer to the straight lines and the
slopes are 1.5, a slightly greater
absolute value than in Exp. 1. Allowing
each O to choose his own modulus
appears to have produced a better
result. This added freedom has led
Os to give superior results on other
occasions when no standard was used
(Stevens, 1956). Other things being
equal, it is better in experiments with
magnitude estimation to dispense
with an assigned modulus and to
average by taking the geometric
means of the judgments. No prior
processing of the data is necessary
with geometric averaging.
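The point about geometric averaging can be illustrated with a small sketch. The data below are hypothetical (three observers who follow the same power law but choose different personal moduli); the function names are illustrative and are not taken from the original study. Because a personal modulus only multiplies every judgment by a constant, the geometric mean shifts the pooled function up or down in log-log coordinates and leaves its slope untouched, which is presumably why no prior normalization is needed.

```python
import math

def geometric_mean(values):
    """Geometric mean computed as exp of the mean log; values must be positive."""
    return math.exp(sum(math.log(v) for v in values) / len(values))

def loglog_slope(xs, ys):
    """Least-squares slope of log10(y) regressed on log10(x)."""
    lx = [math.log10(x) for x in xs]
    ly = [math.log10(y) for y in ys]
    mx = sum(lx) / len(lx)
    my = sum(ly) / len(ly)
    num = sum((a - mx) * (b - my) for a, b in zip(lx, ly))
    den = sum((a - mx) ** 2 for a in lx)
    return num / den

# Hypothetical observers: same exponent (-1.5 against grit number),
# different self-chosen moduli (scale factors).
grits = [24, 60, 120, 320]
moduli = [1.0, 7.0, 40.0]
judgments = {g: [m * g ** -1.5 for m in moduli] for g in grits}

# Pool across observers with the geometric mean, then fit the log-log slope.
pooled = [geometric_mean(judgments[g]) for g in grits]
slope = loglog_slope(grits, pooled)
print(round(slope, 3))
```

With real judgments the recovered slope would of course carry sampling error; the sketch only shows that differing moduli by themselves do not distort it.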

[Figure 2 plot omitted: ordinate, magnitude estimation; abscissa, grit number.]

Fig. 2. Similar to Fig. 1, except that in
these experiments on roughness (triangles)
and smoothness (circles) no standard stimulus
was designated by E. (Each point is based
on 20 judgments from 10 Os.)

[Figure 3 plot omitted: ordinate, sound pressure level in decibels; abscissa, grit number.]

Fig. 3. Sound pressure levels in decibels
re 1 microbar produced in the earphones when
Os matched the loudness of a noise to the
roughness (triangles) and the smoothness
(circles) of the 12 grades of emery cloth.
(Each point is the decibel average of 20
judgments from 10 Os.)

The results of matching loudness to
roughness and to smoothness are
shown in Fig. 3. The decibel averages
of the sound pressure levels produced
when Os matched loudness to roughness
and smoothness produce straight
lines when plotted against a logarithmic
scale of grit numbers. This is the
predicted outcome if the psychophysical
power law holds.

As has been repeatedly demon- 
strated (Stevens, 1959, 1961b) the ex- 
ponents (log-log slopes) obtained in 
cross-modality matches should be 
equal to the ratio between the expon- 
ents of the two modalities determined 
by magnitude estimation. If we take 
the exponent for loudness vs. sound 
pressure to be 0.6 (Stevens, 1955), and 
the exponent for roughness vs. grit 
diameter to be 1.5, the predicted slope 
of the matching function in Fig. 3 is 
2.5. The measured slope is 2.6. For 
smoothness, the measured slope is 
-2.6, as expected. In view of the
sources of error and uncertainty in 
these experiments, it is reassuring that 
the measured and the predicted ex- 
ponents in cross-modality matching 
agree within 4%. The cross-modality 
matches suggest that the roughness- 
smoothness exponents may be slightly 
greater than 1.5. If we could vary 
roughness to match loudness, presum- 
ably the exponent would be larger 
still (Stevens, 1959) but this com- 
plementary experiment would be diffi- 
cult to execute. 
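The arithmetic behind the predicted matching slope can be laid out as a short sketch. The exponents and the measured slope are the values quoted in the text; the variable and function names are illustrative only, and the conversion to decibels assumes the usual definition of sound pressure level as 20 times the common logarithm of the pressure ratio.

```python
def predicted_matching_slope(exponent_of_matched, exponent_of_adjusted):
    """Log-log slope expected when one continuum is adjusted to match another."""
    return exponent_of_matched / exponent_of_adjusted

roughness_exponent = 1.5   # roughness vs. grit diameter (magnitude estimation)
loudness_exponent = 0.6    # loudness vs. sound pressure (Stevens, 1955)
measured_slope = 2.6       # slope of the matching function in Fig. 3

predicted = predicted_matching_slope(roughness_exponent, loudness_exponent)
relative_difference = (measured_slope - predicted) / predicted

# Working backward: the roughness exponent implied by the measured match.
implied_roughness_exponent = measured_slope * loudness_exponent

# The same slope expressed as decibels per tenfold change in particle diameter.
db_per_tenfold_diameter = 20.0 * measured_slope

print(predicted, relative_difference, implied_roughness_exponent)
```

The implied exponent lands slightly above 1.5, in line with the remark that the cross-modality matches suggest a somewhat larger roughness exponent.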

In order to provide a unit and a 
formula for subjective roughness, it 
appears reasonable for the time being 
to take 1.5 as the exponent and to 
take the apparent roughness of Grit 
320 as the subjective unit. If a name
for this unit is desired, the term ruk, 
derived from a root cognate of rough, 
is suggested. Accordingly, the equa- 
tion for subjective roughness R in 
ruks, as a function of grit number G 
becomes: 


R = 5724 G^{-1.5}




In terms of the average diameter in 
millimeters of the abrasive particles, 
the exponent would be +1.5, and the 
constant would be 106.5. This latter 
value is based on the sieve openings in 
the United States standard series 
(Horton, 1957). The equation omits 
the “threshold” constant, because the 
available stimuli did not permit its 
evaluation. 
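As a minimal sketch, the two forms of the roughness equation can be written as functions. The constants 5724 and 106.5 and the exponents are those given in the text; the function names are hypothetical. The first constant follows from the choice of unit, since 320 raised to the power 1.5 is very nearly 5724, so that Grit 320 comes out at about 1 ruk.

```python
def roughness_from_grit(grit_number):
    """Subjective roughness in ruks as a function of grit number G."""
    return 5724.0 * grit_number ** -1.5

def roughness_from_diameter_mm(diameter_mm):
    """Subjective roughness in ruks from average particle diameter in mm."""
    return 106.5 * diameter_mm ** 1.5

# Grit 320 defines the unit, so it should come out very close to 1 ruk.
unit_check = roughness_from_grit(320)

# A fourfold drop in grit number multiplies roughness by 4 ** 1.5 = 8.
ratio_80_to_320 = roughness_from_grit(80) / roughness_from_grit(320)

for grit in (320, 120, 60, 24):
    print(grit, round(roughness_from_grit(grit), 2))
```

Neither function includes a threshold constant, matching the equation as stated.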

The main uncertainties regarding 
the size of the exponent for roughness 
stem from two causes, each of which 
would have an opposite effect. The 
available range of stimuli was rela- 
tively short, about 1.12 log units, a 
factor that would be expected to in- 
crease the measured exponent. On 
the other hand, the experiments involved
the matching of numbers
(or of loudnesses) to roughness (or 
smoothness), never the reverse. Ex- 
perience has shown that this un- 
balanced design tends to decrease the 
measured exponent. Whether the two 
sources of presumed bias have equal 
as well as opposite effects cannot be 
told without further experimental 
analysis. 


DISCUSSION 


Reciprocality—It is clear from Fig. 
1, 2, and 3 that values obtained in the 
judgment of smoothness are approxi- 
mately the reciprocals of values for 
apparent roughness. It seems likely that 
any continuum can be judged with at 
least fair success in terms of its reciprocal, 
although only a few have been looked 
at from this point of view. In addition 
to the continuum roughness-smoothness,
there are data on lightness-darkness 
(of surfaces) and loudness-softness (of 
noises). One can easily imagine judging
the longness or shortness of lines, the 
strength or weakness of vibrations, the 
brightness or dimness of luminances, the 
heaviness or lightness of lifted weights, 
and so on. The inverting of a continuum
is a kind of semantic matter: O is instructed
differently, and he tries to


respond with numbers that are inversely 
proportional to those he would use for 
the continuum “right side up.” At least 
he tries to report reciprocals provided he 
understands the semantic rule: a surface 
that is twice as rough is half as smooth. 

Some continua, like roughness, are 
frequently referred to in terms of the 
inverse aspect, but for many continua O
would probably find it a little strange to 
be told to judge the reciprocal. Even 
with smoothness, a few Os commented on 
the difficulty of the task. It seemed 
easier and more natural, for example, to 
match loudness to roughness than to 
match loudness to smoothness. 

Even though people may become ac- 
customed to judging in terms of the 
inverse aspect, there is a sense in which 
what may be called the degree of 
stimulation is basically different from the 
degree of its absence. When he judges 
the degree of absence of stimulation, O 
manages, with fair success, to report the 
reciprocal of his judgment of the strength 
of the sensory magnitude. A related 
type of report was tried out by E. C. 
Poulton who asked Os to estimate frac- 
tional loudness by varying only the 
denominator of their report (see Stevens, 
1956). As the tones became fainter the 
numbers became larger in a manner that 
produced a nice reciprocal relation to the 
standard results of magnitude estima- 
tion. There are many ways of asking O 
to report the apparent magnitudes of a 
series of sensory excitations, but it seems 
hardly likely that all methods of report 
that are logically equivalent will give 
equally good results. 

It is also important to note that some 
sensory attributes that may be thought 
of as opposites are not at all reciprocally 
related. Warm and cold are two striking 
examples (Stevens & Stevens, 1960). 


The exponent for warm is 1.6, for cold
1.0. These seem to be two different
sense modalities, not two names for the
same continuum, and the stimulus
domains of the two continua do not
overlap. Presumably experiments could 
be run in which O told how warm or how 
neutral the stimulus felt, and the results 
would follow the reciprocal relation. An
analogous experiment could be run with
O judging cold or neutral, again with the 
expectation of obtaining reciprocal func- 
tions, but the slopes (exponents) would 
differ greatly from the slopes obtained 
with the warm-neutral pair. One must 
distinguish, therefore, between word 
pairs that refer to two different continua 
and word pairs that refer reciprocally to 
a single continuum. A given tempera- 
ture is usually either warm or cold, but a 
given surface is both rough and smooth. 

The stimuli—The stimuli used in 
these studies were those samples of emery 
cloth that happened to be available in the 
laboratory. They appeared to be in good 
order, and they seemed to have been a 
good set of stimuli, in the sense that they 
gave rise to orderly power functions 
when used in eight different experiments. 
It is noteworthy, however, that, as the 
experiments accumulated, the evidence 
became increasingly clear that there were 
minor peculiarities in the series of grits, 
or perhaps in the manner of their ad- 
hesion to the cloths. It is not usual to 
argue from the results of magnitude 
estimation to a nonuniformity in a manufacturing
process, but the evidence for
at least one such “defect” seems clear. 
The cloth with Grit 120 was consistently 
judged too rough relative to Grit 100. 
This fact is especially evident in Fig. 2 
and 3 and was also clear in the prelimi- 
nary experiments. 

Under the microscope one can see that 
Grit 120 involves smaller abrasive grains 
than Grit 100, as indeed it should, but 
the two cloths differ in the degree to 
which the particles appear to be im- 
mersed in the adhesive. The finer par- 
ticles (Grit 120) appear to sit higher on 
the cloth and to present more of their 
surface to view. To the touch, the two 
cloths (100 and 120) feel different in a 
way that can best be described as 
qualitative. The skin catches occasion- 
ally on the finer particles in a way that is 
not characteristic of the coarser particles. 
This catching, due presumably to a 
shallower immersion of the particles in 
the bonding adhesive, may account for 
the relatively higher numerical estimations
of the apparent roughness of the
finer grit. 

Whatever the explanation, it becomes 
clear that particle size is not the only 
variable in a bonded abrasive that 
can affect apparent roughness. Never- 
theless, the samples used in these ex- 
periments seem to have been only 
minimally contaminated by other factors, 
for sensed roughness grows as a relatively 
clean power function of average particle 
diameter. 


SUMMARY 


Preliminary experiments showed that Os 
can make consistent judgments of tactual 
roughness and smoothness. The stimuli were 
12 grits of emery cloth. Magnitude estima- 
tions of roughness and smoothness produced 
straight lines when plotted (log-log) against 
grit number. The exponents of these power 
functions were determined in two experiments 
with magnitude estimation and one with cross- 
modality matching against loudness. All 
three experiments gave results that were 
power functions of grit number with exponents
in the vicinity of -1.5 for roughness
and +1.5 for smoothness. The cross- 
modality matches also confirmed the expon- 
ents determined by magnitude estimation. 


REFERENCES 


DUDEK, F. J., & BAKER, K. E. The constant-
sum method applied to scaling subjective
dimensions. Amer. J. Psychol., 1956, 69,
616-624.

EISLER, H. Empirical test of a model relating
magnitude and category scales. Scand.
J. Psychol., 1962, 3, 88-96.

HORTON, H. L. (Ed.). Machinery's handbook.
(15th ed.) New York: Industrial Press,
1957. P. 1459.

STEVENS, J. C., & STEVENS, S. S. Warmth
and cold: Dynamics of sensory intensity.
J. exp. Psychol., 1960, 60, 183-192.

STEVENS, S. S. The measurement of loudness.
J. Acoust. Soc. Amer., 1955, 27, 815-829.

STEVENS, S. S. The direct estimation of
sensory magnitudes: Loudness. Amer. J.
Psychol., 1956, 69, 1-25.

STEVENS, S. S. Cross-modality validation of
subjective scales for loudness, vibration,
and electric shock. J. exp. Psychol., 1959,
57, 201-209.

STEVENS, S. S. To honor Fechner and repeal
his law. Science, 1961, 133, 80-86. (a)

STEVENS, S. S. The psychophysics of sensory
function. In W. A. Rosenblith (Ed.),
Sensory communication. New York: Wiley,
1961. Pp. 1-33. (b)

STEVENS, S. S., & GALANTER, E. H. Ratio
scales and category scales for a dozen
perceptual continua. J. exp. Psychol.,
1957, 54, 377-411.

TORGERSON, W. S. Quantitative judgment
scales. In H. Gulliksen and S. Messick
(Eds.), Psychological scaling. New York:
Wiley, 1960.


(Received October 21, 1961) 


Journal of Experimental Psychology
1962, Vol. 64, No. 5, 495-504


AN EVALUATION OF THE ACTIVATIONIST HYPOTHESIS 
OF HUMAN VIGILANCE 1


JACK A. ADAMS AND LAWRENCE R. BOULTER


University of Illinois 


The arousal or activationist hy- 
pothesis of human attentiveness holds 
that the stimuli impinging on S from 
external sources, or acting from within 
him, determine alertness through the 
reticular activating system of the 
brain (Broadbent, 1958; Fiske, 1961; 
Frankmann & Adams, 1962; Lindsley, 
1957; Malmo, 1959). Studies of 
human vigilance, or long-term atten- 
tiveness for occasional signals in a 
monitoring task, commonly show a 
decrement in signal detection as a 
function of observation time, and 
Scott (1957) interprets this phe- 
nomenon as adaptation to the nearly 
unchanging stimulation. Frankmann 
and Adams (1962) have pointed out 
that vigilance decrement is not as 
ubiquitous as is sometimes believed, 
and that investigators have failed to 
ask why decrement is more promin- 
ently associated with simple tasks and 
is absent or small in complex visual 
tasks with multiple stimulus sources. 
In terms of the activationist hy- 
pothesis, there appear to be special 
sources of stimulation in complex 
tasks. 

1 This research was supported by Contract
AF 19(604)-5705 monitored by the Operational
Applications Laboratory, Deputy for
Technology, Electronic Systems Division, Air
Force Systems Command.

Efforts to use the hypothesis explicitly
in a careful predictive sense
for vigilance findings are hampered by
an absence of relationships between
the locus, amount, and type of stimulation,
and measures of molar behavior
(Adams, Stenson, & Humes, 1961;
Frankmann & Adams, 1962). However,
with the hypothesis firmly rooted
in studies of the reticular formation,
there are a number of physiological
findings that can be used to facilitate
a search for the missing definitions and
relationships at the molar level. The
two experiments reported here reason
from physiological findings and seek
to identify sources of stimulation that
can deter decrement in complex visual
tasks. One derivation was that a
special source of stimulation in com- 
plex tasks can be the proprioceptive 
stimulation associated with the head 
and eye movements of scanning the 
stimulus array and it is based on 
proprioception collaterals in the re- 
ticular formation (Rossi & Zanchetti, 
1957; Samuels, 1959). When the 
stimulus sources have wide spatial 
separation, head movements would be 
the most prominent source of pro- 
prioceptive stimulation, but even 
when multiple stimulus sources are 
quite close together we can still expect 
proprioceptive stimulation from eye 
movements alone (Cooper, Daniel, 
& Whitteridge, 1955; Whitteridge, 
1960). The changing retinal stimula- 
tion as the head and eyes scan the 
visual scene is an added source of 
stimulation. 

Another derivation was that re- 
sponse complexity can produce in- 
ternal stimulation that should de- 
crease the amount of decrement, and 
its interaction with the proprioception 
and visual variables was included to 
illuminate the effects on vigilance of 
another source of stimulation in 
complex tasks. Physiologically, complex
responses might be expected to
influence alertness through cortical
outputs fed back to the reticular 
system (Adams et al., 1961; Lindsley, 
1957; Rossi & Zanchetti, 1957; 
Samuels, 1959). Adams and Boulter 
(1960) and Adams, Stenson, and 
Humes (1961) have shown that a 
relatively complex decision response 
of four choices associated with each 
signal detection will decrease the 
amount of vigilance decrement. 


EXPERIMENT I
Method 


Apparatus.—Two units of the multiple 
source vigilance apparatus were used, with 2 
Ss tested at a time. The S sat at a desk and 
faced a semicircular track fixed at eye level 
to the back of the desk. On the track, and 30 
in. from S’s eyes, were mounted four small 
(4X2 in. front surface) digital display 
boxes that presented a two-digit number 
(s in, high) which was the critical signal S 
had to detect. Six different numbers were 
used. The number was always bright enough 
for easy reading, but the brightness level was 
intentionally low so S had to orient to a stim- 
ulus source to see the signal easily. When the 
critical signal came on, S was instructed to 
report the event as fast as possible by pressing 
a detection button located 4 in. from his 
fingertips. A timer started at the onset of the 
signal and stopped when the detection button 
was touched, thus giving a response latency 
score. This simple mode was called detection
responding. Under another experimental con- 
dition called memory responding, which is 
described in more detail below, S had a set 
of six memory buttons in addition to the 
detection button. These buttons were each 
labeled with one of the numbers and were 
arrayed in a semicircle around the detection 
button, and 4 in. from it. After S pressed the
detection button as rapidly as possible, he 
further selected and pressed a memory 
button, also as rapidly as possible. The 
latency from the onset of the signal to pressing 
the memory button was recorded too. An S 
responded with his preferred hand, but he 
always kept both forearms in an armrest to 
standardize the position of the arm and hand. 
Latency measures such as these have been 
found to have wide variability when the 
starting position for the hand is not con- 
trolled. A microswitch under each armrest 
turned on a light at E's panel whenever S removed
an arm from the armrest and, when
this occurred, E administered a brief reminder
over an intercom system. An occasional
reminder in the practice session, which
preceded criterion sessions, was usually
sufficient to standardize this procedure. All
signals and intersignal intervals were programed
and automatically read and timed by
a digital tape reader that fed the two apparatus
units simultaneously. Each S was
in his own experimental room, which had
normal illumination. The E and his console
were located in another building.

Description of stimulus series—Each ses- 
sion was 2.5 hr. long, and for scoring purposes 
was divided into five 30-min. trials. There 
were eight signals on a trial, and each signal 
lasted for 5 sec. The signals were the numbers
10, 20, 30, 40, 50, and 60, and each number
occurred at least once on a trial. Two numbers
appeared at each of the four stimulus sources 
on a trial. Intersignal intervals on each trial 
were .7, 1.2, 2.0, 3.0, 4.0, 5.0, 6.2, and 7.5 
min. Both the assignment of intersignal 
intervals, and of signals to the four stimulus 
sources, were separately randomized for each 
trial. By repeated random sampling, a pool of 
10 tapes was generated according to these 
rules.
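The tape-generation rules above can be sketched roughly as follows. This is only an illustration under stated assumptions: the original tapes were punched for a digital tape reader, and the text does not say how the two extra signals (eight signals, six numbers) were chosen, so drawing them at random from the six numbers is an assumption, as are all function and variable names.

```python
import random

NUMBERS = [10, 20, 30, 40, 50, 60]                       # critical signals
INTERVALS_MIN = [0.7, 1.2, 2.0, 3.0, 4.0, 5.0, 6.2, 7.5]  # intersignal intervals
SOURCES = [1, 2, 3, 4]                                   # four display boxes

def make_trial(rng):
    """One 30-min trial: eight (interval, source, number) events."""
    # Every number occurs at least once; two extra draws (assumed random)
    # bring the total to eight signals.
    numbers = NUMBERS + [rng.choice(NUMBERS) for _ in range(2)]
    rng.shuffle(numbers)
    # Two signals at each of the four sources, in random order.
    sources = SOURCES * 2
    rng.shuffle(sources)
    # Each of the eight intervals is used once per trial, in random order.
    intervals = INTERVALS_MIN[:]
    rng.shuffle(intervals)
    return list(zip(intervals, sources, numbers))

def make_tape(rng, n_trials=5):
    """One 2.5-hr session tape made of five independently randomized trials."""
    return [make_trial(rng) for _ in range(n_trials)]

rng = random.Random(1962)  # arbitrary seed so the sketch is reproducible
pool = [make_tape(rng) for _ in range(10)]
print(pool[0][0])
```

Randomizing intervals and source assignments separately, as here, mirrors the statement that the two assignments were separately randomized for each trial.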

Experimental procedures.--A major hy- 
pothesis was that sustained vigilance in 
complex visual tasks is a function of the 
proprioceptive and retinal stimulation arising 
from head and eye movements, and the 
manipulation of this class of stimulation was 
by spatial separation of the four stimulus 
sources. The assumption was that the wider
the separation of sources, the greater the head 
and eye movements, and the greater the 
stimulation. Four amounts of spatial separa- 
tion, specified in terms of the difference 
between the two outermost sources, were 18°, 
36°, 72°, and 144°. The other two sources 
were spaced between the two peripheral ones 
to give equal separation between all four. 
Separations of 18° and 36° placed the four 
sources in a direct field of visual view, and 
essentially they could be scanned with eye 
movements alone. The 72° separation
required some added head movements for
comfortable scanning, and scanning the 144°
separation was not possible without head
movements.
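The geometry of the four separations can be worked out with a short sketch. It assumes that S's eyes sit at the center of curvature of the track, so that the 30-in. viewing distance is also the arc radius, and that angles are measured symmetrically about straight ahead; neither assumption is stated outright in the text, and the function names are illustrative.

```python
import math

def source_angles(total_separation_deg, n_sources=4):
    """Angular positions (deg) of equally spaced sources, centered on 0."""
    step = total_separation_deg / (n_sources - 1)
    return [-total_separation_deg / 2 + i * step for i in range(n_sources)]

def arc_spacing_in(total_separation_deg, radius_in=30.0, n_sources=4):
    """Distance along the track (in.) between adjacent sources."""
    step_deg = total_separation_deg / (n_sources - 1)
    return radius_in * math.radians(step_deg)

for separation in (18, 36, 72, 144):
    print(separation, source_angles(separation),
          round(arc_spacing_in(separation), 2))
```

Under these assumptions adjacent sources are 6°, 12°, 24°, and 48° apart for the four conditions, which makes it easy to see why the narrow conditions can be covered by eye movements alone.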

A second major hypothesis was that response
complexity is a source of internal
stimulation for high alertness in complex
tasks. Two levels of response complexity were
used. One was detection responding, which
was the simplest and required S only to
press the detection button when a signal was


ACTIVATION HYPOTHESIS OF HUMAN VIGILANCE 


detected. The other was memory responding, 
which had much more complexity, and it 
required S to actively employ his immediate 
memory throughout the 2.5 hr. The S
always had to remember the last set of four 
numbers that appeared at the stimulus 
sources. As soon as he detected a new number 
at a source, his response was to press first the 
detection button and then press the memory 
button labeled with the number that had ap- 
peared at that same source the last time. 
This requirement for short-term memory was 
quite similar to running memory span 
(Pollack, Johnson, & Knaff, 1959), and it was 
hypothesized that the continuous memory 
requirement would introduce greater internal 
stimulation than detection responding. 

Two groups of 15 Ss each were used, with 
each S participating for five sessions. A 
mixed analysis of variance design was used 
(Lindquist, 1953, p. 292). Response com- 
plexity was a between-Ss variable, with one 
group having detection responding through- 
out and another group having memory re- 
sponding. Within-Ss variables were five trials 
and four spatial separations, and were 
administered to all Ss of both groups. Fol- 
lowing a practice session with a randomly 
assigned spatial separation of sources, each S 
was given four criterion sessions where each 
session was a different one of the four degrees 
of spatial separation. So that S did not have 
two identical separation conditions in a row, 
the three separation conditions that were not 
used in the practice session were equally 
divided among the Ss for the first criterion 
session. The separation condition for the 
practice session was assigned to the last 
criterion session, and the remaining two 
separations were randomly assigned to the 
second and third criterion sessions. To con- 
trol for the possibility of some learning of the 
characteristics of a particular input tape over 
five sessions, a tape from the pool of 10 tapes 
was assigned randomly to an S for each 
session, with the restriction that no tape be 
used twice for any S. Each session was on a 
different day. 
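The session-ordering rules just described can be sketched as a small assignment routine. It is a simplification under stated assumptions: cycling through the three eligible separations by subject index is only one way of dividing them equally for the first criterion session, tapes are identified by hypothetical indices 0 to 9, and none of the names come from the original report.

```python
import random

SEPARATIONS = [18, 36, 72, 144]  # degrees between the outermost sources

def session_plan(rng, subject_index):
    """Return five (separation, tape) pairs: practice plus four criterion sessions."""
    practice = rng.choice(SEPARATIONS)
    others = [s for s in SEPARATIONS if s != practice]
    # First criterion session: one of the three separations not used in practice,
    # rotated over subjects so each position is used equally often (assumption).
    first = others[subject_index % 3]
    # Second and third criterion sessions: the remaining two, in random order.
    middle = [s for s in others if s != first]
    rng.shuffle(middle)
    # Last criterion session repeats the practice separation.
    separations = [practice, first] + middle + [practice]
    # Five different tapes from the pool of 10, so no tape repeats for this S.
    tapes = rng.sample(range(10), 5)
    return list(zip(separations, tapes))

plans = [session_plan(random.Random(i), i) for i in range(15)]
print(plans[0])
```

Because the practice separation is held back until the last criterion session, no S ever meets the same separation on two consecutive sessions.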

Subjects. —The 30 Ss were male under- 
graduate students who were paid for their 
participation. They were randomly assigned
to groups.


Results 

The level of signal detection was 
high, as might be expected with a 
signal that persisted for 5 sec. Both
detection and memory responding had
a detection level of 98%. 

Figure 1 shows the main findings. 
Each S’s score was the mean latency 
of his responses on a trial, and Fig. 1 
presents the plot of group mean 
latencies as a function of trials. The 
latencies presented in Fig. 1 are for 
response to the detection button. 
Latencies increased as spatial separa- 
tion increased, and memory respond- 
ing had longer latencies than de- 
tection responding. The differences 
found between detection and memory 
responding were found to be pri- 
marily associated with pressing the 
detection button, indicating that all 
recall and decision delays about which 
of the six memory buttons to press 
took place before the main detection 
button was pressed. The difference 
between the latency to press the 
detection button and the latency to 
press the subsequent memory button 
is the time to move from the detection 
button to the memory button, and the 
means of these difference latencies 


TABLE 1
ANALYSIS OF VARIANCE IN EXP. I

Source                        df      MS      F (a)
Between Ss
  Detection-Memory (C)         1    95.88    31.85*
  Error_b                     28     3.01
Within Ss
  Trials (A)                   4     1.23    13.67*
  Spatial separation (B)       3    20.08    59.06*
  A X B                       12     0.11     1.10
  A X C                        4     0.15     1.67
  B X C                        3     0.05     0.15
  A X B X C                   12     0.07     0.70
  Error_1(w)                 112     0.09
  Error_2(w)                  84     0.34
  Error_3(w)                 336     0.10
Total                        599

(a) Error_b used to test C effect, Error_1(w) used to test
A and A X C, Error_2(w) used for B and B X C, and
Error_3(w) used for A X B and A X B X C.

* P < .01.


were found to be essentially constant
for all conditions. These latter data
have been omitted here.

[Figure 1 plot omitted: mean latency (sec.) against trials; recoverable panel titles are "Averaged over both response conditions" and "Detection responding".]

Fig. 1. Mean response latency as a function of the two types of response complexity
and the four separations of stimulus sources: Exp. I.

Of primary interest was the influence
of spatial separation and response
complexity on decremental
trends, and Fig. 1 shows decrement
to be associated with all conditions.
If our hypotheses about the effects
of spatial separation and response


complexity on decrement are sound, 
significant interactions should be ex- 
pected. Less decrement over trials 
should be found for large spatial 
separations and for memory respond- 
ing. The results of the analysis of 
variance in Table 1 show that the 
hypothesized interaction effects were 
not confirmed. All main effects, how- 
ever, were significant. 
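As a check on how the F ratios in Table 1 arise, the sketch below divides each effect's mean square by the error term that the table footnote assigns to it. The numbers are copied from Table 1; the dictionary layout and names are illustrative only.

```python
# Error terms from Table 1: name -> (df, mean square).
ERRORS = {
    "b": (28, 3.01),     # between-Ss error, tests C
    "w1": (112, 0.09),   # tests A and A x C
    "w2": (84, 0.34),    # tests B and B x C
    "w3": (336, 0.10),   # tests A x B and A x B x C
}

# Effects from Table 1: name -> (df, mean square, error term used).
EFFECTS = {
    "C": (1, 95.88, "b"),
    "A": (4, 1.23, "w1"),
    "B": (3, 20.08, "w2"),
    "AxB": (12, 0.11, "w3"),
    "AxC": (4, 0.15, "w1"),
    "BxC": (3, 0.05, "w2"),
    "AxBxC": (12, 0.07, "w3"),
}

def f_ratio(effect):
    """F = effect mean square divided by its designated error mean square."""
    _, ms, error_name = EFFECTS[effect]
    return ms / ERRORS[error_name][1]

# Degrees of freedom should add up to N - 1 = 30 Ss x 20 scores - 1 = 599.
total_df = sum(df for df, _, _ in EFFECTS.values()) + sum(df for df, _ in ERRORS.values())

for name in EFFECTS:
    print(name, round(f_ratio(name), 2))
print("total df:", total_df)
```

Only the three main effects (C, A, and B) yield large ratios; every interaction involving trials stays near or below 1.7, which is the pattern the text describes.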


EXPERIMENT II 


The activationist hypothesis does
not clarify the dimensional character- 
istics of stimulation that influence 
performance (Adams et al., 1961), 
and it could be that our operations in 
Exp. I did not define those particular 
characteristics of stimulation that 
deter vigilance decrement. The nega- 
tive results of Exp. I may have 
resulted from the manipulation of 
amount rather than the variety of 
stimulation that has been found to be 
important by other investigators. 
Sharpless and Jasper (1956) found 
that stimulus variety induced alert- 
ness in the cat, and McGrath (1960) 
found that visual detection was 
heightened by varied auditory stimu- 
lation such as music, but not auditory 
white noise. Experiment II was
performed to see if change in the 
pattern of head and eye movements 
could influence vigilance decrement. 
The plan of the experiment was to 
establish a pattern of visual observing 
by procedures analogous to operant 
reinforcement techniques and then, at 
a point in a session, require a change 
in the pattern. 


Method 

Apparatus.—The same task was used, but
it was modified by mounting a small but
distinctive neon cue light on top of each 
stimulus source. In addition to programing 
critical signals, whose duration was shortened 
to 2 sec. in this experiment, the tape input 
mechanism also programed the cue lights
from source to source at 2-sec. intervals.
The S was instructed always to watch the cue 
lights as they changed from source to source 
because the signal, whenever it occurred, 
would be coincident with an illuminated cue 
light. One way of viewing the modified task 
is in operant reinforcement terms where 
visually following the changing cue lights is a
response class that is rewarded when S sees a 
number as it first comes on and has the 
success of an early detection. By faithfully 
following the cue lights, S tends to optimize 
his rewards and his performance. In this 
fashion, we sought to shape a pattern of 
observing responses that could be subjected 
to change. 

Experimental procedures.—The 144° spatial 
separation was used for all groups to insure 
an ample observing response. Detection re- 
sponding was used for all groups. One type of 
observing response was repetitive (R) where 
the cue light moved regularly, e.g., Sources
1, 2, 3, 4, 3, 2, 1, . . . etc., so that S scanned
regularly back and forth. The other type of 
observing response was unsystematic (U) 
where the occurrence of cue lights was random 
with the restriction that the relative frequency 
at a source be the same as the R condition. 
Except for duration being reduced to 2 sec., 
the occurrences of critical signals on the input 
tapes were the same as in Exp. I. The shorter 
signal duration was used to place greater 
premium on close following of the cue lights 
because S would soon become aware that he 
would stand a chance of missing signals if he 
did not follow the cue lights closely. The 
length of the session was again 2.5 hr., so the 
shortening of signal duration slightly in- 
creased the length of intersignal intervals. 
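The two cue-light schedules can be sketched as follows. The repetitive pattern follows the sequence given in the text. For the unsystematic pattern the text states only that occurrences were random with the same relative frequency at each source as in the R condition; shuffling an R sequence is one simple way to meet that restriction and is an assumption here, as are the function names.

```python
import random
from collections import Counter

def repetitive_sequence(n_steps, n_sources=4):
    """Back-and-forth scan: 1, 2, 3, 4, 3, 2, 1, 2, ..."""
    cycle = list(range(1, n_sources + 1)) + list(range(n_sources - 1, 1, -1))
    return [cycle[i % len(cycle)] for i in range(n_steps)]

def unsystematic_sequence(n_steps, rng, n_sources=4):
    """Random order with exactly the per-source frequencies of the R pattern."""
    sequence = repetitive_sequence(n_steps, n_sources)
    rng.shuffle(sequence)
    return sequence

# A 2.5-hr session with a cue-light change every 2 sec. has 4500 steps.
n_steps = int(2.5 * 3600 / 2)
rng = random.Random(0)  # arbitrary seed for reproducibility
r_seq = repetitive_sequence(n_steps)
u_seq = unsystematic_sequence(n_steps, rng)
print(r_seq[:8], Counter(r_seq))
```

In the R pattern the two inner sources are visited twice as often as the two outer ones, so any U schedule that honors the restriction must show the same imbalance.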

Four independent groups of 15 Ss each 
were used. All Ss first had a practice session, 
and on a subsequent day had a criterion 
session. Group RU had R scanning through- 
out except on Trial 3 of the criterion session 
when it was changed to U scanning. Group R 
was a control group for Group RU and had R 
scanning throughout both sessions. If the
change of scanning pattern for Group RU
influenced vigilance decrement, it would be
detectable by comparison with Group R
which had no change. As a check on the type
of scanning and the effect of its change, 
another group, UR, had U scanning except 
when it was changed to R on Trial 3 of the 
criterion session. Group U was the control for 
UR, and had U scanning for both sessions. 

Subjects —The 60 Ss were university male 
undergraduates who were paid for their 
participation. Assignment to groups was
random.




Results 


As in Exp. I, percent detection was 
high (98%), and the basic score for S 
again was the mean of his response 
latencies on a trial. A plot of group 
means as a function of trials is shown 
in Fig. 2. Group R, with repetitive 
visual scanning, had lower mean 
latencies than Group U with un- 
systematic scanning, but both groups 
had decrement. These apparent 
trends were confirmed in a mixed 
analysis of variance (Lindquist, 1953, 
p. 267) where type of scanning (U 
and R) was the between-Ss variable, 
and trials the within-Ss variable. 
Type of scanning gave an F ratio 
(df = 1/28) of 19.65 (error MS = .17)
which was significant at less than the
.01 level. Trials had an F (df = 4/112)
of 9.11 (error MS = .009) which also 
was significant at less than .01. The 
interaction between these two main 
effects gave an F (df = 4/112) of .44 
(error MS = .009) which lacked sig- 
nificance at the .05 level. 

Trial 3 was the point of variation 
in scanning for Groups UR and RU, 
and Fig. 2 shows that the effect of the 
change was to shift a Trial 3 mean in 
the direction of its control group.

[Figure 2 plot omitted: mean latency (sec.) against trials.]

Fig. 2. Mean response latency for the four
groups of Exp. II.


The t test for independent measures
was used on Trial 2 latencies between
Groups RU and R, and Groups UR and
U, to establish comparability of
groups before the point of change.
The t ratio for the mean difference
between RU and R was .05 (error
MS = .004), and for the difference
between UR and U was 1.64 (error
MS = .006). Both t's lacked significance.
Groups RU and U, and
Groups UR and R, were then evaluated
with the t test at Trial 3, the
point of change. The t for the difference
between RU and U was .86
(error MS = .005), and UR and R
was 1.05 (error MS = .005). Neither
of these two t's was significant at the
.05 level. To check if there was a
persisting effect in the trial that
followed, comparisons of Groups RU
and R, and Groups UR and U, were
made on Trial 4. The t for the
difference between RU and R was .74
(error MS = .005), and the t for the
comparison between UR and U was
1.50 (error MS = .006). Neither t
was significant at the .05 level. For
all of these t tests, df = 1/28.
Interpretation of these statistics 
must be founded on evidence that the 
cue lights changed observing be- 
havior. First, cue lights produced 
lower mean latencies than when they 
were not used, presumably because S 
was directed where to look and was 
often reinforced when he looked there. 
Detection responding with 144° sep- 
aration in Exp. I is essentially the 
same task as given Groups U and R, 
but without cue lights. Group U had 
almost twice the speed of responding 
as the corresponding condition in 
Exp. I, and Group R was almost three 
times faster. A second and more 
direct basis for inferring that cue 
lights influenced the observing re- 
sponse is shown in Fig. 3, where mean 
response latency to subsets of stimulus 


ACTIVATION HYPOTHESIS OF HUMAN VIGILANCE 501 


Fig. 3. For the four groups of Exp. II, mean response latency for the inner two 
and the outer two stimulus sources. (One panel per group; mean latency in sec. 
plotted against trials, with separate curves for inner and peripheral stimulus sources.) 
sources is presented. For each S a 
latency score was computed as the 
mean response time on a trial to 
signals of the two inner stimulus 
sources, and similarly another latency 
score was computed for signals on a 
trial for the two peripheral stimulus 
sources. Figure 3 shows the trial 
means for each group. If the ob- 
serving response was under close 
control and S was systematically 
following the cue lights, then equal 
attentiveness to all sources is ex- 
pected. Figure 3 shows that the 
expectation was realized for Group R. 
Not only was their performance level 




high, but it was the same for both 
the inner and peripheral stimulus 
sources. Less success was achieved by 
Group U which had performance 
noticeably poorer for peripheral 
sources. The larger excursions of head 
and eye movements apparently did 
not fall under close control when 
eliciting cue lights changed unsys- 
tematically. Nevertheless, some 
measure of overall success was evident 
for Group U because its mean per- 
formance level was better than that 
for the corresponding condition of 
detection responding and 144° sep- 
aration in Exp. I. 

Group R and Group U each had a 
separate three-way analysis of vari- 
ance performed on their data shown 
in Fig. 3. There was one measure per 
cell, and trials was one variable, inner 
vs. peripheral a second, and Ss a 
third. For Group R the trials vari- 
able had an F ratio (df = 4/56) of 
6.50 (error MS = .014) which was 
significant at less than the .01 level. 
The inner vs. peripheral variable gave 
an F ratio (df = 1/14) of 2.44 (error 
MS = .009) which failed significance 
at .05. The interaction between 
these two main effects also lacked 
significance at the .05 level (F = .70; 
df = 4/56; error MS = .010). Group 
U had trials significant at less than .01 
(F = 3.76; df = 4/56; error MS = .025) 
and also the inner vs. peripheral 
variable significant at less than 
.01 (F= 38.58; df = 1/14; error 
MS = .053). The interaction be- 
tween main effects was significant 
between the .05 and .10 levels 
(F=2.16; df=4/56; error MS=.032). 

The plots for Groups UR and RU 
also are shown in Fig. 3. Change in 
the patterning of cue lights influenced 
performance at both inner and peri- 
pheral sources, and in directions 
consistent with those for Groups 
U and R. 




DISCUSSION 


Molar behaviorism concerns itself with 
S-R operations, and physiology occupies 
itself with either the effect of stimuli on 
internal bodily states or the effect of 
bodily states on overt behavior. Physio- 
logical psychology, on the other hand, 
is directed toward completing the arc of 
lawfulness from stimulus, to under-the- 
skin states, to molar behavior. A more 
substantial network of empirical laws 
can be achieved by coordinating the laws 
of psychology and physiology, and a 
firmer scientific footing is gained. In 
the two experiments reported here, we 
sought to move in this direction by trying 
to link characteristics of the task, prop- 
erties of the reticular formation, and 
overt vigilance behavior. Our results 
were negative. Neither the amount nor 
variety of stimulation induced by mani- 
pulating head and eye movements, nor 
response complexity, influenced vigilance 
decrement as we had predicted from the 
activationist hypothesis. An impressive 
number of facts for vigilance behavior 
casually fit the general framework of the 
activationist hypothesis, but the absence 
of operational definitions for the type of 
stimuli, as well as the characteristics of 
each stimulus class, gives the hypothesis 
little predictive capability for measures 
of molar behavior (Frankmann & Adams, 
1962). Our two experiments suggest that 
the specification of relevant stimuli and 
their properties is hardly straightforward, 
and they emphasize the problems of 
operationally defining variables of the 
hypothesis well enough for it to be 
thoroughly tested and graduated into a 
substantive psychophysiological law. 

There are several plausible reasons 
why our tests of the activationist hy- 
pothesis failed, and they all illustrate the 
difficulties of operational definition that 
must be resolved before the hypothesis 
can be accepted or rejected. Perhaps our 
manipulations did not induce differential 
stimulation of the reticular formation: 
Adams, Stenson, and Humes (1961) 
found that number of stimulus sources 
did not exert a discernible effect on 
vigilance decrement, and they conjec- 




tured that increasing the number of 
stimulus sources might not have pro- 
duced an increment in stimulation at the 
reticular formation. The same reasoning 
might be applied here. Or, perhaps the 
operations we used did not sufficiently 
arouse the reticular formation. We did 
not have direct physiological measure- 
ment of the impact at the reticular 
formation of these stimuli generated by 
head and eye movements, but vigorous 
reticular activation from these operations 
is certainly an expectation from our cur- 
rent physiological knowledge. Also, the 
operations of immediate memory might 
be questioned as a way of inducing 
cortical-centered stimuli, but the task of 
keeping the most recent four numbers in 
mind has a reasonableness about it for a 
central source of activation. Even if our 
selection of stimulus classes is granted, 
we still may not have uncovered mani- 
pulations for the relevant dimension of 
stimuli. Both the memory and the 
visual scanning tasks in Exp. I may have 
become repetitive rather early, and 
decrement could have occurred as a 
function of adaptation to the steady 
state of internal stimuli, as Scott (1957) 
has suggested. Early adaptation may 
have been a factor in Exp. II where we 
explicitly tried to introduce variety in 
our key stimuli. Sustained stimulus 
variety, whatever its operational defini- 
tion might be, may prove to be the key 
to human alertness. Possible directions 
for inferring relevant dimensions of stim- 
ulus variety for visual alertness are the 
study by McGrath (1960) where mean- 
ingful audio stimulation such as music 
improved detection, and the studies by 
Adams and Boulter (1960) and Adams, 
Stenson, and Humes (1961) where four- 
choice decision responding improved 
monitoring behavior. McFarland, Hol- 
way, and Hurvich (1942) found that a 
brief interlude of stretching and con- 
versation with E eliminated decrement 
in a measure of visual threshold. 

The typical vigilance experiment, 
where marked decrements are found in 
prolonged monitoring behavior, differs 
from those presented here in that near 
threshold stimuli occur at a single stim- 




ulus source and percentage of signals 
detected is the measure. However, there 
is nothing that would seem to preclude 
our task with its latency measure as a 
sound vehicle for testing the hypothesis. 
Decrement was uniformly obtained but 
we were unable to control it by mani- 
pulating sources of stimulation. 


SUMMARY 


The activationist hypothesis contends that 
environmental and internal sources of stimula- 
tion, working through the reticular formation 
of the brain, are sources of human alertness. 
Two experiments were performed to identify 
stimulus determinants of vigilance decrement 
in a complex visual monitoring task. The S's 
task was to detect a two-digit number as 
rapidly as possible when it appeared at any 
one of four stimulus sources. Response 
latency was the measure of performance. 

Experiment I sought to manipulate 
response-produced stimulation arising from 
the stimulation induced by head and eye 
movements and immediate memory. Amount 
of head and eye movements was defined in 
terms of the spatial separation of sources, and 
the physiological basis of its stimulation value 
for alertness was hypothesized to be pro- 
prioception and visual collaterals in the 
reticular formation. Immediate memory was 
defined as a requirement to remember the 
four numbers that last appeared at the 
sources and, when a number appeared, to 
respond with respect to the last number that 
had appeared at that source. Immediate 
memory was hypothesized to provide inputs 
to the reticular formation from cortical areas. 
No effects of these variables on vigilance 
decrement were found. Experiment II asked 
if the negative results of Exp. I were related 
to amount rather than variety of stimulation 
being manipulated. Experiment II intro- 
duced variety in head and eye movements by 
training the pattern of visual observing 
responses and then changing the pattern on a 
trial. No effects on vigilance decrement were 
found. Problems of operationally defining 
the activationist hypothesis were discussed. 


REFERENCES 


ADAMS, J. A., & BOULTER, L. R. Monitoring 
of complex visual displays: I. Effects of 
response complexity and intersignal interval 
on vigilant behavior when visual load is 
moderate. USAF CCDD tech. Note, 1960, 
No. 60-63. 

ADAMS, J. A., STENSON, H. H., & HUMES, 
J. M. Monitoring of complex visual 
displays: II. Effects of visual load and 
response complexity on human vigilance. 
Hum. Factors, 1961, 3, 213-221. 

BROADBENT, D. E. Perception and communi- 
cation. New York: Pergamon, 1958. 

COOPER, S., DANIEL, P. M., & WHITTERIDGE, 
D. Muscle spindles and other sensory 
endings in the extrinsic eye muscles: The 
physiology and anatomy of these receptors 
and of their connections with the brain- 
stem. Brain, 1955, 78, 564-583. 

FISKE, D. W. Effects of monotonous and re- 
stricted stimulation. In D. W. Fiske and 
S. R. Maddi (Eds.), Functions of varied 
experience. Homewood, Ill.: Dorsey, 1961. 
Pp. 106-144. 

FRANKMANN, J. P., & ADAMS, J. A. Theories 
of vigilance. Psychol. Bull., 1962, 59, 257- 

LINDQUIST, E. F. Design and analysis of 
experiments in psychology and education. 
New York: Houghton Mifflin, 1953. 

LINDSLEY, D. B. Psychophysiology and 
motivation. In M. R. Jones (Ed.), 
Nebraska symposium on motivation: 1957. 
Lincoln: Univer. Nebraska Press, 1957. 
Pp. 44-105. 

MALMO, R. B. Activation: A neuropsycho- 
logical dimension. Psychol. Rev., 1959, 
66, 367-386. 

McFARLAND, R. A., HOLWAY, A. H., & 
HURVICH, L. M. Studies of visual fatigue. 
Publ. Grad. Sch. Bus. Admin. Harvard U., 
1942. 

McGRATH, J. J. The effect of irrelevant 
environmental stimulation on vigilance per- 
formance. Project on Human Factors 
Problems in Anti-Submarine Warfare, 
Technical Report No. 6, 1960, Office of 
Naval Research, Psychological Sciences 
Division. 

POLLACK, I., JOHNSON, L. B., & KNAFF, P. R. 
Running memory span. J. exp. Psychol., 
1959, 57, 137-146. 

ROSSI, G. F., & ZANCHETTI, A. The brain 
stem reticular formation. Arch. Ital. Biol., 
1957, 95, 199-435. 

SAMUELS, I. Reticular mechanisms and be- 
havior. Psychol. Bull., 1959, 56, 1-25. 

SCOTT, T. H. Literature review of the intel- 
lectual effects of perceptual isolation. 
Report No. HR66, 1957, Department of 
National Defence, Defence Research Board, 
Canada. 

SHARPLESS, S., & JASPER, H. Habituation of 
the arousal reaction. Brain, 1956, 79, 
655-680. 

WHITTERIDGE, D. Central control of eye 
movements. In J. Field (Ed.), Handbook 
of physiology. Washington: American 
Physiological Society, 1960. Pp. 1089-1109. 
(Received October 28, 1961) 


Journal of Experimental Psychology 
1962, Vol. 64, No. 5 


THE RELATIVE EFFICIENCY OF SEVERAL TRAINING 
METHODS AS A FUNCTION OF TRANSFER 
TASK COMPLEXITY ¹ 


GEORGE E. BRIGGS AND JAMES C. NAYLOR 
Ohio State University 


In a recent review Naylor (1962) 
has suggested that the relative effi- 
ciency of part- and whole-task train- 
ing schedules is a function of two 
variables: complexity and organization. 
In his taxonomy, Naylor defines 
complexity as a function of the 
information-processing demands im- 
posed on S by each component of the 
task separately, while task organiza- 
tion is specified by the demands placed 
on S by the interactions or inter- 
relations between task components. 
Thus, task complexity is an intra- 
component characteristic, task or- 
ganization an intercomponent char- 
acteristic. 

Naylor (1962) has further sug- 
gested that the efficiency of a whole- 
task training method (as compared to 
a pure-part task method) will increase 
as task complexity is increased, i.e., 
there should be an interaction between 
training method and task complexity 
in terms of transfer performance. 
Therefore, those tasks which place 
greater intradimensional demands 
upon S should be learned best by 
whole practice, whereas those 
having lesser intradimensional 
requirements should show a decrease 
in the relative efficiency of the whole 
training method. 


Laboratory of Aviation Psychology and was 
supported by the United States Navy under 
Contract No. N61339-950, sponsored by the 


lication, use, an 
part for any purpose of t 
Government. 


More specifically, 
Taylor (1954) pointed out that an 
increase in dynamics, as from a simple 
position control (such as setting one's 
watch) to rate control (such as 
steering an auto), results in an in- 
crease in the information-processing 
demands placed on S, i.e., he must 
increase the use of derivative in- 
formation in the tracking error signal 
to maintain stability in rate task 
control. In fact, Russell (1951) 
clearly showed that compared to his 
transfer function (an analytic ex- 
pression of his information-processing 
functions) with a positional control, 
S will increase his damping (weighting 
of the first derivative of error in 
determining a control movement) by a 
factor of 25 when controlling through 
rate dynamics. Thus, a more simple 
transfer function is adequate for 
position control tasks, but rate dy- 
namics require rate estimates by S, 
thereby increasing the complexity of 
his task. 

Similarly, a further increase in dy- 
namics from rate control to accelera- 
tion control results in an even more 
complex task dimension, since S now 
must provide a transfer function with 
the equivalent of two differentiations, 
one for each of the integral lags 
present in the control dimension. 




The relationship between information- 
processing demands and control dy- 
namics is discussed in detail by 
Birmingham and Taylor (1954). 

The present study represents an 
investigation of the task complexity 
hypothesis. In addition to whole and 
pure-part task training methods, the 
present study included groups trained 
via either a progressive-part or a 
simplified-whole method. 


METHOD 


Apparatus.—A three-dimensional compen- 
satory tracking system defined the skill task. 
The S observed three center-reading meters 
labeled “heading,” “altitude,” and “yaw,” 
and he was instructed to maintain null 
readings on these meters as much of the time 
as possible. A single input signal was applied 
to all three dimensions in the form of a simple 
sinusoid of .03 cps. In the more complex 
versions of the task (see below) meters show- 
ing “altitude rate” and/or “heading rate” 
were provided. All meters were mounted on a 
single 10.5 X 19 in. panel which was located 
approximately 24 in. from S, who mani- 
pulated a single, three-dimensional control. 
The control device was a two-dimensional 
stick where the three control stick movements 
were left-right (heading), front-back (alti- 
tude), and clockwise-counterclockwise rota- 
tion of the head (yaw). All three control- 
display linkages conformed to population 
stereotypes. 

An EASE analog computer provided the 
task dynamics, as illustrated in Fig. 1. As 
indicated, there were three levels of control 
complexity used in this study. Tracking 
proficiency was measured during each trial in 
terms of integrated absolute tracking error 
(average error) for each of the three dimen- 
sions separately. 

Design.—Two independent variables were 
manipulated in a transfer of training para- 
digm: transfer task complexity (two levels) 
and method of training (four methods). The 
two levels of transfer task complexity were 
Level II and Level III of Fig. 1. The four 
training methods were whole-task training, 
pure-part training, progressive-part training, 
and simplified-whole task training. There 
were two groups of Ss for each of the training 
methods (one for each of the transfer task 
complexity levels). The two whole-task 
groups tracked through Level II and Level III 
dynamics, respectively, for a total of 10 daily 


Fig. 1. System dynamics used to obtain 
various levels of task complexity. (Panels: A, Complexity Level III, most complex; 
B, Complexity Level II, medium complexity; C, Complexity Level I, least complex.) 


sessions. The remaining six groups received 
8 sessions on a training task and 2 sessions on 
the final transfer task. For three of the six 
groups the final transfer task involved Level 
II dynamics, while the other three groups 
experienced Level III dynamics upon final 
transfer. 

Pure-part training involved four 35-sec. 
trials per session on each of three separate 
dimensions. Progressive-part training in- 
volved an initial three sessions on a pure-part 
schedule followed by five sessions during each 
of which S practiced four 35-sec. trials on 
each of the three possible pairings of the 


TABLE 1 

EXPERIMENTAL DESIGN OF TRAINING 
AND TRANSFER CONDITIONS 

Group | Training Condition | Training Task | Transfer Task 
1     | Whole              | Level III     | Level III 
2     | Pure-Part          | Level III     | Level III 
3     | Progressive-Part   | Level III     | Level III 
4     | Simplified         | Level II      | Level III 
5     | Whole              | Level II      | Level II 
6     | Pure-Part          | Level II      | Level II 
7     | Progressive-Part   | Level II      | Level II 
8     | Simplified         | Level I       | Level II 

Note.—See Fig. 1 for a definition of Complexity 
Levels I, II, and III. 


heading, altitude, and yaw dimensions. Of 
the two groups which experienced the sim- 
plified-whole training method, one trained for 
eight sessions on Level I dynamics (see Fig. 1) 
and then transferred to Level II, while the 
other group trained on Level II and trans- 
ferred to Level III. Table 1 summarizes the 
training and transfer conditions. 

Subjects and procedure.—A total of 144 
male undergraduates participated in this 
study. Assignment to the eight groups was 
based on randomization procedures with the 
restriction that groups be filled approximately 
equally throughout the data collection period. 
At the end of this period there were 18 Ss 
per group. The Ss were reimbursed $10.00 
for their time. 

There were 12 35-sec. tracking trials during 
each of the 10 sessions. Practice occurred in 
four-trial blocks with 25 sec. rest between 
trials within a block and 1 min. rest between 
blocks. Scoring occurred over the final 30 
sec. of each trial to avoid the initial transients 
in tracking performance; thus, a block of four 
trials served as the unit of data analysis and 
represents a behavioral sample of 2 min, for 
trials of the initial 
all Ss controlled the task without the 


this procedure was to 
acquisition of skill. 
dynamics provide 
tems, and thus the system tends to go out of 


removing the input markedly improved S's 
chances of “keeping control” over the system 
during the initial training trials as it provided 
a less difficult task. 

All Ss were instructed to keep the meter 
displays representing heading, altitude, and 
yaw at their null position at all times, if 
possible. It was emphasized that control 
over all three task dimensions was required, 
and S was reminded of this if his performance 
indicated that he was paying particular 
attention to one or two displays to the 
detriment of the other dimension(s). The S 
was informed also of the usefulness of 
rate display (if it were present, see Fig. 1). 

The performance metric, average error, 
is actually the average deviation of the error 
amplitude distribution generated by S during 
each tracking trial. Therefore, the metric 
describes S's intratrial variability in tracking 
accuracy. The score was recorded originally 
in voltage units; however, following data 
collection all scores were transformed to units 
of inches of arc, thereby providing a scale 
matching that of S's tracking display. 


The results from both the training 
and the transfer sessions are sum- 
marized in Fig. 2 (Groups 1-4) and 
Fig. 3 (Groups 5-8). The tracking 
error scores were combined for all 
three dimensions and averaged for 
each group. In order to provide 
comparable data points, it was neces- 
sary to derive daily averages for the 
training sessions, i.e., the pure-part 
and the progressive-part groups ex- 
perienced all three tracking dimen- 
sions during each of the eight training 
sessions; however, during transfer it 
was possible to derive comparable 
data points for four-trial blocks. The 
use of combined error scores is justified 
here, since the performance functions 
on the three dimensions are quite 
similar in relative rank order and no 
apparent artifacts are introduced by 
dealing with “total task” performance 
in either the training or the transfer 
sessions. 

Training.—Several points are of 
general interest from the training data 
of Fig. 2 and 3. First, as expected, 
tracking performance is best during 
Sessions 1 through 8 for that whole- 
task group, Group 8, which ex- 
perienced the least complex system 
(Level I), and the least proficient 
performance was attained by Group 1 
which experienced the most complex 


Fig. 2. Training and transfer performance levels for Groups 1-4. (Transfer to 
Task Complexity Level III.) 

Fig. 3. Training and transfer performance levels for Groups 5-8. (Transfer to 
Task Complexity Level II.) 


whole task (Level III). Both Groups 
4 and 5 trained on the same whole 
task (Level II), and their performance 
levels (a) are not significantly differ- 
ent one from another (P > .05) and 
(b) fall intermediate to those of 
Groups 1 and 8. 

It may be recalled that no input 
signal was employed during Session 1 
for any group; thus, these data are 
set apart in Fig. 2 and 3. Also, it 
may be recalled that Groups 3 and 7 
transferred from a pure-part schedule 
to practice on pairs of task dimensions 
on Session 4; therefore, the figures 
show a discontinuity between Sessions 
3 and 4 for Groups 3 and 7. It is 
interesting to note on Session 4 and 
thereafter that Group 3 surpasses the 
whole-task group, Group 1, whereas 
Group 7 never does exceed the per- 
formance of Group 5, even though 
Group 7 experienced only two track- 
ing dimensions while Group 5 was 
confronted with all three dimensions. 
This may well be a result of the very 
simple yaw dynamics present in the 
Level II task (see Fig. 1). 

In addition to the comparable per- 
formance levels of Groups 5 and 7 
late in training, it may be noted from 
Fig. 3 that Groups 6 and 8 achieved 
similar “total task” performance. 
Again, this is probably a result of the 
very simple task dynamics encount- 
ered by Group 8 during training. 
The more complex dynamics experi- 




enced by Groups 1-4 would account, 
therefore, for the clear separation of 
performance levels for these groups 
during the latter training sessions (see 
Fig. 2). 

An analysis of variance was applied 
to the data of Groups 1-3 and Groups 
5-7 on Session 8. Groups 4 and 8 
were not included, since their task 
dynamics differed from those of the 
companion groups. The results of the 
analysis indicated that training meth- 
ods (F= 78.6; df = 2/102), task 
complexity (F = 85.2; df = 1/102), 
and the Methods X Complexity inter- 
action (F = 28.0; df = 2/102) were 
statistically significant at P < .05. 
The interaction, illustrated in Fig. 2 
and 3, shows that the greatest per- 
formance difference between the Level 
II and the Level III tasks occurred 
with the whole-task groups, while the 
progressive-part groups showed less 
influence of the task complexity vari- 
able, and no difference (P > .05) was 
found between Groups 3 and 7. The 
only other group difference which did 
not attain statistical significance at 
P < .05 on Session 8 was that be- 
tween the whole and progressive-part 
groups on the Level II task dynamics 
(Groups 5 and 7 as mentioned earlier). 
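
The degrees of freedom in the Session 8 analysis follow from a 3 x 2 between-Ss layout (three training methods by two complexity levels) with 18 Ss per group. The sketch below simply reproduces that arithmetic; the function and key names are hypothetical and are not taken from the study.

```python
def factorial_df(n_methods: int, n_levels: int, n_per_group: int) -> dict:
    """Return (effect df, error df) pairs for a two-way between-Ss design."""
    n_cells = n_methods * n_levels
    # Within-cell error: one df lost per group mean.
    error_df = n_cells * (n_per_group - 1)
    return {
        "methods": (n_methods - 1, error_df),
        "complexity": (n_levels - 1, error_df),
        "methods_x_complexity": ((n_methods - 1) * (n_levels - 1), error_df),
    }


# Groups 1-3 and 5-7: three methods, two complexity levels, 18 Ss per group.
session8_df = factorial_df(n_methods=3, n_levels=2, n_per_group=18)
print(session8_df)
```

With these inputs the pairs come out as 2/102, 1/102, and 2/102, matching the values given in the text.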

Finally, it may be noted from Fig. 2 
and 3 that all groups had approached 
asymptotic performance levels at or 
before transfer. This was especially 
the case for the two part-task practice 
groups (Groups 2 and 6). If any- 
thing, Groups 2 and 6 were over- 
trained in eight sessions. 

Transfer-—It would appear from 
Fig. 3 that during the initial block of 
transfer trials Groups 6 and 8 per- 
formed at a level inferior to Group 5 
on Session 2. It should be recalled 
that the data point for Group 5 on 
Session 2 is the average for three 
blocks of trials, and when performance 
for Group 5 on the first block of 
Session 2 is compared to that of 




Groups 6 and 8 on the first block of 
Session 9, the result is not quite 
as unexpected. The transfer index 
(C; — E/Ci — C) X 100 was used 
for this comparison, where Cs is the 
performance of Group 5 on the first 
block of Session 2, C; is the perform- 
ance of the same group on the first 
block of Session 9, and Æ, is the 
performance of Group 6, 7, or 8 on the 
latter block of trials. This index of 
transfer is particularly appropriate 
when the raw data are error scores, as 
in the present case, and the index 
describes relative improvement in 
performance as a function of training. 
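
A minimal sketch of how such an index behaves with error scores is given below. It assumes the index is the experimental group's saving relative to the control group's untrained score, divided by the control group's own saving over training. The function name, argument names, and numerical values are hypothetical illustrations, not data from the study.

```python
def transfer_index(control_early: float, experimental_transfer: float,
                   control_late: float) -> float:
    """Percent transfer for error scores (lower scores mean better tracking).

    control_early:         control group error on its first whole-task block
    experimental_transfer: experimental group error on its first transfer block
    control_late:          control group error on that same late block
    """
    control_gain = control_early - control_late
    if control_gain == 0:
        raise ValueError("control group shows no improvement; index undefined")
    return (control_early - experimental_transfer) / control_gain * 100.0


# Illustrative values only (inches of error):
full = transfer_index(2.0, 1.0, 1.0)   # experimental group matches the trained control
none = transfer_index(2.0, 2.0, 1.0)   # experimental group no better than the untrained control
half = transfer_index(2.0, 1.5, 1.0)   # halfway between the two
print(full, none, half)
```

An experimental group that equals the trained control group scores 100%, and one that equals the untrained control group scores 0%, which is why the index expresses relative rather than absolute transfer.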

The same index was applied to the 
data of Groups 1-4 and the results for 
all groups were: part training = 2% 
transfer to Complexity Level II, 31% 
to Complexity Level III; progressive- 
part training = 75% transfer to Level 
II, 94% to Level III; and simplified 
training = 28% transfer to Level II, 
44% to Level III. One may note that 
(a) Group 3 attained a very high level 
of performance during transfer (94%), 
while Groups 2, 4, 6, and 8 attained 
rather poor relative transfer per- 
formance levels, and (b) transfer 
performance was relatively higher on 
the more complex task than on the 
less complex task. It is important to 
note that these transfer data are 
expressed in relative terms, as in- 
dicated above. If examined in terms 
of absolute transfer performance levels, 
the pattern of results as listed is 
partially reversed. 

This latter point may be illustrated 
by reference to an analysis of variance 
which was performed on the data of 
all eight groups over the entire six 
blocks of transfer trials. The results 
are summarized in Table 2 where it is 
apparent that, as was the case with 
the training data, there are significant 
differences among the training meth- 
ods and between the two levels of task 
complexity. Further, and of direct 




relevance to the above statement on 
absolute transfer performance levels, 
the interaction of training methods by 
task complexity is statistically sig- 
nificant at P < .05. Figure 4 con- 
tains the group averages which define 
this interaction. There it is obvious 
that considerably greater absolute 
differences occurred between the whole- 
task group (Group 1) on the more 
complex transfer task and the groups 
trained by either pure-part or sim- 
plified-whole methods (Groups 2 and 
4) than between the comparable 
groups on the less complex transfer 
task (Group 5 vs. Groups 6 and 8). 
However, again it should be em- 
phasized that relative comparisons 
between control (whole training) 
groups and the experimental groups 
show better transfer performance on 
the more complex task than on the 
less complex task for the latter groups. 

The other statistically significant 
interactions indicated in Table 2 are 
of interest. The Transfer Blocks 
X Training Methods interaction indi- 
cates a difference in the slopes of the 
transfer performance curves for the 
several groups. By reference to Fig. 2 
and 3 it may be seen that the slopes 
for the whole task and progressive- 
part task groups are very slight, 
whereas those for the part and 


TABLE 2 

ANALYSIS OF VARIANCE AS APPLIED TO THE 
TRANSFER TRIAL BLOCKS OF SESSIONS 9 
AND 10 FOR ALL GROUPS 

Source                    | df 
Training methods (M)      | 3 
Task complexity (C)       | 
M X C                     | 
Ss within methods (Ss/M)  | 
Transfer blocks (B)       | 5 
B X M                     | 15 
B X C                     | 
B X C X M                 | 15 
B X Ss/M                  | 680 

* P < .05. 


Fig. 4. Transfer task performance averaged 
over Sessions 9 and 10 for all groups. (Average error in inches by transfer task 
complexity; legend entries include part, simplified, and progressive methods.) 


simplified-whole groups are rather 
steep. Thus, while initial transfer 
performance for the latter two groups 
was relatively poor, their progress 
following initial transfer was quite 
rapid. 

The Transfer Blocks X Task Com- 
plexity interaction is more difficult to 
see in Fig. 2 and 3. However, it is a 
result primarily of the especially high 
rates of improvement by Groups 2 and 
4 on the more complex task. It fol- 
lows, therefore, that not only was 
relative transfer higher for these 
groups on the more complex task, but 
also their rate of approach to the 
performance level of the control 


TABLE 3 

GROUP COMPARISONS OF TRANSFER 
PERFORMANCE ON BLOCK 1 (UPPER PART) AND ON 
BLOCK 6 (LOWER PART) FOR ALL GROUPS 

      |        Level III        |        Level II 
Cond. | W   | P   | P-P | S   | W   | P   | P-P | S 
W     |     | .05 | ns  | .05 |     | .05 | ns  | .05 
P     | ns  |     | .05 | ns  | ns  |     | .05 | ns 
P-P   | ns  | ns  |     | .05 | ns  | ns  |     | .05 
S     | ns  | ns  | ns  |     | ns  | ns  | ns  | 

Note.—Cell entries indicate statistical significance at 
P < .05 and at P > .05 (ns). W, P, P-P, and S iden- 
tify whole, part, progressive-part, and simplified-whole 
methods. 




group (Group 1) was greater than that 
for the comparable groups on the less 
complex system. 

Following the analysis of variance, 
summarized in Table 2, a Duncan 
multiple range test (Duncan, 1955) 
was applied to the data for all groups
on the first block of transfer trials and
on the last block. A summary of these
comparisons is provided in Table 3. 
The upper half of each section of 
Table 3 represents comparisons made 
with the initial block of transfer trials, 
while the lower half is from the last 
block of transfer trials. These results 
show that whereas there were differ- 
ences among groups early in transfer, 
the relatively short transfer experience 
with the whole task eliminated 
all group differences; therefore, the 
effects of training method were quite 
transitory. 


DISCUSSION 


Several findings of the study are of 
particular interest. First, the relative 
transfer indices are higher for all three 
experimental groups on the more com- 
plex task. Using this measure of method 
efficiency, one finds an indication that the 
part-training methods are actually more 
efficient when the task is highly complex 
—a denial of the Naylor hypothesis. 
Second, using absolute transfer perform- 
ance (see Fig. 4), one finds that only the 
progressive-part schedule showed an im-
provement relative to the whole method.
These data lead to the conclusion that
both the pure-part and the simplified- 
whole methods become less efficient 
training techniques for highly complex 
tasks and that the whole and some
schedule of progressive-part methods
emerge as more satisfactory training
techniques as task complexity is in-
creased. This substantiates the hy-
pothesis that task complexity and train-
ing method interact, with whole training
becoming relatively more efficient as 
complexity is increased. Since the 
validity of the hypothesis therefore de-
pends upon the measure of transfer that
is used, it is not possible to accept or 
reject without ambiguity. 

One possible source for this ambiguity 
lies in the second task variable mentioned 
by Naylor (1962). It is likely that an 
increase in component complexity results 
in a corresponding increment in the 
organizational demands of the task. 
With the three-dimensional task used 
here, the primary organizational require- 
ments imposed on S consisted of time- 
sharing demands, i.e., he had to develop 
an efficient pattern of attention alterna- 
tion among the three displays and avoid 
undue emphasis on a single system 
dimension (component). Thus, as task 
complexity is increased, S must co-
ordinate his responses to the several task
dimensions more exactly to avoid large 
errors in one or another of the dimensions. 

A comparison illustrating the impor- 
tance of time sharing is that between the 
simplified-whole, the progressive-part, 
and the pure-part training methods. If 
increases in complexity of the system 
components influence task organization, 
then the pure-part method should show 
less effect of task complexity, while the 
progressive-part and the simplified-whole 
methods should be affected to a greater 
extent. This prediction was substanti-
ated by the data: there is no statistically
significant difference between Groups 2 
and 6 (pure-part) during the training 
sessions; however, both Groups 3 and 7 
(progressive-part) and Groups 4 and 8 
(simplified-whole) differ markedly (see 
Fig. 2 and 3).

Perhaps one of the more intriguing 
findings related to time-sharing concerns 
the relationship between the whole and 
the progressive-part groups during trans-
fer. It would appear that time-sharing
can be learned quite well by the progres- 
sive-part method, as there were no 
significant differences between the whole 
and progressive-part groups at either 
level on Transfer Block 1 (Table 3). 
Since, at the most, only practice on pairs 
of task dimensions was given in the 
progressive-part method, the acquisition 
of time-sharing skill may not require 
practice on the total task. 




It would appear, then, that the time- 
sharing demands of multidimensional 
control tasks represent a potent variable 
determining the relative efficiency of 
training methods. However, this vari- 
able accounts for only part of the transfer 
results. If time-sharing requirements 
were the only variable of importance, 
then those groups trained via a simpli- 
fied-whole method should have been 
superior to the pure-part groups during 
transfer. It may be noted from Table 3 
that these groups did not differ signifi-
cantly at P < .05; thus, the apparent
superiority of Groups 4 and 8 over 
Groups 2 and 6, respectively, in Fig. 2 
and 3 is only apparent—not real. This 
was a completely unexpected finding of 
the present research. It was expected 
on the basis of an earlier analysis by 
Briggs and Waters (1958) that a sim-
plified-whole training method would
provide superior transfer performance 
compared to that following pure-part 
training. 

Also, Briggs, Fitts, and Bahrick (1958) 
had previously demonstrated that greater 
amounts of training resulted in greater 
amounts of transfer for a simplified 
training procedure. Since the amount of 
training provided in this study was 
greater than that used in the Briggs et al. 
study, large amounts of transfer for the 
simplified groups were anticipated. 

Therefore, it may be that the benefit 
accrued by the simplified-whole groups 
during training (the acquisition of time- 
sharing skills) was offset in comparison 
with the pure-part groups by their train- 
ing on the specific dynamics of the 
transfer task. Similarity of training and 
transfer tasks emerges, therefore, as a 
second major determinant of transfer 
performance. 

It is concluded that some form of 
progressive-part training will be superior 
to methods such as pure-part and 
simplified-whole for the acquisition of 
skill in a complex, multidimensional task,
since the progressive-part method utilizes 
a training task of high similarity to the 
transfer task and it also provides an 
opportunity to develop efficient time- 
sharing behavior. 




Finally, attention is directed to the 
fact that practice on the whole task 
during transfer rather quickly eliminated, 
at least statistically, the differential 
effects of training methods (see Table 3). 
Since whole-task training resulted 
in numerically superior performance 
throughout training, the only argument 
for the pure-part or simplified-whole 
methods rests upon the potential savings 
in training time on the whole task itself 
following training. This argument is 
tenable only to the extent that (a) a 
minimum of whole-task training can 
equate the effects of the methods in 
terms of final performance, and (b) dur- 
ing transfer following pure-part or sim-
plified-whole training the initial pro-
ficiency of S is able to meet some criterion
of operating safety, i.e., it is conceivable 
that even after extended training S could 
“lose control” of a complex system and 
thereby violate safety margins for him- 
self and the system. 

The first point above was met in the 
present study by the very rapid progress 
of the pure-part and simplified-whole 
groups during transfer; however, their 
original transfer performance was very 
poor, and the pure-part groups, in 
particular, probably would have been 
judged “dangerous” had their training 
and transfer tasks been in a real-life 
vehicular system. 


SUMMARY 


The relative efficiency of four training 
methods (pure-part, progressive-part, simpli- 
fied-whole, and whole task) in the acquisition 
of skill was investigated in a three-dimensional 
tracking task. Two levels of transfer task 
complexity were used. There were eight daily 
training sessions followed by two transfer 
sessions on the whole task. 

The results showed that training via the 
whole and the progressive-part methods re- 
sulted in statistically equivalent performance 
during transfer for both levels of transfer 
task complexity. Both the whole and the
progressive-part methods resulted in transfer
performance which was statistically superior
to that of Ss trained via either the pure-part
or the simplified-whole method. Further, as
task complexity was increased, the whole and
the progressive-part training methods in-
creased the absolute (but not the relative)
superiority of transfer performance compared
to that with pure-part and simplified-whole
methods.

Two factors emerged from a logical analysis 
of the results to explain the superiority of the 
whole and the progressive-part methods: 
first, the transfer (whole) tasks required 
rather efficient time-sharing skills and these 
could not be acquired under the pure-part 
method since, by definition, S experiences 
only one dimension of the task at a time under 
this training method, and second, training and 
transfer task similarity was not as high for 
the simplified-whole groups as it was for 
those Ss trained under the progressive-part 
method; thus, even though the former Ss
could acquire the necessary time-sharing
skills, they were “penalized” by training on a
task which bore less similarity to the transfer 
task than was the case for the latter Ss. 


REFERENCES 


Birmingham, H. P., & Taylor, F. V. A
human engineering approach to the design
of man-operated continuous control sys-
tems. USN Res. Lab. Rep., 1954, No. 4333.

Briggs, G. E., Fitts, P. M., & Bahrick,
H. P. Transfer effects from a single to a
double integral tracking system. J. exp.
Psychol., 1958, 55, 135-142.

Briggs, G. E., & Waters, L. K. Training
and transfer as a function of component
interaction. J. exp. Psychol., 1958, 56,
492-500.

Duncan, D. B. Multiple range and multiple
F tests. Biometrics, 1955, 11, 1-42.

Naylor, J. C. Parameters affecting the
relative efficiency of part and whole practice
methods: A review of the literature. USN
Train. Dev. Cent. tech. Rep., 1962, No.
950-1.

Russell, L. Characteristics of the human as
a linear servo-element. Unpublished
master's thesis, Massachusetts Institute of
Technology, 1951.


(Received October 30, 1961) 


Journal of Experimental Psychology
1962, Vol. 64, No. 5, 513-518


RECRUITMENT, LATENCY, MAGNITUDE, AND 
AMPLITUDE OF THE GSR AS A FUNCTION 
OF INTERSTIMULUS INTERVAL 1


WILLIAM F. PROKASY, JAMES T. FAWCETT,2 AND JOHN F. HALL


Pennsylvania State University 


White and Schlosberg (1952) and 
Moeller (1954) report that, within the 
ranges of 0 and 5 sec., the optimum 
interstimulus interval (ISI) in GSR 
conditioning is approximately .5 sec. 
when magnitude of response is em- 
ployed as the dependent variable. 
Bierbaum (1955), in contrast, reports 
a greater response magnitude near 
3 sec. 

While it is not clear why his ISI 
function should diverge from that ob- 
tained by White and Schlosberg 
(1952) and Moeller (1954), Bierbaum 
(1955) reports, in addition, that re- 
cruitment (time elapsing between the 
onset of a GSR and its maximum) 
increases with increases in the ISI. 
Unpublished data from our laboratory 
suggest, further, that both GSR 
latency and recruitment modify across 
extinction trials. 

It is the purpose of the present 
study (a) to examine further GSR 
conditioning as a function of ISI and 
(b) to relate the independent variable 
(ISI) to three GSR attributes other 
than magnitude: latency, recruitment, 
and amplitude. The distinction be- 
tween magnitude and amplitude first 
was made by Humphreys (1943) and
more recently was discussed by Hil-
gard (1951, p. 528). These authors
employed the word magnitude to refer
to a mean based upon all trials, includ-
ing those which resulted in no measur-
able response, while they adopted the
word amplitude to refer to means
derived only from those trials on
which a response occurred.

1 This study was supported by an NSF
grant (G-7463) to the senior author.
The authors gratefully acknowledge the aid
of Herbert Krauss and Barry Lively in
transforming the data to log con-
ductance units, and of the staff at the com-
puter center of Pennsylvania State University
for making the IBM 650 available to us.

2 Now with the Peace Corps, Washington,
D. C.


METHOD 


Subjects.—The Ss were 129 men and
women enrolled in introductory psychology 
who volunteered with the knowledge that 
shock would be employed. Of these, 23 were 
lost due to equipment malfunction, adapta- 
tion to shock, or E's error. 

Apparatus.—The GSR was measured by
means of the Fels dermohmeter. The record-
ing electrodes were .15-in. zinc electrodes set
at the base of Plexiglas cups of .125-in. depth.
A zinc oxide electrode paste filled the cups and
made contact with S. The cups were placed
approximately 1 in. apart (center to center)
on S's left palm with a constant current of
70 µa. transmitted between them. Resistance
changes were recorded on an Esterline-Angus
milliammeter operating at a paper speed of
24 in. per min.

The CS, a 76-db. (re .0002 dynes/cm²)
1000-cps tone, was presented by means of a
Grason-Stadler twin oscillator through Permo- 
flux PDR-8 earphones, and was superimposed 
on a constant background of white noise 
generated through the earphones at 52 db. 
The UCS was a shock administered by an ac 
variac through .5-in. copper electrodes taped 
to S's right index finger. CS and UCS 
duration were controlled through Hunter- 
Brown interval timers. 

Procedure.—The Ss were assigned, ran-
domly, to one of five treatment conditions:
0-, .5-, 1-, 3-, or 5-sec. ISI. CS durations for
the five conditions, respectively, were: .2, .5,
1, 3, and 5 sec. UCS duration was .2 sec. and 
began with the termination of the CS in all 
except Group 0, in which the CS and UCS
overlapped.


The S was seated in a sound-shielded room 
(6 X 10 X 6.5 ft.) which was illuminated by a 
75-w. incandescent bulb. After affixing GSR 
and shock electrodes, E indicated that shock 
would be administered in increasing amounts, 
beginning with a barely perceptible level, 
until it reached a level which S judged to be 
“highly annoying, but not painful.” The 
mean voltage achieved was 58.7 (SD = 14.5). 
The S was then instructed that he would 
receive either the already experienced shock 
to the finger, a tone through the earphones, 
or both, but that there was no necessary 
relationship between the two events. The 
earphones were positioned and E returned to 
the outer chamber to initiate training trials. 
All Ss received 20 CS-UCS pairings followed 
by 10 test (extinction) trials. The intertrial 
interval was varied unsystematically between 
25 and 35 sec. Final Ns in each group were: 
Group 0, 20; Group .5, 22; Group 1, 22; 
Group 3, 20; and Group 5, 22. 

Response measures.—The measure em- 
ployed on each test trial was the log of 
conductance change, as measured from the 
base at CS onset to the point of maximum 
pen deflection following any response initia- 
tion that occurred between 1 and 5.5 sec. 
after CS onset. To log conductance change 
was added a constant of 9, thus making all 
scores positive with a range of from .549 
to 3. (Only 3 Ss yielded scores below 1, 
and then on only several trials each.) In the 
event that the pen deflection failed to meet the 
criterion of a response, a score of zero was 
recorded. The minimum deflection recorded 
(as measured by a specially constructed 
template) was 25, 50, 100, 250, or 500 ohms, 
depending upon the dermohmeter sensitivity
setting required to adjust appropriately to the 
extent of S's UCR during shock adjustment. 
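
The scoring rule described above can be sketched as follows. This is only an illustration under stated assumptions: the text does not give the base of the logarithm or the conductance unit, so the sketch assumes base-10 logs of conductance in mhos (reciprocal ohms), and the resistance readings and the function name `conductance_score` are hypothetical.

```python
import math

def conductance_score(base_ohms, peak_ohms, constant=9.0):
    """Log conductance change plus a constant; zero when no response occurred.

    Assumes base-10 logs and conductance in mhos (1 / ohms); neither is
    stated explicitly in the text.
    """
    # Conductance rises as skin resistance falls during a GSR.
    delta = 1.0 / peak_ohms - 1.0 / base_ohms
    if delta <= 0:
        return 0.0  # no measurable response: a score of zero is recorded
    return math.log10(delta) + constant

# Hypothetical readings: a 500-ohm drop from a 50,000-ohm base level.
score = conductance_score(50_000, 49_500)
print(round(score, 3))
```

With these hypothetical readings the score falls inside the reported range of .549 to 3, which is the only check the sketch is meant to support.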

Both magnitude and amplitude were 
recorded as the mean of log conductance 
change plus 9 on Test Trial 1 and on each of 
three blocks of three of the subsequent nine 
test trials. Magnitude was obtained by in- 
cluding the score of zero on trials in which no 
response occurred. The amplitude index did 
not include scores of zero. For example, if 
S's scores on Test Trials 2, 3, and 4 were 
2.5, 0, and 1.1, mean magnitude for those 
trials would be 1.2 while mean amplitude 
would be 1.8.
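
The worked example above can be reproduced with a short sketch. The function names are illustrative only; the rule itself (zeros included for magnitude, excluded for amplitude) is the one stated in the text.

```python
def magnitude(scores):
    """Mean over all trials, counting no-response trials as zero."""
    return sum(scores) / len(scores)

def amplitude(scores):
    """Mean over only those trials on which a response occurred."""
    responses = [s for s in scores if s > 0]
    # With no responses at all, amplitude is undefined.
    return sum(responses) / len(responses) if responses else None

# Scores on Test Trials 2, 3, and 4 from the example in the text.
trial_scores = [2.5, 0, 1.1]
print(round(magnitude(trial_scores), 1))  # 1.2
print(round(amplitude(trial_scores), 1))  # 1.8
```

Because amplitude is undefined for an S who never responds, analyses of amplitude necessarily rest on fewer Ss than analyses of magnitude.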

FIG. 1. Amplitude and magnitude as a function of ISI, measured on Test Trial 1. (Ordinate: 9 + log conductance change; abscissa: interstimulus interval in seconds.)

Latency was defined as the time elapsing
between CS onset and the occurrence of the
first response in the criterion range, while
recruitment was defined as the amount of
time elapsing between response initiation and
response peak. The units of our template
were in .03-in. increments which, in conjunc-
tion with the paper speed of 24 in. per min.,
permitted time measurement to the nearest
75 msec.
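
The 75-msec. figure follows directly from the template increment and the paper speed. A minimal sketch of the arithmetic is given below; the constant and function names are illustrative, and the 1-in. example distance is hypothetical.

```python
PAPER_SPEED_IN_PER_MIN = 24.0   # paper speed of the recorder
TEMPLATE_STEP_IN = 0.03         # smallest template increment

# 24 in. per min. is .4 in. per sec.
speed_in_per_sec = PAPER_SPEED_IN_PER_MIN / 60.0

def distance_to_seconds(distance_in):
    """Convert a distance along the paper record to elapsed time."""
    return distance_in / speed_in_per_sec

resolution_msec = distance_to_seconds(TEMPLATE_STEP_IN) * 1000.0
print(round(resolution_msec))  # 75
print(round(distance_to_seconds(1.0), 2))  # 1 in. of paper is 2.5 sec.
```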

RESULTS 


First extinction trial. —Responses on 
the first extinction trial were em- 
ployed as the most direct index of the 
influence of the ISI variable, as sub- 
sequent responses necessarily reflected 
the added operation of UCS omission. 
Mean magnitude and amplitude as a 
function of ISI are shown in Fig. 1. 
Amplitude means, based only upon 
those Ss who responded, are obtained 
from Ns of 17, 22, 22, 16, and 17 for 
Groups 0, .5, 1, 3, and 5, respectively. 

Magnitude is greatest at .5 sec.
and declines with longer ISI intervals,
the decrease over ISI values of .5 to 5
sec. being significant at the .05 level
(F = 3.98, df = 3/82, error MS = .593).
A similar analysis of amplitude did
not result in significant differences
(F = [illegible], df = 3/73, error MS = .119).

Table 1 summarizes latency and
recruitment data on Test Trial 1.
For neither recruitment (F = 1.1,
df = 4/90, error MS = 1.273) nor
latency (F = 1.37, df = 4/90, error
MS = .775) were the differences
among groups significant, although it
can be noted that the longest latencies
occurred in Groups 3 and 5.


TABLE 1
LATENCY AND RECRUITMENT MEANS AND SDS ON TEST TRIAL 1 AS A FUNCTION OF ISI

ISI             Latency (Sec.)     Recruitment (Sec.)
(Sec.)    N     Mean     SD        Mean     SD
0         17    2.48     .79       2.50     .72
.5        22    2.55     .48       3.14     1.08
1         22    2.58     .90       2.93     [illegible]
3         16    3.11     1.22      3.23     [illegible]
5         17    2.76     .98       2.87     1.82


Final nine extinction trials.—Data
of all Ss who responded at least once
in each of three blocks of three ex-
tinction trials were employed in the
analyses of variance of the four
response measures. Table 2 sum-
marizes the means for all groups while
Table 3 provides a summary of the
analyses of variance. Significant be-
tween-groups effects were obtained
only with the recruitment and latency
measures, in both instances a larger
time value being associated with
longer ISIs. The significant between-
Ss effect obtained with all measures
attests to their reliability.

Both magnitude and latency
changed across test trial blocks, with
magnitude decreasing and latency in-
creasing. There was a tendency for
amplitude to decrease, suggesting that
there was some decrease in the size of a
response provided that one occurred.


TABLE 2
MEAN LATENCY, RECRUITMENT, AMPLITUDE, AND MAGNITUDE OVER THREE BLOCKS OF THREE TEST TRIALS AS A FUNCTION OF ISI

                Amplitude                 Magnitude                 Latency (Sec.)         Recruitment (Sec.)
ISI             Trial Block               Trial Block               Trial Block            Trial Block
(Sec.)    N     1        2        3       1        2        3       1       2       3      1       2       3
0         9     [illeg.] 2.384    2.581   2.285    1.854    2.459   2.53    2.58    2.52   2.08    2.00    2.06
.5        19    2.506    2.375    2.343   2.459    2.197    2.136   2.48    2.54    2.49   2.15    2.09    2.12
1         18    2.451    2.400    2.391   2.415    2.223    2.203   2.47    2.58    2.75   2.61    2.39    2.36
3         17    2.409    2.305    2.404   2.201    2.085    1.926   3.17    3.31    3.32   3.23    2.84    3.02
5         18    2.402    2.367    2.290   2.374    2.230    2.096   2.89    2.95    3.41   2.93    3.00    2.73

Note.—Within-cell SDs varied from .18 to .50 for amplitude, from .24
to .80 for magnitude, from .27 to 1.14 for
latency, and from .44 to 1.21 for recruitment.

TABLE 3
ANALYSES OF VARIANCE OF MAGNITUDE, AMPLITUDE, LATENCY, AND RECRUITMENT AS A FUNCTION OF ISI OVER THREE BLOCKS OF THREE TEST TRIALS EACH

                       Amplitude             Magnitude             Latency               Recruitment
Source          df     MS         F          MS         F          MS         F          MS         F
Between Ss      80
  ISI           4      .37        1.72       [illeg.]   [illeg.]   6.061      9.06***    8.879      6.91***
  Error (b)     76     [illeg.]   3.86***    .752       4.82***    .668       2.58***    1.285      3.65***
Within Ss       162
  Trials        2      .143       2.56       1.716      11.00      .935       [illeg.]   .521       1.48
  ISI X Trials  8      .056       1.00       .056       .36        [illeg.]   1.02       [illeg.]   .58
  Error (w)     152    .056                  .156                  [illeg.]              [illeg.]

* P < .05.
*** P < .001.


TABLE 4
MAGNITUDE MEANS AND SDS FOR ALL Ss AS A FUNCTION OF ISI OVER THREE BLOCKS OF THREE TEST TRIALS EACH

                        Test Trial Block
ISI             1                2                3
(Sec.)    N     Mean     SD      Mean     SD      Mean     SD
0         20    1.411    .95     1.030    .98     1.116    1.11
.5        22    2.260    .58     1.960    .81     1.845    .95
1         22    2.183    .72     1.931    .92     1.915    .81
3         20    1.926    .89     1.892    .92     1.637    .98
5         22    2.007    .93     1.925    .84     1.715    .99


In the above analyses Group 0 had 
only 9 Ss that met the criterion of at 
least one response in each block of 
three test trials. Because of possible 
bias that might enter by such selection 
of Ss, a second analysis of magnitude, 
incorporating data from all Ss, was 
made. Table 4 provides the means 
and SDs while Table 5 summarizes 
the analysis of variance. 

Table 5 reveals, first, that there is a 
significant between-groups effect and, 
second, that response reliability (as 
evidenced by the sharply increased F 
value for between-Ss means) has in- 
creased. An examination of mean 
magnitude in Table 4 indicates that 
the bulk of the differences between 
groups rests in Group 0; that is, the 
increase in N in Group 0 over the 
prior analysis added sufficient num-
bers of zeros to provide a between-
groups effect.


TABLE 5
ANALYSIS OF VARIANCE OF MAGNITUDE FOR ALL Ss AS A FUNCTION OF ISI AND THREE BLOCKS OF THREE TEST TRIALS EACH

Source            df     MS       F
Between Ss        105
  ISI             4      7.502    [illegible]
  Error (b)       101    2.022    9.96***
Within Ss         212
  Trials          2      2.343    11.54***
  ISI X Trials    8      .179     .88
  Error (w)       202    .203

** P < .01.
*** P < .001.

Double responses.—Although pre- 
ceding analyses were based on the 
first GSR occurring in the interval 
from 1 to 5.5 sec. after CS onset, there 
were, in Groups 3 and 5, frequent 
instances of a second response with a 
latency range of from 4 to 8.5 sec. 
after CS onset. Figure 2, a tracing 
of one such response from an S in 
Group 5, illustrates the onset of the 
first response, a plateau, and then the 
onset of the second response. In 
Group 5, of 20 Ss who responded, 19 
gave at least one double response. 
In Group 3, of 18 Ss who responded, 
15 gave at least one double response. 
With the exception of 1 S on one trial 
in Group 0, no double responses were 
observed in Groups 0, .5, and 1. 


FIG. 2. A tracing of a double response
from 1 S in Group 5. (Onset of the 5-sec.
event marker is temporally coincident with
the vertical line indicating initiation of R1 and
R2, but is displaced due to physical location
of event recorder. R1 and R2 are latencies of,
respectively, the first and second response.)


DISCUSSION


The ISI function obtained with mag- 
nitude as the dependent variable is 
similar to that obtained by White and 
Schlosberg (1952) and Moeller (1954). 
Within the range tested, results of all 
three studies suggest that magnitude is 
greatest near an ISI of .5 sec., similar 
to the optimum value obtained in 
numerous studies of skeletal response 
conditioning (see Kimble, 1961, pp. 
156f.). That amplitude did not vary as a
function of ISI suggests that the mag-
nitude function is determined in large
part by whether or not Ss in the different
ISI conditions responded.3 On this basis,
the distinction between magnitude and 
amplitude made by Humphreys (1943) 
and Hilgard (1951) merits further con- 
sideration. 

Two of the three observations on 
latency and recruitment have some 
precedent in the literature: the positive 
relationship between ISI and recruitment 
corroborates Bierbaum’s (1955) finding 
and the increase in latency across ex-
tinction trials is consistent with Pavlov's
(1927, p. 49) report that the latency of 
salivation increases during extinction 
trials. We have been unable to find a 
precedent for the positive relationship 
between latency and ISI in autonomic 
conditioning, though such a relationship 
has been obtained with conditioned 
skeletal responses (e.g., Boneau, 1958;
Ebel, 1961). Though data are limited,
present evidence suggests that, in addi-
tion to magnitude, the very form of the
GSR can be brought under the control of
external stimulating events. Such a
possibility warrants the continued ob-
servation of latency and recruitment in
GSR research.

3 Ratios of number of responders to number
in each group on Test Trial 1 were 17/20,
22/22, 22/22, 16/20, and 17/22 for, respectively,
Groups 0, .5, 1, 3, and 5. While there is no
entirely adequate statistical test available
to compare these frequencies for the five
groups simultaneously, a Fisher exact test
comparing frequency of responders in Group
.5 with that of Group 3 and of Group 5
yielded P values of, respectively, .043 and .032
for frequency disparities as large as those
obtained. Furthermore, employing the ob-
tained order of mean magnitude (from high
to low) in the five groups as a basis for
comparison, of the 120 possible ways that the
ratios could be ordered only 5 would cor-
respond as well as or better to the magnitude
order than that obtained, even if the tied
ratio between Groups .5 and 1 were counted
as a reversal.

Few references have been made to 
second responses in GSR studies. Rod- 
nick (1937), with ISIs of 17 and 21 sec., 
found that a second response would 
occur shortly before shock onset, and 
Stewart, Stern, Winokur, and Fredman 
(1961) demonstrated the acquisition of a 
second GSR while employing an ISI of 
75 sec. Grings, Lockhart, and Dameron
(1961) have shown, further, that this 
second response is differentiated more 
rapidly than is the first response in a 
discrimination learning situation in which 
an ISI of 5 sec. is employed. While the 
nature of the second response is not 
understood, it tends to occur prior to 
UCS onset. Whether or not its absence 
from Groups 0, .5, and 1 results because 
it does not occur in these groups or 
because recording speed is too slow to 
separate response rate changes super- 
imposed on the first response remains to 
be investigated. 


SUMMARY 


Five groups of Ss trained at CS-UCS 
intervals of 0, .5, 1, 3, or 5 sec. were employed 
in a study of the role of the interstimulus 
interval (ISI) in the conditioning of the GSR. 
All Ss received 20 tone-shock pairings followed 
by 10 tone alone (test) trials. Four attributes 
of the GSR were measured: amplitude,
magnitude, latency, and recruitment. 

The principal findings were: (a) magnitude 
was greatest with an ISI of .5; (b) magnitude, 
but not amplitude, varied with ISI; (c) la- 
tency and recruitment both increased as a 
function of ISI; (d) latency increased across 
extinction trials; and (e) a second response 
was observed frequently in Ss exposed to 
ISIs of 3 and 5 sec.


REFERENCES 
Bierbaum, W. B. Temporal aspects in con-
ditioning of the GSR. Unpublished doc-
toral dissertation, University of Florida,
1955.

Boneau, C. A. The interstimulus interval
and the latency of the conditioned eyelid
response. J. exp. Psychol., 1958, 56, 464-
471.

Ebel, H. C. Stable-state behavior and
reversibility as a function of the inter-
stimulus interval variable in classical eyelid
conditioning. Unpublished master's thesis,
Pennsylvania State University, 1961.

Grings, W. W., Lockhart, R. A., & Dam-
eron, L. E. Interstimulus interval as a
variable in GSR conditioning of mentally
deficient individuals. Paper read at
American Psychological Association, New
York, 1961.

Hilgard, E. R. Methods and procedures in
the study of learning. In S. S. Stevens
(Ed.), Handbook of experimental psychology.
New York: Wiley, 1951.

Humphreys, L. G. Measures of strength of
conditioned eyelid responses. J. gen.
Psychol., 1943, 29, 101-111.

Kimble, G. A. Hilgard and Marquis'
conditioning and learning. (2nd ed.) New
York: Appleton-Century-Crofts, 1961.

Moeller, G. The CS-UCS interval in GSR
conditioning. J. exp. Psychol., 1954, 48,
162-166.

Pavlov, I. Conditioned reflexes. New York:
Oxford Univer. Press, 1927.

Rodnick, E. H. Characteristics of delayed
and trace-conditioned responses. J. exp.
Psychol., 1937, 20, 409-425.

Stewart, M. A., Stern, J. A., Winokur,
G., & Fredman, S. An analysis of GSR
conditioning. Psychol. Rev., 1961, 68, 60-
67.

White, C. T., & Schlosberg, H. Degree of
conditioning of the GSR as a function of the
period of delay. J. exp. Psychol., 1952, 43,
357-362.


(Received October 30, 1961)


Journal of Experimental Psychology
1962, Vol. 64, No. 5, 519-525


ON-TARGET VERSUS OFF-TARGET INFORMATION AND 
THE ACQUISITION OF TRACKING SKILL 1


ALTON C. WILLIAMS 2 AND GEORGE E. BRIGGS


Ohio State University 


Knowledge of results or information
feedback has been an area of sub-
stantial research interest since the
classic study of Thorndike (1927), the
most recent review for motor skills
being that of Bilodeau and Bilodeau
(1961). The present study was con-
cerned with a restricted topic within
the more general area of knowledge of
results: the influence of augmented
feedback on the acquisition of skilled
performance. As the term implies,
augmented feedback represents knowl-
edge of performance in addition to
that normally present in a skill task
and it is defined as secondary in-
formation supplemental to some pri-
mary feedback signal(s).

Of the numerous studies on acquisi- 
tion of tracking skill only those by 
Rapparlie (see Bray, 1948, pp. 195- 
196) and by Payne and Hauty (1955) 
have employed augmented feedback 
based on an off-target criterion, the 
majority of Es having activated such 
signals when tracking error was within 
tolerance, i.e., when S was on-target.
Thus, augmented feedback has been 
utilized in a majority of the research 
as a means of emphasizing “correct” 
responding, a use which seems logical 


Ipis research was carried out in the 
Laboratory of Aviation Psychology and was 
supported by the Unite 
Contract No. N61339-830, sponsored by the 
United States Naval Training Device Center, 
Port Washington, New York.. Permission is 
granted for reproduction, translation, publica- 
tion, use, and disposal in whole or in part for 
any purpose of the United States Government. 

2 Now with the Space Systems Division, 
United States Air’ Force, Los Angeles, 
California. 


in view of the Skinnerian position that 
errors or incorrect responses should 
be minimized in the construction of 
training tasks and programs, at least 
for discrete-verbal skills. However, 
it is an open question whether in 
continuous motor skill tasks error 
information feedback might not be 
more effective than information on 
correct responding, and the present 
study was concerned, therefore, with 
the effects on tracking performance of 
augmented feedback based on an 
off-target criterion relative to that 
based on an on-target criterion. Fur- 
ther, a simple off-target criterion 
condition and an off-target condition 
providing directional information were 


compared. 
METHOD 


Apparatus.—The skill task was defined by 
the SETA apparatus (Gain & Fitts, 1959) 
which provides S with a simple positional 
tracking task. The S tracked a 6 cpm + 12 
cpm sinusoidal signal via a one-dimensional 
compensatory display of tracking information 
on a 5-in. cathode ray tube. This display 
was noise free and provided S with his primary 
source of feedback information. Augmented 
feedback was provided to the experimental 
groups in the form of auditory clicks delivered 
to headphones worn by S. The clicks occurred 
at the rate of two per sec. when S was 
within or outside (depending upon the experi- 
mental condition) preset tolerance limits of 
system error. The tolerance limits were 
attained by adjusting two voltage comparators 
to the desired voltage levels such 
that a plate voltage relay was activated 
whenever tracking error fell within the 
critical levels. Activation of the relay either 
closed a circuit from the click generator to the 
headphones or opened that circuit depending 
upon the experimental condition. Further, it 
was possible to activate only the left ear- 
phone when tracking error was to the left on 




the visual display and only the right earphone 
in the opposite condition. The on-target 
band thus defined represented tracking error 
of rather small amplitudes: in terms of the 
visual display, the on-target band was ±.08 
in. around the fixed target element. 

Performance was scored by electronically 
integrating the absolute value of tracking 
error. This is the analog of the average 
deviation of S's error amplitude distribution 
and will be identified as average error. The 
scores were transformed from voltage units 
to units of inches on the display scale; thus, 
average error as reported here may be inter- 
preted as the average deviation of S’s tracking 
error amplitude distribution plotted on the 
same scale used in the visual tracking display. 

In addition to average error, E recorded 
the amount of time during each tracking 
trial when tracking error was within the 
tolerance limits. These data were trans- 
formed to percent time within tolerance 
scores. 
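
The two performance measures described above can be illustrated with a minimal sketch. It assumes that tracking error is available as equally spaced samples in display inches (the original scoring used analog integration, not sampling), and that a sample lying exactly on a tolerance limit counts as within tolerance. The function name `score_trial` and the sample values are hypothetical.

```python
def score_trial(errors_in, tolerance_in=0.08):
    """Return (average error, percent time within tolerance) for one trial.

    errors_in    -- signed tracking-error samples in display inches (assumed
                    equally spaced in time; an assumption of this sketch)
    tolerance_in -- half-width of the on-target band in inches
    """
    if not errors_in:
        raise ValueError("need at least one error sample")
    n = len(errors_in)
    # Average error: mean of the absolute error (the average deviation).
    average_error = sum(abs(e) for e in errors_in) / n
    # Time within tolerance: share of samples inside the on-target band.
    within = sum(1 for e in errors_in if abs(e) <= tolerance_in)
    return average_error, 100.0 * within / n


# Illustrative samples only, not data from the study.
samples = [0.02, -0.05, 0.10, -0.20, 0.03]
avg, twt = score_trial(samples)
print(avg, twt)
```

The two scores need not rank trials in the same order: one large excursion raises average error sharply while lowering time within tolerance only slightly.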

Subjects and procedure.—The Ss were 88 
volunteer male undergraduates. Each S 
participated in five daily 30-min. sessions, 
and none of the Ss had previous experience 
with a laboratory tracking task. 

Twenty-two Ss were assigned via a 
randomization procedure to each of four 
groups. The control group, Group C, did not 
experience augmented feedback at any time 
during the experiment. The three experi- 
mental groups experienced the auditory 
clicks during the initial three and one-half 
training sessions whereupon augmented feed- 
back was withdrawn for the remaining one 
and one-half sessions. Group I received 
augmented feedback whenever tracking error 
was within the preset tolerance limits (see 
above), while Groups O and O-D received 
the signals whenever tracking error exceeded 
the on-target limits. For Groups I and O 
augmented feedback occurred simultaneously 
in both earphones, while for Group O-D the 
left earphone was activated when displayed 
error was to the left of the fixed target refer- 
ence on the visual display, and the right ear- 
phone was activated when error was to the 
right. 
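
The feedback rule for the four groups can be written out as a small sketch. It assumes that a negative error value stands for displayed error to the left of the target reference and that error exactly at a tolerance limit counts as on-target; neither convention is stated in the text. The function name `click_channels` is hypothetical.

```python
def click_channels(group, error_in, tolerance_in=0.08):
    """Return the set of earphones that receive clicks during training.

    group        -- "C", "I", "O", or "O-D"
    error_in     -- signed displayed error in inches; negative is taken to
                    mean error to the left (an assumption of this sketch)
    tolerance_in -- half-width of the on-target band in inches
    """
    both = {"left", "right"}
    on_target = abs(error_in) <= tolerance_in
    if group == "C":      # control: no augmented feedback at any time
        return set()
    if group == "I":      # clicks in both ears while on-target
        return both if on_target else set()
    if group == "O":      # clicks in both ears while off-target
        return set() if on_target else both
    if group == "O-D":    # off-target clicks in the ear on the side of the error
        if on_target:
            return set()
        return {"left"} if error_in < 0 else {"right"}
    raise ValueError("unknown group: " + repr(group))


print(click_channels("O-D", -0.15))
```

During the transfer blocks every group is treated like Group C, that is, no earphone is activated.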

All tracking trials were of 30-sec. duration 
and were administered in blocks of four trials. 
There were 3 blocks of trials on the first and 
last sessions and 4 blocks in each of the other 
sessions, a total of 18 blocks. The initial 13 
blocks defined the training conditions for the 
experimental groups, while transfer to the 
no-augmented-feedback condition occurred 
over the final 5 blocks of trials. Rest periods 
of 30 sec. were provided between trials within 


ALTON C. WILLIAMS AND GEORGE E. BRIGGS 


a block and 1.5 min. rest occurred between 
blocks. Performance was scored over the 
final 25 sec. of each trial, which permitted the 
initial transients in tracking behavior to 
dampen out prior to scoring. 

The instructions to all Ss stressed the 
importance of maintaining zero error on the 
visual display, and each S was reminded of 
this ultimate goal at the beginning of each 
session. 


RESULTS 


Average error.—Figure 1 provides a 
summary of tracking accuracy as de- 
fined by the average error scores. It 
may be noted that all three experi- 
mental groups attained proficiency 
levels superior to that of Group C. 
Further, Group O was superior to both 
Groups I and O-D throughout both 
training and transfer, and there was 
considerable overlap between the lat- 
ter two groups during training. It 
appears, then, that augmented feed- 
back based on a simple off-target 
criterion (Group O) resulted in per- 
formance superior to that attained 
either with an on-target criterion 
(Group I) or with an off-target cri- 
terion which included directional in- 
formation (Group O-D). 

These observations are supported 
by the results of the Mann-Whitney U 
test which was applied to the average 
error data for groups. The tracking 
error scores were summed for each S 
across all 18 trial blocks for this 
analysis; thus, n1 = n2 = 22 for each 
two-group comparison performed. 
The alternative hypothesis tested here 
was that more Ss in one of a pair of 
groups were superior in tracking pro- 
ficiency than could occur by chance. 
A nonparametric analysis was em- 
ployed since marked heterogeneity of 
variance was found in the data (see 
Fig. 2). The results of the U tests 
are listed in the upper half of Table 1, 
where it may be noted that all group 
comparisons accepted the alternative 
hypothesis at P < .05 except that be- 
tween Groups I and O-D. 
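
The z values reported with the U statistics are consistent with the usual large-sample normal approximation. The sketch below assumes no correction for ties or continuity, which is an inference from the reported figures and not something the text states; the function names are hypothetical.

```python
import math


def u_to_z(u, n1, n2):
    """Normal approximation to Mann-Whitney U (no tie or continuity correction)."""
    mean_u = n1 * n2 / 2.0
    sd_u = math.sqrt(n1 * n2 * (n1 + n2 + 1) / 12.0)
    return (u - mean_u) / sd_u


def two_tailed_p(z):
    """Two-tailed probability of a standard normal deviate as extreme as z."""
    return math.erfc(abs(z) / math.sqrt(2.0))


# U values for the average error comparisons, n1 = n2 = 22.
for u in (333, 346, 394, 270, 330, 339):
    z = u_to_z(u, 22, 22)
    print(u, round(z, 2), round(two_tailed_p(z), 3))
```

With n1 = n2 = 22 the expected U under the null hypothesis is 242 and its standard deviation is about 42.6, so U = 333 corresponds to z of about 2.14.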


ACQUISITION OF TRACKING SKILL 


Fig. 1. Average tracking error for all groups during training and transfer. (Ordinate: average error in inches; abscissa: four-trial blocks, training and transfer.) 


Variability.—Augmented feedback 
also exerted an influence on inter-S 
variability within groups (intragroup 
variability), and these data are sum- 
marized in Fig. 2. The data plotted 
are intragroup SDs of the average 
error scores for each block of training 
and transfer trials. As with the aver- 
age error data (Fig. 1), the three ex- 
perimental groups are superior to the 
control group in terms of intragroup 
variability. However, here, Group O 
does not enjoy a clear superiority over 
Group O-D, as was the case in Fig. 1, 
but both Groups O and O-D do ex- 
hibit more inter-S homogeneity than 
that attained by Group I. 

These observations are supported 
by the results of U tests which were 
applied to each pairing of groups. It 
was not possible to test the same 
hypothesis as in the case of the aver- 
age error data; instead, the alter- 


native hypothesis tested with the 
intragroup SDs was that more intra- 
group SDs for one of a pair of groups 
would be smaller in value than those 
for the other group than could occur 
by chance. For each comparison 
n1 = n2 = 18. The results are listed 
in the lower half of Table 1 where it 
may be seen that (a) Groups O and 
O-D exhibited more intragroup homogeneity 
than did Group I, and (b) all 
three experimental groups were superior 
in this regard to Group C. It 
may be concluded, then, that aug- 
mented feedback not only results in a 
higher level of performance accuracy 
but also individual differences within 
groups are less than is the case for Ss 
who do not experience these addi- 
tional feedback cues. 

Time within tolerance.—The percentage 
of time S spent within the 
±.08-in. tolerance limits (TWT) is 


Fig. 2. Inter-S variability for all groups during training and transfer. (Ordinate: intersubject variability in inches; abscissa: four-trial blocks.) 


summarized for each four-trial block 
in Fig. 3. While the differences 
among experimental groups are not as 
apparent as with the average error 
data, there are statistically significant 
(P <.01) differences both during 
training and transfer (see Table 2). 




The pattern of group proficiency 
levels on TWT differs in an important 
way from that found with the average 
error data: in terms of the TWT 
scores Group I is significantly superior 
to all other groups during training at 
P < .05. This rank order for Groups 


TABLE 1 

RESULTS OF THE MANN-WHITNEY U TEST APPLIED TO AVERAGE ERROR AND INTER-S VARIABILITY DATA 

Performance Criterion          Groups Compared* 
                           C/I     C/O-D   C/O     I/O-D   I/O     O-D/O 
Average Error         U    333     346     394     270     330     339 
(n1 = n2 = 22)        z    2.14    2.44    3.57    0.66    2.07    2.28 
                      P    [illegible]  .014  .001  .510   .038    .0225 
Variability           U    [entries partly illegible; legible values: 16, 46, 63] 
(n1 = n2 = 18)        P    <.002   <.002   <.002   <.002   <.002   > [value illegible] 

* For average error: less/more accurate group; for variability: more/less variable group. 




Fig. 3. Percent time within tolerance for all groups during training and transfer. (Ordinate: percent time within tolerance; abscissa: four-trial blocks.) 


I and O is just the reverse of that 
noted for the average error measure 
where Group O was superior, and this 


TABLE 2 

ANALYSIS OF VARIANCE OF PERCENT TIME WITHIN TOLERANCE DATA FOR TRAINING AND TRANSFER 

Source                         df         MS          F 
Training 
  Groups (G)                    3      2741.13       7.22 [asterisks illegible] 
  Ss within groups (Ss/G)      84       379.54 
  Blocks (B)                   12      2470.55     182.66*** 
  B X G                        36        20.02       1.48* 
  B X Ss/G                   1008        13.52 
Transfer 
  Groups (G)                    3      1314.14       5.34** 
  Ss within groups (Ss/G)      84       246.16 
  Blocks (B)                    4        17.18       1.21 
  B X G                        12        39.41       2.78** 
  B X Ss/G                    336        14.20 

[The footnote defining the asterisks is illegible in the source.] 


reversal serves to define how Ss of 
Groups O and I responded differ- 
entially to the task (see below). 

Finally, it may be noted that Group 
I actually deteriorated during transfer 
while Groups C, O, and O-D either 
“held their own” or improved in 
terms of TWT. This deterioration 
gave rise to the significant Blocks 
X Groups interaction in the transfer 
analysis of Table 2. 
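
The F ratios in Table 2 can be checked from the mean squares. The sketch below assumes the conventional error terms for this mixed design, which the text does not spell out: Groups is tested against Ss within groups, while Blocks and Blocks X Groups are tested against Blocks X Ss within groups. The mean squares are as read from the scanned table, and the function name is hypothetical.

```python
def f_ratio(ms_effect, ms_error):
    """F ratio as the effect mean square over its error mean square."""
    if ms_error <= 0:
        raise ValueError("error mean square must be positive")
    return ms_effect / ms_error


# Between-Ss effect: Groups tested against Ss within groups.
print(round(f_ratio(2741.13, 379.54), 2))   # training
print(round(f_ratio(1314.14, 246.16), 2))   # transfer

# Within-Ss effects: tested against Blocks X Ss within groups.
print(round(f_ratio(20.02, 13.52), 2))      # training, Blocks X Groups
print(round(f_ratio(17.18, 14.20), 2))      # transfer, Blocks
print(round(f_ratio(39.41, 14.20), 2))      # transfer, Blocks X Groups
```

The training Blocks ratio computed from the printed mean squares comes to about 182.7, against 182.66 in the table; the small gap is presumably due to rounding of the printed mean squares.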


DISCUSSION 


The majority of past research in this 
area has found superior performance by 
groups trained with augmented feed- 
back compared to no-augmented-feed- 
back control groups. In this regard, the 
present research merely confirms those 
previous data. The contribution of the 
present study is twofold: first, the use of 
an off-target criterion for the activation 
of augmented feedback was found to be 
superior to an on-target criterion, the 




latter being the most common criterion 
used in previous research; and second, 
for the first time it has been noted here 
that not only accuracy but also group 
homogeneity of tracking performance can 
be improved significantly by the use of 
augmented feedback, especially when 
based on an off-target criterion. 

In addition to these major findings, it 
is of interest to note several other points. 
First, from Fig. 1 and Fig. 3 it may be 
seen that no deterioration in performance 
occurred for either Group O or Group 
O-D at transfer (augmented feedback 
withdrawn), but Group I does exhibit 
some deterioration on Transfer Blocks 
1 and 2 of Fig. 1 and over the entire 
transfer session in Fig. 3. This is logical 
in view of the similarity between the 
training and transfer trials for Groups O 
and O-D, i.e., during training these Ss 
experienced a diminution in number of 
auditory clicks as a result of increasing 
skill. Group I, however, experienced a 
marked change in going from training to 
transfer, i.e., these Ss experienced in- 
creasing amounts of auditory clicks with 
training followed by an abrupt change to 
no-augmented-feedback at transfer. 
These observations were supported by 
statistical analyses of the transfer data: 
Groups O and O-D were superior to 
Group C throughout transfer in terms of 
tracking accuracy (P < .05), but Groups 
I and C did not differ (P > .05) from the 
first transfer trial to the last. 

The reason for the loss of superiority 
by Group I over Group C may be found 
in Fig. 2: upon transfer Group I ex- 
hibited an increase in inter-S variability 
considerably greater than the modest 
increase shown by Groups O and O-D. 
It follows, then, that the transfer performance 
deterioration by Group I 
involved not only a loss in tracking 
accuracy (Fig. 1) but also an increase of 
within-group variability (Fig. 2). Since 
apparent accuracy deterioration is not 
noted for Groups O and O-D, and since 
the increase in inter-S variability was 
relatively less than that for Group I 
during transfer, it follows that an off-target 
criterion for the activation of 
augmented feedback is to be preferred 
as a training variable. 

A second point of interest can be 
introduced by again noting from Fig. 1 
that, in terms of average error, Group O 
is superior throughout to Group I. Now, 
the average error metric actually is the 
average deviation (AD) of S's tracking 
error amplitude distribution, and thus 
the smaller average error for Group O 
indicates that, on the average, those Ss 
generated smaller errors than did Group 
I. However, Group I actually spent 
more time within the ±.08-in. tolerance 
limits (see Fig. 3). It follows, then, that 
the shape of the error amplitude dis- 
tribution for Group O must differ from 
that of Group I. 

What might be the forms of the error 
amplitude distributions for Groups O and 
I? The previous data of Bahrick, Fitts, 
and Briggs (1957) suggest that if the 
distribution of tracking error for Group I 
is nonnormal (as in fact is the case for 
the present data), that distribution is 
probably leptokurtic. On this basis, it 
follows that Group O generated a 
bimodal error amplitude distribution, as 
this is the only distribution that could 
result in both smaller AD and smaller 
percent time within tolerance scores for 
Group O compared to Group I. In 
other words, Ss of Group O spent con- 
siderable time tracking closely around 
the two tolerance limit points and made 
relatively few errors of large amplitude, 
while Group I spent more time within the 
tolerance limits but committed occasional 
large tracking errors. 
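
A small numerical illustration shows how one error distribution can have both a smaller average deviation and a smaller percent time within tolerance than another. The two sample sets below are synthetic and chosen only to make the point; they are not data from the study, and the function names are hypothetical.

```python
def average_deviation(errors):
    """Mean absolute error (the AD of the error amplitude distribution)."""
    return sum(abs(e) for e in errors) / len(errors)


def percent_within(errors, tolerance=0.08):
    """Percent of samples whose absolute error is within the tolerance."""
    inside = sum(1 for e in errors if abs(e) <= tolerance)
    return 100.0 * inside / len(errors)


# Bimodal case: errors clustered near the two tolerance limits (+/- .08 in.),
# many just outside the band, none large.
bimodal = [0.07, -0.07, 0.09, -0.09, 0.09, -0.09, 0.09, -0.09, 0.07, -0.07]

# Peaked case: mostly very small errors with occasional large excursions.
peaked = [0.01] * 8 + [0.50, -0.50]

for name, errs in (("bimodal", bimodal), ("peaked", peaked)):
    print(name, round(average_deviation(errs), 3), percent_within(errs))
```

In this constructed case the bimodal set has the smaller average deviation yet spends less time inside the band, the same pattern of ranks described for Groups O and I.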

These deductions from the data, then, 
lead one to conclude that tracking be- 
havior with augmented feedback based 
on an off-target criterion differs funda- 
mentally from that when such feedback 
is based on an on-target criterion: in the 
former case large errors are emphasized, 
and S apparently learns rather quickly 
to minimize these occasional lapses in 
tracking accuracy, while in the latter case 
the importance of small errors is em- 
phasized, but apparently S does not 
respond as quickly or as efficiently to 
correct for occasional large tracking 
errors. Since minimizing large tracking 




errors is a primary task for S early in 
training, it follows that augmented feed- 
back based on an off-target criterion 
should be particularly helpful in shaping 
the desired behavior during early ac- 
quisition of a continuous control skill. 
It is problematical whether or not the 
above generalization will hold in discrete- 
verbal learning tasks. However, it is 
suggested that if the element of discovery 
is present in such discrete tasks, some 
criterion for the activation of augmented 
feedback analogous to the off-target 
criterion in the continuous case would be 
helpful in focusing S's attention on 
response alternatives reasonably close to 
the correct alternative. 

Finally, it is interesting to note that 
the additional feedback information 
available to Group O-D (off-target direc- 
tion of error) did not provide for a 
performance level superior to that of 
Group O which received augmented 
feedback on a more simple, nondirec- 
tional off-target criterion. 

Two explanations for the relative 
inferiority of Group O-D are suggested. 
First, it is possible, of course, that the 
addition of a directional cue to the off- 
target criterion was not particularly 
useful information to Group O-D since 
directional relationships between control 
and display movements are one of the 
most simple aspects of a tracking task to 
be learned. Secondly, it is possible that 
the information on error direction was 
actually disruptive. It may be recalled 
that auditory clicks were delivered to the 
left earphone when error was to the left 
on the visual tracking display. Thus, a 
signal in the left earphone indicated that 
S should move his control to the right. 
While data are lacking on population 
stereotypes in such a stimulus-response 
task, it is probable that a more com- 
patible S-R arrangement would be one in 
which a signal in, say, the left earphone 
indicates a control movement is required 
to the left. It is suggested that both of 
the above possibilities were responsible 
for the inferior performance of Group 
O-D relative to that of Group O. 


SUMMARY 


During training three experimental groups 
received augmented feedback (auditory clicks 
at the rate of two per sec.) when tracking 
accuracy was within (an on-target criterion) 
or outside (an off-target criterion) fixed 
tolerance limits. During transfer, no aug- 
mented feedback was provided. All three 
experimental groups were superior in tracking 
accuracy during training to a control group 
which did not receive augmented feedback. 
Of the three experimental groups, the group 
receiving augmented feedback when off- 
target was superior to a group which ex- 
perienced clicks when on-target. It was 
superior also to a group which received clicks 
when off-target but differentially according 
to the direction of tracking error. 

During transfer both off-target groups 
remained superior in tracking accuracy to the 
control group, but the on-target group and the 
control group attained comparable 
performance. 

It follows that augmented feedback based 
on a simple off-target criterion was the most 
effective training condition. An analysis of 
the data suggested that this superiority was a 
result of the emphasis an off-target criterion 
places on occasional large tracking errors. 
The group trained on this condition ap- 
parently learned to reduce such errors more 
quickly and efficiently than did the on-target 
criterion group. 


REFERENCES 


Bahrick, H. P., Fitts, P. M., & Briggs, 
G. E. Learning curves: Facts or artifacts? 
Psychol. Bull., 1957, 54, 256-268. 

Bilodeau, E. A., & Bilodeau, I. McD. 
Motor-skills learning. Annu. Rev. Psychol., 
1961, 12, 243-280. 

Bray, C. W. Psychology and military 
proficiency. Princeton, N. J.: Princeton 
Univer. Press, 1948. 

Gain, P., & Fitts, P. M. A simplified 
electronic tracking apparatus (SETA). 
USAF WADC tech. Rep., 1959, No. 59-44. 

Payne, R. B., & Hauty, G. T. Effect of 
psychological feedback upon work de- 
crement. J. exp. Psychol., 1955, 50, 343- 
351. 

THORNDIKE, E. L. The law of effect. Amer. 
J. Psychol., 1927, 39, 212-222. 


(Received October 30, 1961) 


Journal of Experimental Psychology 
1962, Vol. 64, No. 5, 526-532 


RESISTANCE TO EXTINCTION AFTER VARYING AMOUNTS 
OF DISCRIMINATIVE OR NONDISCRIMINATIVE 
INSTRUMENTAL TRAINING¹ 

M. R. D'AMATO, DONALD SCHIFF, AND HARRY JAGODA 


New York University 


Resistance to extinction and num- 
ber of reinforced responses (acquisi- 
tion level) have classically been 
thought to be monotonically related. 
Recently, however, doubt concerning 
the monotonicity of the relationship 
has been raised in several quarters 
(e.g., Birch, Ison, & Sperling, 1960; 
Murillo & Capaldi, 1961; Senko, 
Champ, & Capaldi, 1961). Although 
nonmonotonicity between resistance 
to extinction and acquisition level has 
been reported in a rather large 
number of recent studies, many of 
these are of questionable relevance 
because either they made use of some 
type of an intermittent reinforcement 
schedule (e.g., Capaldi, 1957, 1958) 
or they employed a discriminative 
rather than a nondiscriminative task 
(Murillo & Capaldi, 1961), in some 
cases “extinction” actually constitut- 
ing reversal learning (e.g., Senko et al., 
1961). In certain other studies 
reporting nonmonotonicity and not 
subject to either of the preceding 
objections (Lewis & Duncan, 1956, 
1958), one finds other reasons for 
questioning their pertinence for the 
present problem, such as the use of 
human Ss in money payoff situations, 
which bear only the slightest re- 
semblance to the strictly instrumental 
situations in which, with animal Ss, 
the relationship of monotonicity was 
originally established. 

1 This research was supported by Research 
Grant M-2051 from the National Institute 
of Mental Health, National Institutes of 
Health, United States Public Health Service, 


and Grant G-14724 from the National 
Science Foundation. 


Of the several published studies falling 
in the latter category, two have reported 
resistance to extinction and acquisition level 
to be nonmonotonically related. North and 
Stimmel (1960) found that extinction of the 
running response proceeded more rapidly for 
Ss given 90 or 135 rewarded trials in a runway 
than for Ss given 45 reinforced trials. Wilson 
(1958), on the other hand, found no evidence 
of a reduction in resistance to extinction, even 
though he carried (runway) acquisition to 
480 trials. Harris and Nygaard (1961), 
working with thirsty rats in a free operant 
situation, observed the usual monotonic 
relationship between acquisition level and 
resistance to extinction of the bar pressing 
response up to 360 reinforced responses, the 
highest acquisition level they worked with. 
Under rather similar conditions, Margulies 
(1961) reported monotonicity up to 1,000 
trials; and as described in an earlier paper 
(D’Amato & Jagoda, 1962), unpublished 
results from our laboratory suggest persistence 
of the monotonic relationship up to some 
7,000 rewarded bar pressing responses, again 
with thirsty rats as Ss. Finally, in conflict 
with the preceding results, King, Wood, and 
Butcher (1961) recently reported a reduction 
in the resistance to extinction of pigeons 
receiving 600 or 900 reinforced key pecks, 
as compared to Ss permitted 300 rewarded 
responses. 


In summary, the results from 
studies employing a nondiscriminative 
(simple) instrumental response are 
equivocal, though they suggest that 
under certain conditions the monotonicity 
assumption holds up to 
several thousands of responses. 

The present study had three ob- 
jectives. First, all of the cited Skinner 
box studies in which monotonicity was 
obtained employed the thirst drive 
and water reward. It would be of 
some value to demonstrate that mono- 
tonicity also holds in the Skinner box 




setting for the hunger drive and food 
reward, particularly since all other 
relevant studies have been conducted 
under the latter motive-incentive 
conditions. Second, we wished to 
carry acquisition training beyond the 
level achieved in previously published 
reports, to a maximum of 1,600 
reinforcements. Third, and most 
important, we wished to assess the 
influence of two different training 
procedures, discriminative versus 
nondiscriminative instrumental training, 
on the shape of the function relating 
acquisition level and resistance to 
extinction. The hypothesis under 
examination was that the traditional 
monotonic relationship would prevail 
for Ss given nondiscriminative training, 
but not for Ss given discrimination 
training, the latter being expected 
to show a significant decline 
in resistance to extinction with prolonged 
acquisition training. 

One plausible rationale for the 
preceding hypothesis is as follows. 
In general, Ss trained on a discrimination 
program are essentially 
on an intermittent reinforcement 
schedule (with respect to the experi- 
mental situation) until the discrimina- 
tion is firmly acquired; beyond this 
point their experience is to all intents 
and purposes one of continuous rein- 
forcement (since they then make few 
errors, or respond little in S^Δ). 
Prolonged discrimination training, 
then, has the effect of carrying Ss 
beyond their intermittent reinforcement 
experience well into the continuous 
reinforcement segment. 
it is assumed that directly following 


venes between the intermittent rein- 
forcement experience and extinction, 
the hypothesis follows. 




METHOD 


Subjects.—The Ss that completed the 
study were 94 experimentally naive albino 
rats (St males and 45 females) S3 to 94 days 
of age at the start of the study. One S was 

because of a programing error and a 
second due to illness. All Ss were bred in our 
laboratory. 

Apparatus.—Two Grason-Stadler two-bar 
Skinner boxes were used: the right bars were 
removed from both boxes, converting them 
into single-bar boxes. A force of approxi- 
mately 20 gm. was required to activate the 
bar microswitches in the two boxes. 

Design.—The experiment was run in two 
replications (of unequal N), but since the 
results of the two replications were quite 
similar in form, replications was not included 
as a factor in the statistical design. 

There were four levels of acquisition, 
defined in terms of the number of reinforced 
responses allowed after pretraining: 200, 400, 
800, and 1,600. At each acquisition level 
there were two separate groups of Ss that 
differed in the type of training received. The 
"I" groups, designated as I(200), I(400), 
I(800), and I(1600), underwent simple (nondiscriminative) 
instrumental training, each 
of the four groups being composed of 12 Ss. 
The "D" groups, D(200), D(400), D(800), 
and D(1600), were trained on a successive 
brightness discrimination problem. There 
were 11 Ss in the first two groups and 12 each 
in the last two groups. 

All Ss of a replication were quasirandomly 
assigned to the eight groups, in part balancing 
litters over groups, and placed on deprivation 
at the same time. The I and D groups of 
a given acquisition level started acquisition 
training together; the scheduling of the 
beginning of acquisition was so arranged for 
the various reinforcement groups that all 
groups entered extinction within a day or 
two of each other. 

Deprivation training.—One week prior to 
the beginning of pretraining, Ss were placed 
on a 22-hr. food deprivation training regimen, 
water being constantly available in the home 
cages. Four days later, the feeding period 
was reduced to 1½ hr. daily and remained at 
that duration throughout the study. 

Skinner box pretraining.—There were 3 pre- 
training days. On Days 1 and 2, 50 rein- 
forcements (45-mg. Noyes rat pellets) were 
given in the conditioning of approach re- 
sponses to the food tray at the sound of the 
feeder magazine. On Day 3, Ss were shaped 
on the bar pressing response with 25 to 50 
reinforcements. The stimulus conditions in 




the boxes during pretraining were the same as 
prevailed during simple instrumental training 
and during the S^D portion of discrimination 
training. 

Discrimination training—All D groups 
were exposed to a simple brightness dis- 
crimination situation. Illumination of the 
left (white jeweled) stimulus light constituted 
S^D and provided an illumination level of 7 to 
10 ft-c, measured with the target of a Weston 
illumination meter placed at bar level and 
2 in. in front of the lens of the stimulus light. 
Measured under the same conditions the 
illumination under S^Δ, which was produced 
by the shielded house light, read about 
0.1 ft-c. 

All S^D periods were 45 sec. in duration and 
alternated with S^Δ periods varying between 
33 and 65 sec. in length. One revolution of 
the programing film tape took about 6 min. 
and provided approximately equal total times 
in S^D and S^Δ. Since 50 reinforced responses 
in S^D were allowed each day, Groups D(200), 
D(400), D(800), and D(1600) required 4, 8, 
16, and 32 training days, respectively. 

Simple instrumental training.—The treat- 
ment of the I groups was exactly the same as 
that accorded the corresponding D groups 
except for the elimination of the S^Δ periods, 
i.e., the left stimulus light was always on. 
As in the D groups, 50 reinforcements were 
permitted on each acquisition day. 

At the end of the daily 50 reinforced re- 
sponses, Ss of all groups entered a time-out 
period and were quickly removed from the 
Skinner boxes to their home cages where, no 
sooner than 15 min. later, they were fed for 
the 1½-hr. period. The estimated average 
number of hours of food deprivation at the 
start of a day's session was 22 hr. 

Extinction—The extinction procedure, 
which began the day following the termination 
of acquisition, was precisely the same for all 
Ss and consisted of one daily 10-min. period 
on each of 5 successive days with the left 
stimulus light illuminated, i.e., in the former 
S^D condition. The duration of the extinction 
periods, 10 min., was judged to be short 
enough to resemble closely the length of the 
acquisition sessions (which overall averaged 
7.15 min.) and yet long enough to sample 
adequately the extinction process. It should 
be pointed out, however, that because of the 
absence of S^Δ periods, the extinction sessions 
bore a greater similarity to the acquisition 

sessions of simple instrumental training than 
to those of discrimination training. The 
exclusion from extinction of S^Δ periods (more 
correctly, periods in which the former S^Δ 




stimulus was present) was based on two 
considerations: (a) We wished to make direct 
comparisons of the extinction performance of 
corresponding I and D groups, which would be 
feasible only if the extinction sessions were 
identical for all Ss. (b) Inclusion of such 
periods would have raised problems concerning 
the treatment of responses made in S^Δ, 
since the number of such responses most 
probably would be related to the acquisition 
level variable. 

It should be recorded that the cue provided 
by activation of the feeder magazine was 
maintained during extinction. Deprivation 
level at the start of extinction sessions was 
confined within the limits of 21 to 23 hr. 
During the course of the experiment the 
relative humidity ranged between 58% and 
70%, and the temperature between 70° 
and 78° F. 


RESULTS 


Discrimination acquisition. —A dis- 
crimination ratio (DR), obtained by 
dividing the number of responses 
made in S^Δ by the number performed 
in S^D, was calculated for each S after 
every daily training session. In a plot 
of the daily means of the DRs, Group 
D(1600) showed some improvement 
over the level of discrimination finally 
achieved by Group D(800). To 
evaluate this difference the mean DR 
over the last 8 acquisition days was 
calculated for every S of Group 
D (1600), as was the mean DR over 
the last two sessions for each S of 
Group D(800). The group means 
based on these measures (.141 and 
.218, respectively) did not, however, 
differ significantly (t = 1.67, df = 22, 
P [value illegible]). 

Extinction in the I groups.—The 
number of responses made by each S 
in each of the five daily extinction 
sessions was converted to common 
logs, providing the basis for all 
statistical analyses. An overall index 
of resistance to extinction was obtained 
for each S by summing its 
five daily log scores. 
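
The two derived scores used in this section, the discrimination ratio and the summed log extinction score, can be sketched as follows. The function names are hypothetical, the example counts are invented for illustration, and the handling of a session with zero responses is an assumption, since the text does not say how such cases were treated.

```python
import math


def discrimination_ratio(responses_s_delta, responses_s_d):
    """DR: responses made in S-delta divided by responses made in S-D."""
    if responses_s_d <= 0:
        raise ValueError("responses in S-D must be positive")
    return responses_s_delta / responses_s_d


def resistance_index(daily_responses):
    """Sum of common (base-10) logs of the daily extinction response counts.

    A zero count has no logarithm; this sketch simply rejects it
    (an assumption, not the authors' stated procedure).
    """
    if any(r <= 0 for r in daily_responses):
        raise ValueError("every daily count must be positive")
    return sum(math.log10(r) for r in daily_responses)


# Invented counts for one S over five extinction sessions.
print(discrimination_ratio(12, 60))
print(resistance_index([100, 10, 10, 1, 1]))
```

A lower DR indicates better discrimination, and summing logs keeps a single very long first session from dominating the overall index.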

The first question of interest concerns 
the relationship observed be- 




Fig. 1. Relationship between acquisition level (number of reinforcements) and resistance to extinction in the simple instrumental (I) and the discrimination (D) groups. (Ordinate: mean summed log score; abscissa: number of reinforcements.) 


tween resistance to extinction and the 
number of reinforced responses al- 
lowed during acquisition. The solid 
line in Fig. 1, based on the means of 
the summed log scores, presents this 
relationship. It is plain from the 
figure that there is no tendency for 
resistance to extinction to decrease 
with increasing training, even when 
acquisition is carried to 1,600 rein- 
forced responses; in fact, the solid line 
suggests an increasing trend. 

The most powerful (and specific) 
way of analyzing the present data is 
by a trend analysis of the linear and 
quadratic components of the curve 
(Grant, 1956). The presence of a 
significant quadratic component (with 
negative sign) alone or in accompani- 
ment with a significant negative 
linear component would constitute 
evidence for a nonmonotonic relation- 
ship between acquisition level and 
resistance to extinction. A significant 
negative linear component in con- 
junction with an insignificant quad- 
ratic would, essentially, support the 
same interpretation. On the other
hand, an insignificant quadratic component
would support the assumption
of monotonicity if the linear component
were either insignificant or
positive in sign. The latter would
indicate that the curve was still rising,
while the occurrence of insignificance
in both the linear and quadratic components
would signify that the function
was asymptotic.

In order to take into consideration 
the unequal spacing of the independ- 
ent variable, the coefficients of the 
orthogonal polynomials were calcu- 
lated in the manner suggested by 
Grandage (1958). Analysis of vari- 
ance showed both the linear and the 
quadratic components of the trend to 
be insignificant (F = 1.29 and 0.59, 
respectively, df = 1/44). Thus, for 
the numbers of rewarded responses 
employed in this study, the function 
relating resistance to extinction and 
acquisition level in the nondiscrimina- 
tively trained Ss was essentially 
asymptotic. 
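How contrast coefficients can be obtained for unequally spaced levels may be illustrated with a short sketch. It is not Grandage's (1958) computing routine. It uses Gram-Schmidt orthogonalization, which should give the same linear and quadratic contrasts up to a scale factor when the groups are of equal size. The function name is hypothetical, and equal Ns are assumed.

```python
from fractions import Fraction


def orthogonal_contrasts(levels, degree=2):
    """Polynomial contrast coefficients (degrees 1..degree) for the given,
    possibly unequally spaced, levels. Assumes equal group sizes."""
    xs = [Fraction(x) for x in levels]
    basis = [[Fraction(1)] * len(xs)]            # constant term
    for d in range(1, degree + 1):
        vec = [x ** d for x in xs]
        for b in basis:                          # remove lower-order components
            proj = sum(v * w for v, w in zip(vec, b)) / sum(w * w for w in b)
            vec = [v - proj * w for v, w in zip(vec, b)]
        basis.append(vec)
    return basis[1:]                             # drop the constant term


# The four acquisition levels used in the present study.
linear, quadratic = orthogonal_contrasts([200, 400, 800, 1600])
```

Exact fractions are used so that the orthogonality of the coefficients can be checked without rounding error. Each set of coefficients may be rescaled freely, since the F test for a contrast does not depend on its scale.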

Despite the absence of significant 
differences in the summed log ex- 
tinction scores, one would like to 
know whether extinction in the four I 
groups followed a parallel develop- 
ment over the five extinction sessions. 
Figure 2 presents the extinction curves
of the four I groups over the 5
extinction days. A trend analysis
(Edwards, 1960) was applied to the
data and the differences among the
groups' linear components evaluated.
The appropriate F (2.38, df = 3/176)
fell short of accepted significance
levels (P = .07). The curves of Fig. 2
suggest that more impressive differences
among the linear components of
the groups' trends, as well as among
their overall extinction scores, might
have been obtained if extinction had
been carried one or two sessions
further. At any rate, the data of the
figure plainly show that Group I(1600)
is not inferior to any of the other I
groups.

FIG. 2. Extinction curves of the four
simple instrumental groups, based on the log
of the number of daily extinction responses.
(Abscissa: extinction days.)

Extinction in the D groups.—The 
first concern is again with the relation- 
ship between acquisition level and 
resistance to extinction, which, as 
may be seen from Fig. 1, is vastly 
different from that obtained with the 
I groups. A trend analysis of the
curve revealed a significant linear
component (F = 4.13, df = 1/42,
P < .05), as well as a significant
quadratic component (F = 6.31,
df = 1/42, P < .025). Because of
the unequal Ns in the four discrimination
groups, the trend analysis was
based on group means, rather than
sums, in the manner suggested by
Walker and Lev (1953).

Figure 3 shows that the inferiority 
of Group D(1600) relative to Groups 
D (400) and D (800) is present through- 
out extinction, though it appears most 
marked on the last 2 extinction days. 
Once again the differences among the 
groups’ linear trends were evaluated 
and once again the resulting F (2.51, 
df = 3/168) was quite close to ac- 
cepted significance levels (P = .06), 
suggesting differences among the 
slopes of the extinction curves. 

Thus, in contrast to the I groups,
the discriminatively trained Ss, after
an initial increase, reveal a sharp
reduction in resistance to extinction
as a function of increasing acquisition
training. Further, the nonmonotonicity
appears in the early stages of
extinction, tending to be somewhat
more marked during the latter extinction
days.

FIG. 3. Extinction curves of the four
discrimination groups, based on the log of the
number of daily extinction responses.
(Abscissa: extinction days.)

Comparison of extinction in the I and 
D groups.—If it is in fact true that
discrimination training, at least in its 
early phases, provides an intermittent 
reinforcement experience, then resist- 
ance to extinction should be greater 
in the combined D groups than in the 
I groups. Analysis of the differences 
between the means of the summed log 
scores of the combined four D groups 
(7.43) and the four I groups (6.96) 
revealed that discrimination training 
did indeed lead to significantly greater 
resistance to extinction (t = 2.53, 
df = 86, P < .02). 

It will be observed in Fig. 1 that a
reversal in the extinction curves of the
I and D groups occurs in the 1600
groups; the reversal is, however, far
from significant (t = 1.25, P > .20).
Finally, tests of differences between
the means of the I and D groups
receiving 200, 400, and 800 reinforcements
yielded, in the same order, the
following results: t = 1.45, P > .10;
t = 2.25, P < .05; t = 2.63, P < .02.


Discussion 


In general, our results support the hy- 
pothesis that the presence or absence of 
monotonicity in the function relating 
acquisition level to resistance to extinc- 
tion depends importantly on the type 
of learned response under consideration. 
In agreement with earlier reports (Harris 
& Nygaard, 1961; Margulies, 1961) 
monotonicity was found with a non- 
discriminative (free operant) response; 
with a discriminative response, on the 
other hand, a marked nonmonotonicity 
between acquisition level and resistance 
to extinction was observed. While the 
present results do little to explain the 
nonmonotonicity obtained in the pre- 
sumably nondiscriminative situations of 
North and Stimmel (1960) and King, 
Wood, and Butcher (1961), they perhaps 
shed some light on other, related, studies 
in which a strong discriminative com- 
ponent was present (e.g., Murillo & 
Capaldi, 1961; Senko et al., 1961). Con- 
ceivably, they also possess some relev- 
ance for the overlearning reversal effect 
(the faster reversal learning of Ss 
receiving extensive overtraining), inas- 
much as that phenomenon has been 
attributed by some to a nonmonotonic 
relationship between acquisition level 
and resistance to extinction of the ap- 
proach response to the originally positive 
stimulus (e.g., Birch et al., 1960). 

Turning now to a possible mechanism 
by which the nonmonotonicity of the 
discrimination groups might be ex- 
plained, two separate factors seem to be 
involved. First, it probably is safe to 
assume that the intermittent reinforce- 
ment experience unavoidably associated 
with the early phases of discrimination 
training has the effect of augmenting 
resistance to extinction in Ss receiving 
moderate amounts of discrimination 
training. This assumption is supported 
in the present study by the superior 
resistance to extinction of the combined 
discrimination groups, with further verification
coming from an earlier study
by Jenkins (1961a), who worked with
pigeons in a discrete trials situation.

Second, it is still necessary to explain 
how resistance to extinction becomes 
depressed with overtraining, and two 
possibilities suggest themselves. The 
first one is that mentioned earlier, 
namely, that overtraining has the effect 
of carrying Ss beyond their intermittent 
reinforcement experience well into a 
region of virtual continuous reinforce- 
ment. However, there are the following 
difficulties with this possibility. (a) Sev- 
eral studies (e.g., Jenkins, 1961b; Theios, 
1962) have failed to demonstrate a 
reduction in resistance to extinction as a 
result of interpolating a continuous rein- 
forcement segment between a partial 
reinforcement experience and subsequent 
extinction. (b) There is evidence that 
nonmonotonicity between acquisition 
level and resistance to extinction can 
occur in situations where responding to 
the negative stimulus is under the con- 
trol of E rather than S, as in successive 
discrimination training on a straight- 
away (Birch et al., 1960). (c) Finally, 
the nature of the present argument is 
such that, in principle, no amount of 
discrimination training could reduce 
resistance to extinction below that of 
a comparable nondiscriminative group. 
The results obtained with the 1,600 
groups of the present study suggest that 
such a reversal is a distinct possibility 
if acquisition were carried somewhat 
further. 

An alternative interpretation of the 
basis of the nonmonotonicity maintains 
that the discrimination experience is 
vital completely apart from attending 
changes in the effective reinforcement 
schedules (cf. Murillo & Capaldi, 
1961, who found nonmonotonicity 
only in those Ss that had learned the 
discrimination presented them during 
acquisition). This is a position, however,
that is difficult to specify with any degree 
of precision. Nevertheless, it is sug- 
gestive that in Group D(1600) goodness 
of discrimination (the reciprocal of the 
mean of the DRs over the last 10 
acquisition days) was negatively correlated
with resistance to extinction
(summed log R), with rho equal to -.53
(.05 < P < .10).
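For readers who wish to compute a rank-order coefficient of this kind, a minimal sketch follows. It computes rho as the product-moment correlation of the ranks, with tied values given their average rank. The function names are hypothetical, and how the original analysis handled ties is not stated, so that detail is an assumption.

```python
def average_ranks(values):
    """1-based ranks of the values, with ties given their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def spearman_rho(x, y):
    """Rank-order correlation: product-moment correlation of the ranks."""
    rx, ry = average_ranks(x), average_ranks(y)
    n = len(rx)
    mx, my = sum(rx) / n, sum(ry) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    var_x = sum((a - mx) ** 2 for a in rx)
    var_y = sum((b - my) ** 2 for b in ry)
    return cov / (var_x * var_y) ** 0.5
```

In the present application x would be the reciprocal of each S's mean DR and y the same S's summed log R.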


SUMMARY 


This experiment investigated the hy- 
pothesis that acquisition level and resistance 
to extinction would be monotonically related 
for a simple (nondiscriminative) instrumental 
response (bar pressing in a Skinner box), but 
the function would be nonmonotonic for a 
comparable discriminative response (successive 
brightness discrimination). Four groups of 
Ss were trained on the simple instrumental 
response and allowed 200, 400, 800, or 1,600 
reinforced responses, 50 per day. The same 
numbers of reinforced responses were given 
to four corresponding groups of discrimina- 
tively trained Ss, the procedure employed 
with the latter differing only in the insertion 
of occasional S* periods. All groups were 
exposed to one 10-min. extinction period (in 
the former S*) on each of 5 successive days. 
Trend analyses of the data supported the 
initiating hypothesis; and as expected, the 
discriminatively trained Ss were, as a group, 
more resistant to extinction than the Ss 
trained on the simple instrumental response.


REFERENCES 


Birch, D., Ison, J. R., & Sperling, S. E.
Reversal learning under single stimulus
presentation. J. exp. Psychol., 1960, 60,

Capaldi, E. J. The effects of different
amounts of alternating partial reinforcement
on resistance to extinction. Amer. J.
Psychol., 1957, 70, 451-452.

Capaldi, E. J. The effect of different
amounts of training on the resistance to
extinction of different patterns of partially
reinforced responses. J. comp. physiol.
Psychol., 1958, 51, 367-371.

D'Amato, M. R., & Jagoda, H. Overlearning
and position reversal. J. exp.
Psychol., 1962, 64, 117-122.

Edwards, A. L. Experimental design in
psychological research. (Rev. ed.) New
York: Holt, Rinehart, & Winston, 1960.

Grandage, A. Orthogonal coefficients for
unequal intervals. Biometrics, 1958, 14,
287-289.

Grant, D. A. Analysis-of-variance tests in
the analysis and comparison of curves.
Psychol. Bull., 1956, 53, 141-154.

Harris, P., & Nygaard, J. E. Resistance to
extinction and number of reinforcements.
Psychol. Rep., 1961, 8, 233-234.

Jenkins, H. M. The effect of discrimination
training on extinction. J. exp. Psychol.,
1961, 61, 111-121. (a)

Jenkins, H. M. Resistance to extinction
when partial is followed by regular reinforcement.
Paper read at Psychonomic
Society, New York, 1961. (b)

King, R. A., Wood, P., & Butcher,
Decreased resistance to extinction as a
function of reinforcement. Amer. Psychologist,
1961, 16, 468. (Abstract)

Lewis, D. J., & Duncan, C. P. The effect of
partial reinforcement and length of acquisition-series
upon resistance to extinction of a
motor and a verbal response. Amer. J.
Psychol., 1956, 69, 644-646.

Lewis, D. J., & Duncan, C. P. Expectation
and resistance to extinction of a lever-pulling
response as a function of percentage
of reinforcement and number of acquisition
trials. J. exp. Psychol., 1958, 55, 121-128.

Margulies, S. Response duration in operant
level, regular reinforcement, and extinction.
J. exp. Anal. Behav., 1961, 4, 317-321.

Murillo, N. R., & Capaldi, E. J. The role
of overlearning trials in determining
resistance to extinction. J. exp. Psychol.,
1961, 61, 345-349.

North, A. J., & Stimmel, D. T. Extinction
of an instrumental response following a
large number of reinforcements. Psychol.
Rep., 1960, 6, 227-234.

Senko, M. G., Champ, R. A., & Capaldi,
E. J. Supplementary report: Resistance to
extinction of a verbal response as a function
of the number of acquisition trials. J. exp.
Psychol., 1961, 61, 350.

Theios, J. The partial reinforcement effect
sustained through blocks of continuous
reinforcement. J. exp. Psychol., 1962, 64,
1-6.

Walker, H. M., & Lev, J. Statistical inference.
New York: Holt, 1953.

Wilson, J. J. Level of training and goal-
movements as parameters of the intermittent
reinforcement effect. Unpublished
doctoral dissertation, New York University,
1958.


(Received November 14, 1961) 




T MAZE REVERSAL LEARNING AFTER SEVERAL 
DIFFERENT OVERTRAINING PROCEDURES 1
WINFRED F. HILL, NORMAN E. SPEAR
Northwestern University
AND KEITH N. CLAYTON
Vanderbilt University 


Several investigators (Brookshire, 
Warren, & Ball, 1961; Capaldi & 
Stevenson, 1957; North & Clayton, 
1959; Pubols, 1956; Reid, 1953) have 
demonstrated that overlearning of a 
discrimination facilitates its subse- 
quent reversal. This effect at first 
seems paradoxical, since it appears to 
imply that increasing the number of 
reinforced trials to one cue weakens 
the tendency to respond to that cue. 

Several mechanisms have been sug- 
gested which might explain this over- 
learning-reversal effect (ORE). (a) 
The additional practice in making 
choices during the overtraining may 
facilitate subsequent reversals. This 
might be mediated by acquired ob- 
serving responses, as suggested by 
Reid (1953) and Pubols (1956). 
(b) The long series of rewards may 
make a change more discriminable 
and hence make reversal learning 
faster, a suggestion made by Capaldi 
and Stevenson (1957). (c) The
greater number of rewards, by building
up stronger rG's, may result in
greater frustration and hence greater
disruption when reward ceases to
follow the accustomed response. This
is the explanation offered by North
and Stimmel (1960) for their finding
that overlearning facilitated extinction
in a straight alley. It is also
consistent with Birch, Ison, and
Sperling's (1960) finding that with
single stimulus presentation the
overlearning-reversal effect is mainly
attributable to the rate of extinction
of the old response. (d) The long
series of trials on which nearly every
response is rewarded may reduce,
through any of various mechanisms,
the tendency to avoid the incorrect
cues, thus making it easier to approach
those cues when they become correct.
This explanation is suggested by
D'Amato and Jagoda (1961). (e) So
much stimulus satiation may be built
up to the correct cues that there is a
tendency to avoid these cues as soon
as they cease to be associated with
reward. Though this has not been
suggested as an explanation for the
ORE, it seems consistent with research
summarized by Glanzer (1958).

1 This research was supported by Grant
G-8706 from the National Science Foundation
and was conducted at Northwestern University.

The present study attempted to 
provide evidence concerning the rela- 
tive importance of these various 
mechanisms for the ORE. Four 
groups were compared. Two were the 
groups found in any study of the 
ORE: Group N reversed as soon as 
the discrimination was learned, Group 
Fr given overtraining. The other two
groups were also given overtraining, 
but with special features. Group Co 
was forced to the correct side on all 
its overtraining trials, thus being 
deprived of the opportunity for mak- 
ing choices. Group In was given 
twice as many overtraining trials as 
Groups Fr and Co, with half forced
to the correct and half to the incorrect
side. This group was thus
given more experience with the in- 
correct side than any of the others. 
It was predicted that Group Fr would 
reverse fastest and either Group N 
or In slowest, with the exact positions 
of the groups throwing light on the 
various explanations of the ORE. 


EXPERIMENT I
Method 


Subjects and apparatus.—The Ss were 64 
experimentally naive female albino rats of the 
Sprague-Dawley strain, 74 to 75 days old 
at the beginning of experimental training. 
One S was discarded because of apparatus
failure and was replaced by another S of the 
same description. The apparatus was the 
narrow T maze described by Cotton, Lewis, 
and Metzger (1958). This is an enclosed T 
maze with a 4-ft. stem and 9-in. arms (exclud- 
ing goal box). The wooden doors beyond the 
choice-point, utilized to prevent retracing, 
were also used for forcing purposes in this 
experiment. 

Prehandling.—Each S received six daily
3-min. sessions of prehandling, the last one
48 hr. prior to the beginning of experimental
training. During each session S was allowed
to explore a large unpainted wooden box,
presented with four of the pellets later to
serve as reward, and picked up and replaced
at least five times by E. Pellets uneaten
were returned with S to the home cage along
with a fifth pellet. A once-daily feeding
schedule began on the first day of prehandling
and was maintained throughout experimental
training. The ration was 10 gm. of finely
ground Purina lab chow and was presented
30-35 min. after the start of prehandling or
experimental training.

Experimental training—On each trial, 
after placement into the start box, the door 
was opened as soon as S was oriented toward 
it. Located just outside the start box was a 
treadle which, when S stepped on it, started
two 1/100 sec. clocks. The first clock stopped
automatically when S stepped on a treadle
located just before the choice-point. The
second clock stopped when S stepped on a
treadle just beyond the choice-point in either
arm of the maze. The side designated as
correct for a particular S was baited with
two 45-mg. Noyes pellets presented in a
slightly bent tin lid. No lid was present on
the nonrewarded side. For half the Ss, the
right side was designated correct; for the
others the left was correct. The noncorrection
procedure was used throughout, and the
duration of goal-box confinement was approximately
10 sec. regardless of choice.
Between trials S was kept for 15 sec. in a
carrying cage before being placed in the start
box for the next trial.

General design—There were three stages 
to the experimental training—acquisition, 
overtraining, and reversal. The experimental 
variable was manipulated only in the over- 
training stage. However, prior to the 
beginning of acquisition, Ss were randomly
assigned to four experimental groups of 16 Ss
each. The three stages of training required a
total of 13 days for all groups.

Acquisition—Twelve free trials were given 
on each of 3 consecutive days. Groups Fr, 
Co, and In received acquisition on Days 1, 2, 
and 3 of acquisition. Group N, however, 
received acquisition on Days 1, 2, and 11.

Overtraining—On Days 3-6 and 8-10 
Group N was treated exactly as it had been 
during prehandling and was given no T maze 
experience during this time. It thus served 
as a no-overtraining control group. Group Fr
was given 15 free trials on each of Days 4-6 
and 8-10 and 12 free trials on Day 11, making 
a total of 102 free overtraining trials. Group 
Co was given 15 trials, all forced to the correct 
side, on each of Days 4-6 and 8-10, and 12 
trials, also forced correct, on Day 11. This 
made a total of 102 forced-correct overtraining 
trials. Group In was given 30 trials on each 
of Days 4-6 and 8-10 and 24 trials on Day 11. 
All of these 204 trials were forced, half to the 
correct side and half to the incorrect. The 
distribution of correct and incorrect forced 
trials was determined randomly, and was 
different for each S. On Day 7, all Ss in all 
groups were fed on schedule but were not 
handled. 
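The randomized forced-trial procedure for Group In can be sketched as follows. The text states only that half of the 204 trials were forced to each side and that the order was random and different for each S. The assumption made here, that the two kinds of trial were also balanced within each day, is ours, as are the function name and the seed.

```python
import random


def forced_trial_schedule(n_trials, rng):
    """Random order of forced trials, half 'correct' and half 'incorrect'."""
    if n_trials % 2:
        raise ValueError("n_trials must be even to split in half")
    trials = ["correct"] * (n_trials // 2) + ["incorrect"] * (n_trials // 2)
    rng.shuffle(trials)
    return trials


rng = random.Random(0)              # hypothetical seed, one generator per S
# Group In, Exp. I: 30 trials on each of six days, then 24 trials on Day 11.
daily_counts = [30] * 6 + [24]
schedule = [forced_trial_schedule(n, rng) for n in daily_counts]
total_trials = sum(len(day) for day in schedule)
forced_correct = sum(day.count("correct") for day in schedule)
```

With these daily counts the sketch reproduces the reported total of 204 forced trials, of which 102 are to the correct side, the same number that Groups Fr and Co received in all.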

Reversal.—All Ss received 15 free trials on 
each of Days 12 and 13. During this stage, 
reward was placed in the goal box opposite to 
that which had originally held the reward 
during acquisition and overtraining, so that 
the formerly correct goal box became
incorrect and vice versa.


Results 


Acquisition.—The mean proportion
of correct choices on the last five trials
was .93. An analysis of variance on
the latter half of acquisition revealed
no significant differences among the
groups (F = 1.87). In Group N,
there was no evidence of forgetting
over the interval between the second
and third days of training.

Overtraining.—Group Fr made a
total of 3.3% incorrect responses
during overtraining. The median
number of correct responses after the
last error was 54.


TABLE 1

TOTAL CORRECT CHOICES IN REVERSAL

Group | Experiment I: Mean, SD | Experiment II: Mean, SD | Experiment III: Mean, SD
(Only fragments of the entries survive: 7.40; 16.69, 5.42; 9.07, 9.13.)


Reversal.—The course of reversal 
learning for the four groups is shown 
in Fig. 1. The mean number of cor- 
rect choices on the 30 reversal trials is 
shown for the four groups in Table 1. 
The overall F for the four groups was 
37.01, significant at the .001 level. 
Adjacent groups were compared with
t tests. Groups N and Co did not
differ significantly (t = .57). However,
Group Co was superior to Group
Fr at the .01 level (t = 2.96), and
Group Fr in turn was superior to
Group In at the .001 level (t = 6.02).

The mean number of incorrect
responses before the first correct
reversal response is shown for the four
groups in Table 2. All differences
were significant at the .01 level except
that between Groups N and Co,
which did not approach significance.

In view of the failure to find an
overlearning-reversal effect in terms
of total correct responses in reversal,
the reversal data were also analyzed
in terms of a criterion of 18 correct
choices in 20 successive trials, with not
more than one error in the last 10
of the 20. This is the same as Pubols'
(1956) criterion except that his
20-trial units always involved 2 complete
days of running, whereas ours could
begin on any trial. Comparison with
Pubols' second experiment is most
appropriate since it was closest to ours
in the kind of discrimination involved.
The numbers of Ss reaching this
criterion in the four groups were 10,
7, 13, and 3, respectively. These
values yield a χ² of 13.7, significant
at the .01 level for 3 df.

Speeds.—Speeds in feet per second
on the first five trials of reversal are
shown in Table 3. Stem speeds are
from the starting treadle to the
treadle just before the choice-point;
total speeds are from the starting
treadle to one of the two treadles just
beyond the choice-point. For stem
speeds there were no significant differences
among the groups. The overall
F was 1.29, and the t ratio between the
two extreme groups (1 and 2) was
1.79. For total speeds, however, the
overall F was 4.16, significant at the
.01 level. Of the differences in total
speed among adjacent groups, only
that between Groups Fr and In
reached the .05 level of significance
by t test.


TABLE 2

NUMBER OF ERRORS BEFORE FIRST CORRECT
RESPONSE IN REVERSAL

Group | Experiment I  | Experiment II | Experiment III
      | Mean  | SD    | Mean  | SD    | Mean  | SD
N     | (entries partly illegible; surviving values: ?.44, 5.75, 5.08, 3.35)
Fr    | (illegible)   | (illegible)   | 11.93 | 8.50
Co    | 3.81  | 3.02  | 5.25  | 3.77  |       |
In    | 12.44 | 5.45  | 16.19 | 8.90  |       |


TABLE 3

MEAN SPEEDS IN FEET PER SECOND ON FIRST
FIVE REVERSAL TRIALS

Group | Experiment I  | Experiment II | Experiment III
      | Stem  | Total | Stem  | Total | Stem  | Total
N     | 4.01  | 2.70  | 3.87  | 2.89  | 3.91  | 2.86
Fr    | 4.42  | 3.50  | 4.36  | 3.53  | 4.20  | 3.23
Co    | 4.11  | 3.03  | 4.45  | 3.48  |       |
In    | 4.07  | 3.10  | 3.44  | 2.78  |       |
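The reversal criterion described above (18 correct choices in 20 successive trials, with not more than one error in the last 10, the block being free to begin on any trial) can be stated as a sliding-window check. The sketch below is an illustration only. The function and parameter names are hypothetical, and returning the trial on which the criterion is completed is our reading of "reaching criterion."

```python
def trials_to_criterion(choices, window=20, min_correct=18,
                        tail=10, max_tail_errors=1):
    """Return the 1-based trial on which the criterion is first completed,
    or None if it is never met. `choices` holds True for a correct choice."""
    for end in range(window, len(choices) + 1):
        block = choices[end - window:end]        # 20 successive trials
        tail_errors = sum(1 for c in block[-tail:] if not c)
        if sum(block) >= min_correct and tail_errors <= max_tail_errors:
            return end
    return None


# Hypothetical S: five perseverative errors, then 25 correct choices.
example = [False] * 5 + [True] * 25
completed_on = trials_to_criterion(example)
```

Because the window may start on any trial, an S that errs only on its first few reversal trials can still meet the criterion within the 30 trials given.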


EXPERIMENT II 


The failure to find an ORE in Exp. I 
brought the purpose of the study into 
question. An attempt was therefore 
made to change the conditions so as 
to replicate the earlier findings of an 
ORE. In order to bring Exp. II 
closer to these earlier studies, the 
number of overlearning trials was 
increased, the intertrial interval was 
made longer, and various other details 
were changed. 


Method 


Subjects and apparatus.—The Ss were 64 
experimentally naive female albino rats of the 
Sprague-Dawley strain, 74 to 78 days old 
at the beginning of experimental training, 
divided into four groups of 16 as in Exp. I. 
Two Ss were discarded because of apparatus 
failure and 5 were discarded because on three 
consecutive trials they failed to trip the first 
treadle within the 2-min. time limit or failed 
to trip one of the second treadles within the 
3-min. limit. (On such trials, S was placed
in a randomly selected goal box on the first
occasion and subsequent odd-numbered occasions,
and in the alternative goal box on all
even-numbered occasions. The time score
was recorded as 180 sec. and the goal box
into which S was placed was recorded as
chosen.) These discarded Ss were replaced
by Ss of the same description. The ap- 
paratus was the same as that used in Exp. I. 

Prehandling.—Prehandling was the same 
as in Exp. I, but the feeding schedule was 
changed slightly. The once-daily ration of 
10 gm. was presented 2 hr. after the start of 
prehandling or experimental training, since 
the altered procedure increased the total time 
of experimental training each day from 15-25 
min. to 90-110 min. 

Experimental training.—Except for a 
change in intertrial interval and magnitude of 
reward, all details were the same as in Exp. I. 
In Exp. II, 3-4 min. elapsed between the
placement of S in its carrying cage following
a trial and its removal at the beginning of the
subsequent trial. In addition, only one 45-mg.
pellet was present in the proper goal
box on each trial. These changes applied to
acquisition, overtraining, and reversal.

Acquisition.—Fifteen free trials were given 
on each of 2 consecutive days. Groups Fr, 
Co, and In all received acquisition on Days 1 
and 2 of experimental training. Group N 
was prehandled (in the same manner as 
before) on Days 1-9 and then given acquisi- 
tion on Days 10-11. 

Overtraining.—Group Fr was given 20 free 
trials on each of Days 3-6 and 8-9 and 15 
free trials on each of Days 10-11, for a total 
of 150 free overtraining trials. Group Co 
was given 20 trials, all forced to the correct 
side, on each of Days 3-6 and 8-9 and 15 
trials, also forced correct, on Days 10-11, for 
a total of 150 forced correct trials. Group In 
was given 40 trials on each of Days 3-6 and 
8-9 and 30 trials on Days 10-11. Half of 
these 300 trials were forced correct and half 
forced incorrect. 
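The overtraining totals stated above follow directly from the daily schedule, as the short check below shows. The helper name is hypothetical, and "30 trials on Days 10-11" is read as 30 trials on each of those two days.

```python
def total_trials(schedule):
    """Sum trials over (number_of_days, trials_per_day) pairs."""
    return sum(days * per_day for days, per_day in schedule)


# Days 3-6 and 8-9 give six days; Days 10-11 give two more.
groups_fr_and_co = [(6, 20), (2, 15)]
group_in = [(6, 40), (2, 30)]

fr_co_total = total_trials(groups_fr_and_co)
in_total = total_trials(group_in)
```

Group In thus received twice the trials of Groups Fr and Co, so that its forced-correct trials equal theirs in number.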


Reversal.—On Days 12 and 13 all Ss were 
given reversal training as in Exp. I. 


Results 


Acquisition.—The mean proportion
of correct choices on the last five
trials of acquisition was .93. An
analysis of variance of the last half
of acquisition revealed no significant
difference among the groups (F < 1).
To check comparability with one
previous study using spatial discrimination,
it was determined for each S
on what trial, if at all, it reached the
criterion defined in Exp. I. For the
four groups combined, 35 of the 64 Ss
reached this criterion, the median for
completion of the criterion being
Trial 29. This means that by Pubols'
definition the median S received one
trial of overtraining during acquisition.

Overtraining.—Group Fr made a
total of 2.7% incorrect choices
during overtraining. The median 
number of correct responses after the 
last error was 100. 

Reversal.—The course of reversal
learning for the four groups is shown
in Fig. 1, and the mean number of
correct choices for all 30 trials is
shown in Table 1. The pattern of
group differences is clearly similar to
that found in Exp. I. Analysis of
variance of the four group means
yielded an F of 22.77, significant at the
.001 level. By t test, Group In was
found to be lower than each of the
others at the .001 level, while Group
Fr was lower than Group Co at the
.05 level. Other differences were not
significant.

FIG. 1. Reversal learning curves in the three experiments.
(Ordinate: percent correct; abscissa: blocks of five trials;
one panel for each experiment.)

The mean number of errors before 
the first reversal is shown in Table 2. 
Group N does not differ significantly 
from Groups Fr and Co, but all other 
differences are significant at least at 
the .05 level. 

The reversal data were also analyzed
according to the same criterion
as in Exp. I. The numbers of Ss
reaching criterion in the four groups
were 8, 7, 14, and 1, respectively,
yielding a χ² of 21.3, significant at the
.001 level for 3 df.
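The χ² just reported can be reproduced from the counts given, on the assumption that it is an ordinary Pearson χ² on the 4 × 2 table of Ss reaching versus not reaching criterion, with 16 Ss per group. The function name is hypothetical.

```python
def chi_square_reached(reached, n_per_group):
    """Pearson chi-square for a groups x (reached, not reached) table
    with equal group sizes. Returns (chi_square, degrees_of_freedom)."""
    k = len(reached)
    total_reached = sum(reached)
    expected_yes = n_per_group * total_reached / (k * n_per_group)
    expected_no = n_per_group - expected_yes
    chi2 = 0.0
    for r in reached:
        chi2 += (r - expected_yes) ** 2 / expected_yes
        chi2 += ((n_per_group - r) - expected_no) ** 2 / expected_no
    return chi2, k - 1


# Counts reaching criterion in the four groups of Exp. II.
chi2, df = chi_square_reached([8, 7, 14, 1], 16)
```

Rounded to one decimal, the result agrees with the value given in the text.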

Speeds.—Speeds on the first five 
reversal trials are shown in Table 3. 
The overall difference among the 
groups is significant at the .01 level 
both for stem speeds (F = 7.27) and 
for total speeds (F = 5.95). In both 
cases the two higher groups differ 
significantly from the two lower, but 
Group Fr does not differ significantly 
from Co or Group N from In. 


EXPERIMENT III 


Since the pattern of results in Exp. 
II did not differ appreciably from that 
in Exp. I, a further attempt was made 
to find an ORE. Since Ss in Exp. 
I and II were younger than those in 
some of the previous studies that 
found an ORE, older Ss were used in 
Exp. III. Only Groups N and Fr 
were included in this experiment. 




Method 


Subjects and apparatus.—The Ss were 27 
experimentally naive female albino rats of the 
Sprague-Dawley strain, 120-121 days of age 
at the beginning of experimental training. 
Three Ss were discarded because on three 
consecutive trials they failed to trip the 
treadles within the specified time limits, 
which were the same as in Exp. II. These 
discarded Ss were replaced by other Ss of the 
same description. 

Procedure and design—The Ss were ran- 
domly assigned to one of two groups which 
were treated exactly like Groups N and Fr 
of Exp. II. There were 13 Ss in Group N and 
14 Ss in Group Fr. 


Results


Acquisition.—Mean proportion of
correct choices on the last five trials
of acquisition was .94 for Group N
and .84 for Group Fr. A t test on the
number of correct responses in the
latter half of acquisition revealed that
Group N was significantly superior
at the .02 level (t = 2.63). No reason
for this difference is apparent. The
median S in the two groups combined
just reached criterion, with no additional
trials.

Overtraining.—Group Fr had a 
mean of 2.0% incorrect choices during 
overtraining. The median number of 
correct responses after the last error 
was 102. 

Reversal.—The course of reversal
learning is shown in Fig. 1. The total
numbers of correct responses in reversal
are given in Table 1. The
superiority of Group N is significant
at the .01 level (t = 2.89). In view
of the superiority of Group N in
acquisition, this might be attributed
to a chance superiority in learning
ability of Ss in the group. However,
the within-groups correlation between
number of correct choices in the latter
half of acquisition and number correct
during reversal is negative and nonsignificant
(r = -.22), which makes
such an interpretation implausible.




Mean errors before the first reversal 
are shown in Table 2. The difference 
between the groups is significant at 
the .01 level. The reversal criterion 
previously described was reached by 6 
Ss in Group N and 3 in Group Fr, 
yielding a nonsignificant χ² of .91. 
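
The reported χ² can be checked from the counts given above (6 of 13 Ss in Group N and 3 of 14 in Group Fr reaching criterion). The sketch below is illustrative only: the function name is hypothetical, and the use of a continuity (Yates) correction is an assumption, since the text does not say which form of the test was used. Under that assumption the computed value agrees with the reported .91.

```python
def yates_chi_square(a, b, c, d):
    """Chi-square with continuity correction for the 2 x 2 table [[a, b], [c, d]].

    Assumption: the Yates-corrected form is used; the source does not state this.
    """
    n = a + b + c + d
    row1, row2 = a + b, c + d
    col1, col2 = a + c, b + d
    # Yates' correction subtracts n/2 from |ad - bc| (never going below zero).
    diff = max(abs(a * d - b * c) - n / 2, 0.0)
    return n * diff ** 2 / (row1 * row2 * col1 * col2)


# Rows: Group N, Group Fr. Columns: reached criterion, did not reach criterion.
chi2 = yates_chi_square(6, 13 - 6, 3, 14 - 3)
print(round(chi2, 2))  # 0.91
```

Without the correction the same table gives a value near 1.85, so the match with .91 is what suggests the corrected form.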

Speeds.—Stem speeds and total 
speeds for the two groups are shown in 
Table 3. As in both previous experi- 
ments, Group Fr was faster than 
Group N on both measures. The 
differences were not significant, how- 
ever (ts = .85 and 1.20, respectively). 


DISCUSSION 


Reversal learning was found to be 
temporarily retarded by free-trial over- 
training, greatly retarded by forced-trial 
overtraining when half the trials were 
to each side, and unaffected by forced- 
trial overtraining when all the trials 
were to the correct side. These effects 
were primarily accounted for by the 
duration of perseveration on the formerly 
correct side in the different groups at the 
beginning of reversal, as may be seen by 
comparing Tables 2 and 3. 

The most noteworthy aspect of these 
findings is the failure to confirm the 
finding of other investigators that over- 
learning facilitates reversal. We seem 
to have ruled out the possibility that 
the discrepancy between our results and 
those of previous Es is due to age of Ss, 
number of overlearning trials, or inter- 
trial interval. Four differences remain 
between our procedure and earlier ones. 

1. We used a constant number of 
acquisition trials before overtraining 
instead of carrying all Ss to a criterion. 
Because of the variability of the Ss, a 
training procedure which gives a con- 
stant number of trials may fail to insure 
that all Ss will master the original task 
or may permit a few to receive a degree 
of overtraining. Either of these 
conditions might facilitate reversal 
performance of Group N and reduce the chance 
of demonstrating an ORE. For this 
reason comparisons were made among 
three subgroups of Groups N and Fr: 




(a) those that failed to reach the acquisi- 
tion criterion, (b) those that had 5 or 
fewer acquisition trials after reaching 
criterion, and (c) those that had 6 to 10 
(the maximum possible) acquisition trials 
beyond criterion. In all three subgroups 
in all three experiments, Group N made 
more correct reversal responses than 
Group Fr. (In view of the small N in 
these subgroups, no statistical tests were 
made.) This suggests that the failure 
to use an acquisition criterion was not 
critical. There might be an interaction, 
however, between this consideration and 
the next one. 

2. We gave a constant number of 
reversal trials rather than carrying each 
S to a reversal criterion. Perhaps Ss in 
Group Fr, once they overcame their 
initial disadvantage, would have 
surpassed Group N in reaching criterion. 
Neither the reversal learning curves nor 
the number of Ss reaching criterion in the 
two groups lend any support to this 
view. As for the possible interaction of 
Differences 1 and 2, of the Ss in all three 
experiments that reached the acquisition 
criterion with five or fewer trials to spare, 
5 in Group N and 4 in Group Fr reached 
the reversal criterion. 

3. We tried to control differences in 
amount of handling between overtrained 
and nonovertrained groups by giving 
Group N extra handling when they were 
not being run. This may have 
controlled away the whole effect. If this 
should be the explanation, it would make 
the overlearning-reversal effect a more 
trivial phenomenon than has been 
suspected. 

4. Our task was a simple spatial 
discrimination with no irrelevant visual 
cues (such as Pubols had) and no 
separation of place and response cues (as 
with Brookshire et al.). If observing 
responses are the crucial factor in the 
overlearning-reversal effect, such a simple 
discrimination may be too easy for such 
responses to be important. This ex- 
planation implies that our task is easier 
than those in which the effect has been 
found. In terms of acquisition rate, this 
appears to be true of most of the other 
studies, though not of Pubols’ Exp. 2. 




If this exception can be explained by the 
extensive pretraining in Pubols’ experi- 
ment leading to faster learning, then the 
simplicity of our discrimination might 
well be the crucial factor in our failure 
to find an overlearning-reversal effect. 
This would be consistent with D’Amato 
and Jagoda’s (1962) failure to find an 
overlearning-reversal effect in a spatial 
discrimination. Their study, of which 
we were unaware when conducting ours, 
is to our knowledge the first published 
failure to find an overlearning-reversal 
effect with high overlearning. The 
situation is complicated, however, by 
an unpublished study of Erlebacher 
(1961). He failed to find the over- 
learning-reversal effect, even though he 
used a brightness discrimination. The 
necessary conditions for obtaining an 
overlearning-reversal effect thus remain 
in doubt. 

Our results and the strikingly similar 
findings of D’Amato and Jagoda seem 
to be clearly incompatible with the first 
three interpretations of discrimination 
reversal listed in the introduction. These 
findings can, however, be reconciled 
with Interpretations 4 and 5, both of 
which are concerned with avoidance of 
the formerly incorrect side. Either of 
these interpretations is consistent with 
the rank order of our three overtraining 
groups. The unexpected superiority of 
Group N in reversal could then be ex- 
plained by their weaker tendency to 
approach the formerly correct side, a 
tendency supported by the generally 
slower speeds of Group N. The similar 
patterns of stem speed and total speed 
suggest that time spent in the stem 
rather than time spent in the choice area 
or arms was the major source of variance 
in both speed measures. 


SUMMARY 


Experiment I attempted to find the cause 
of the overlearning-reversal effect by com- 
paring T maze reversal learning by four 
groups of rats that received different patterns 
of overtraining in acquisition. Reversal was 
fastest for the group receiving no overtraining 
and the group receiving all its overtraining 
trials forced to the correct side. Free-choice 




overtraining gave somewhat slower reversal, 
and overtraining with an equal number of 
forced trials to the two sides gave much slower 
reversal. 

In view of the failure to replicate previous 
findings of faster reversal after overtraining, 
two further experiments were run in an 
attempt to replicate these earlier findings. 
Both experiments gave the same pattern of 
results as Exp. I; no overlearning-reversal 
effect was found. These results appear to be 
consistent with interpretations in terms of 
stimulus satiation or of avoidance of non- 
rewarded cues, but not with interpretations 
in terms of observing responses, discrimi- 
nability, or frustration. 


REFERENCES 


BIRCH, D., ISON, J. R., & SPERLING, S. E. 
Reversal learning under single stimulus 
presentation. J. exp. Psychol., 1960, 60, 
36-40. 

BROOKSHIRE, K. H., WARREN, J. M., & 
BALL, G. G. Reversal and transfer learning 
following overtraining in rat and chicken. 
J. comp. physiol. Psychol., 1961, 54, 98-102. 

CAPALDI, E. J., & STEVENSON, H. W. Response 
reversal following different amounts 
of training. J. comp. physiol. Psychol., 
1957, 50, 195-198. 

COTTON, J. W., LEWIS, D. J., & METZGER, R. 
Running behavior as a function of apparatus 
and of restriction of goal box 
activity. J. comp. physiol. Psychol., 1958, 
51, 336-341. 

D'AMATO, M. R., & JAGODA, H. Analysis 
of the role of overlearning in discrimination 
reversal. J. exp. Psychol., 1961, 61, 45-50. 

D'AMATO, M. R., & JAGODA, H. Overlearning 
and position reversal. J. exp. 
Psychol., 1962, 64, 117-122. 

ERLEBACHER, A. Reversal learning in rats 
as a function of percentage reinforcement 
and degree of overlearning. Unpublished 
doctoral dissertation, University of 
Wisconsin, 1961. 

GLANZER, M. Curiosity, exploratory drive, 
and stimulus satiation. Psychol. Bull., 
1958, 55, 302-315. 

NORTH, A. J., & CLAYTON, K. N. Irrelevant 
stimuli and degree of learning in 
discrimination learning and reversal. Psychol. Rep., 
1959, 5, 405-408. 

NORTH, A. J., & STIMMEL, D. T. Extinction 
of an instrumental response following a 
large number of reinforcements. Psychol. 
Rep., 1960, 6, 227-234. 

PUBOLS, B. H., JR. The facilitation of visual 
and spatial discrimination reversal by 
overlearning. J. comp. physiol. Psychol., 1956, 
49, 243-248. 

REID, L. S. The development of noncontinuity 
behavior through continuity learning. 
J. exp. Psychol., 1953, 46, 107-112. 


(Received November 14, 1961) 


Journal of Experimental Psychology 
1962, Vol. 64, No. 5, 541-544 


MONETARY INCENTIVE AND RANGE OF PAYOFFS 
AS DETERMINERS OF RISK TAKING ¹ 


LEONARD KATZ 


University of Massachusetts 


Myers and associates (Myers & 
Fort, 1961; Myers & Katz, 1962; 
Myers & Sadler, 1960) have been 
concerned with the effects of pa- 
rameters of the payoff distribution 
upon the choice between gambling 
and not gambling. Myers and Sadler 
(1960) varied the number of chips 
which might be won or lost on each 
gamble (range), the average payoff 
being zero. When the alternative to 
gambling was the sure gain of one 
chip, gambling increased with in- 
creases in range; when the alternative 
to gambling was the sure loss of one 
chip, gambling decreased with an 
increase in range. 

In the present experiment chips 
worth no money and chips worth 5¢ 
were used to provide data on effects 
of incentive value. Monetary incen- 
tive and range of payoffs are similar 
to each other in that an increase in 
either increases the risk associated 
with each gamble. The implied hy- 
pothesis is that increased incentive, 
like increased range, may result in 
more gambling, where the alternative 
to gambling is the sure gain of one 
chip, and in less gambling when the 
alternative to gambling is the sure 
loss of one chip. The objective of the 
present study was, therefore, to 
obtain data to test this hypothesis. 
The S's decision to gamble under 
different payoff ranges was followed 
by the loss or gain of chips which had 
no monetary value or was followed by 
the loss or gain of chips worth 5¢. 

¹ This research was supported by National 
Science Foundation Grant G-11380 and 
National Institute of Mental Health Grant 
M-3803, and was part of a study submitted 
in partial fulfillment of the Master of Science 
degree at the University of Massachusetts. 
The author is indebted to Jerome L. Myers, 
who served as thesis adviser. 

In addition to providing data on the 
effects of monetary incentive, the 
present study investigated the effects 
of a payoff range greater than that 
used in the Myers and Sadler (1960) 
study. To control for any range 
effects due to differences among 
ranges in sequences of payoffs, one 
sequence was randomly generated and 
the other two were derived from it in a 
manner described in the procedure 
section. 

METHOD 


Materials.—Four decks of 100 3 X 4 in. 
white cards were prepared. The known 
payoff deck contained 50 cards with +1 
written on them, alternated randomly with 
50 cards with —1. The other decks, of 100 
cards each, provided for three different ranges 
of unknown payoffs. The narrow-range deck 
(N), had integers randomly chosen from +2 
to +6 and from —2 to —6. The medium 
range deck (M), was constructed by adding 
10 to every positive number of (N) and sub- 
tracting 10 from every negative number, 
giving a range of +12 to +16 and —12 to 
— 16, retaining the same ordinal positioning of 
cards as found in Deck N. The wide range 
deck (W), was constructed by adding and 
subtracting 20 to the integers of Deck N. 
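
The derivation of Decks M and W from Deck N can be sketched as follows. This is a minimal illustration, not the author's procedure: the function and variable names are hypothetical, and the way Deck N is sampled here (a random sign and a uniformly chosen magnitude from 2 to 6) is an assumption, since the text says only that the integers were randomly chosen. The sketch does not try to reproduce any further constraints on the payoff sequence.

```python
import random


def build_decks(n_cards=100, seed=0):
    """Build the known deck and the three unknown-payoff decks (illustrative)."""
    rng = random.Random(seed)

    # Known-payoff deck: 50 cards of +1 and 50 of -1 in random order.
    known = [1] * (n_cards // 2) + [-1] * (n_cards // 2)
    rng.shuffle(known)

    # Deck N (assumed sampling): random sign, magnitude uniform on 2..6.
    narrow = [rng.choice([-1, 1]) * rng.randint(2, 6) for _ in range(n_cards)]

    def widen(deck, shift):
        # Add the shift to positive cards and subtract it from negative cards,
        # so the ordinal position and sign of every card are preserved.
        return [x + shift if x > 0 else x - shift for x in deck]

    return {
        "known": known,
        "N": narrow,
        "M": widen(narrow, 10),  # magnitudes 12..16
        "W": widen(narrow, 20),  # magnitudes 22..26
    }


decks = build_decks()
print(decks["N"][:5], decks["M"][:5], decks["W"][:5])
```

Because M and W are built card by card from N, the three decks differ only in the size of each payoff, which is the control for sequence effects described in the introduction.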

Procedure.—On each of 3 successive days S 
was presented with the known payoff deck, 
and a different one of the three unknown 
payoff decks. Order of use of Decks N, M, 
and W was counterbalanced in a 3 X 3 Latin 
square with 6 Ss given each order. Eighteen 
Ss gambled only for poker chips (0¢) and 18 
others gambled for chips worth 5¢ apiece. 
The main features of the experimental design 
are shown on the left-hand side of Table 1. 
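
One way to picture the counterbalancing is a cyclic 3 × 3 Latin square in which each deck appears once in each session position. The sketch below is an assumption for illustration: the text does not say which Latin square was used, and the names and the way Ss are assigned to rows are hypothetical.

```python
def cyclic_latin_square(items):
    """Return a cyclic Latin square: row r is the item list rotated left by r."""
    k = len(items)
    return [[items[(row + col) % k] for col in range(k)] for row in range(k)]


orders = cyclic_latin_square(["N", "M", "W"])
for order in orders:
    print(order)

# 18 Ss per incentive group, 6 per order (hypothetical assignment by index).
assignment = {s: orders[s % 3] for s in range(18)}
```

With three orders and 6 Ss per order, each incentive group of 18 Ss meets every deck equally often on every day.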

The Ss were tested individually, being 
given full instructions at the beginning of 
Session 1. For Sessions 2 and 3 Ss were told 
only that the unknown payoff deck of that 
day was a new one. Two hundred poker 
chips were stacked in front of S, and Ss with 
a monetary incentive (5¢) were told that 
each chip was worth 5¢; i.e., their initial 
stake was worth $10.00. 


TABLE 1 
MEANS OF PROPORTIONS OF GAMBLING RESPONSES ON +1 AND —1 TRIALS 
FOR SUCCESSIVE 25-TRIAL BLOCKS 

                                              Trials 
Incentive  Alternative  Range  1-25         26-50  51-75  76-100  Overall 
0¢         +1           N      .594         .539   .456   .511    .525 
                        M      .620         .551   .494   .604    .567 
                        W      .466         .472   .394   .596    .482 
           —1           N      .903         .910   .874   .911    .899 
                        M      [illegible]  .796   .822   .744    [illegible] 
                        W      .838         .836   .756   .800    [illegible] 
5¢         +1           N      .654         .588   .489   .563    .573 
                        M      .688         .537   .461   .574    .565 
                        W      .658         .574   .483   .689    .601 
           —1           N      .833         .893   .881   .878    .871 
                        M      .796         .790   .815   .761    .790 
                        W      .782         .756   .741   .750    [illegible] 

Briefly, Ss were instructed to turn over 
the top card in the deck of known payoffs 
at the beginning of each trial. They then 
chose between standing pat by accepting the 
gain or loss of one chip represented by the 
card and gambling by drawing the top card 
from the deck of unknown payoffs. If S 
decided not to gamble, the top card in the 
unknown payoff deck was turned anyway, 
showing what he would have won or lost 
had he gambled. 

Ratings.—At the end of each session, Ss 
were given an 11-point scale from —5 to 
+5, along which they rated the means of the 
known payoff deck and of the unknown 
payoff deck used that session. They were also 
asked to describe their gambling strategies 
and changes in strategy. After Session 3, 
Ss were asked what they would have done 
differently in Sessions 1 and 2. 

Subjects—The Ss were 36 male under- 
graduates enrolled in the university summer 
session, who were divided randomly into two 
groups of 18 Ss each. Each S was paid $3.00. 


RESULTS 


Choices.—The scores were proportions 
of choices to gamble on both +1 
and —1 trials during each of four 
25-trial blocks. For example, if in a 
particular block S gambled on 3 of the 
12 (or 13) trials in which the alternative 
to gambling was +1, his +1 
proportion for that block was .250 (or 
.231). Table 1 presents means of 
these proportions for each combination 
and range. An analysis of 
variance was performed on arc-sine 
transforms of the proportions. The 
results of this analysis (shown in 
Table 2), in conjunction with the 
relationships shown in Table 1, suggest 
the following conclusions: (a) 
under all combinations of range and 
incentive, more risk-taking occurs 
on —1 trials than on +1 trials 
(P < .001); (b) the difference in 
incentives had little effect on total 
number of risks taken; (c) more 
gambling occurred with Deck M than 
with Deck N on +1 trials and the 
reverse on —1 trials (P < .01); and 
(d) value, incentive, and range have a 
joint effect on gambling (P < .001). 
A further breakdown suggests that 
this effect is due largely to differences 
in the quadratic curvature of the V 
curves over the three ranges. For 
+1 trials, the curves for incentive are 
essentially mirror images of each 
other; for —1 trials, the curves for 
incentive are essentially parallel to 
each other. 
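
The scoring described above can be sketched in a few lines. The proportion example uses the figures given in the text (3 gambles on 12 or 13 trials). The transform shown, 2·arcsin(√p) in radians, is one common form of the arc-sine transformation and is an assumption here; the text does not say which form or which units were used, and the function names are hypothetical.

```python
import math


def gamble_proportion(n_gambles, n_trials):
    """Proportion of trials of one type (+1 or -1) on which S chose to gamble."""
    return n_gambles / n_trials


def arcsine_transform(p):
    """Variance-stabilizing transform for a proportion (assumed form, radians)."""
    return 2.0 * math.asin(math.sqrt(p))


print(round(gamble_proportion(3, 12), 3))  # 0.25
print(round(gamble_proportion(3, 13), 3))  # 0.231
print(arcsine_transform(gamble_proportion(3, 12)))
```

The transform stretches proportions near 0 and 1, which matters here because the —1 proportions in Table 1 run close to .90.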

In addition, there were a number 
of significant interactions involving 
blocks of trials. These effects are 
probably due to the interaction of 
range, incentive, and value with the 
effects of both the temporal sequence 
of blocks within a single session and 
the different mean payoffs of blocks. 

Ratings.—Table 3 shows the means 
of Ss' ratings of the means of the 
known and unknown payoff decks for 
each combination and incentive. The 
means for the known payoff deck were 
closer to the true mean of zero and 
were less variable than the means for 
the unknown payoff decks. Neither 
set of means varied systematically 
with incentive or range of unknown 
payoff. Nor were there any systematic 
relationships between these 
deck means and either overall proportions 
of risks or proportions of risks 
for the last 25 trials. Most Ss reported 
following a “gambler’s fallacy” ² 
strategy and none reported an 
awareness of gambling differentially 
with different decks of unknown 
payoffs. Incentive had no differential 
effect. 


TABLE 2 
ANALYSIS OF VARIANCE OF ARC-SINE 
TRANSFORMS OF THE PROPORTION 
OF GAMBLING RESPONSES 
IN EACH TRIAL 

Source                      df    MS           F 
Incentive (I)               1     710.00 
Ss/I                        34    2,047.71 
Value (V) (known payoff)    1     103,825.00   33.72*** 
Range (R)                   2     1,361.40     2.22 
Blocks (B)                  3     1,454.93     4.90** 
V × B                       3     987.23       3.28* 
V × R                       2     1,835.60     8.73** 
R × B                       6     187.01       8.87** 
R × B × V                   6     367.75       2.25* 
V × I                       1     2,910.50     1.07 
B × I                       3     56.80        0.19 
R × I                       2     292.70       0.47 
V × I × R                   2     1,606.10     13.66*** 
R × I × B                   6     61.66        2.82* 
V × I × B                   3     238.16       1.39 
R × B × V × I               6     154.27       1.57 
Ss × V/I                    34    2,701.36 
Ss × B/I                    102   296.53 
Ss × R/I                    68    611.05 
Ss × V × B/I                102   300.18 
Ss × V × R/I                68    210.20 
Ss × R × B/I                204   21.80 
Ss × V × B × R/I            204   162.91 

* P < .05 


TABLE 3 
MEANS OF Ss' RATINGS OF DECK MEANS 

                           Payoff 
              Known                   Unknown 
Range    0¢            5¢        0¢            5¢ 
N        .33           .16       [illegible]   —0.66 
M        .33           .16       .00           1.05 
W        [illegible]   —.22      .88           0.88 


DISCUSSION 


The decision to gamble or not to 
gamble following known outcomes of 
loss or gain of a poker chip was in- 
vestigated as a function of three ranges 
of unknown payoff involving the loss or 
gain of chips worth nothing or worth 5¢. 
Previous findings (Myers & Katz, 1962; 
Myers & Sadler, 1960) of more gambling 
when the alternative to gambling was the 
loss of a chip than when the alternative 
was the gain of a chip were confirmed. 
This difference was reduced by monetary 
incentive, although the interaction was 
not statistically significant: trials when 
the alternative was —1 were followed 
by fewer choices to gamble for chips 
worth 5¢ than for those worth nothing, 
while trials where the alternative was 
+1 were followed by more choices to 
gamble for chips worth 5¢ than for those 
worth nothing. Thus, there is some 
suggestion that monetary payoff and 
range are functionally equivalent; for all 
three ranges monetary incentive yielded 
an increase in gambling on +1 trials, a 
decrease on —1 trials. 

² Due to the procedure for constructing the 
payoff sequences, prediction of the alternative 
event following a run is not really fallacious. 

Other experiments have shown the 
form of value and range interaction 
obtained for the 5¢ incentive group. 
Myers and Sadler, using three ranges up 
to and including the range of Deck M of 
the present study, found that gambling 
consistently increased as range increased 
when the alternative was a gain of one 
chip, but gambling decreased as range 
increased when the alternative was a one- 
chip loss. Myers and Katz (1962) 
obtained similar results through a range 
of Deck M. Suydam and Myers (1962), 
using a very different procedure, found 
this convergence of positive and negative 
value curves over range, for several 
values. It may be assumed that the 
gamble represents an approach-avoid- 
ance conflict, both tendencies increasing 
as range does. Within this frame of 
reference, the data suggest that, as range 
increases, the avoidance gradient rises 
more swiftly against negative alter- 
natives; the approach gradient rises more 
swiftly against positive alternatives. 


SUMMARY 


The effects on gambling behavior of 
monetary incentive, range of payoffs for 
gambling, and value of a payoff which could 
be taken in lieu of gambling were determined. 
One group of 18 Ss gambled for chips only 
and another group of 18 Ss gambled for chips 
worth 5¢ each. Three ranges of unknown 
payoffs were used, one at each of three 
sessions. The known payoff, the acceptance of 
which was the alternative to gambling, 
remained constant. 

Neither incentive nor range had a significant 
effect upon the total number of risks 
taken. The Ss gambled significantly more 
when the alternative to gambling was a loss 
(—1) than when it was a gain (+1). Several 
interactions were significant, including Range 
× Value and Range × Value × Incentive. 
These led to the conclusion that gambling is 
affected differentially on +1 and —1 trials 
by the range of chips to be gained or lost, and 
by the interaction of range and monetary 
incentive. 


REFERENCES 


MYERS, J. L., & FORT, G. G. A sequential 
analysis of gambling behavior. Paper 
presented at Psychonomic Society, 
Columbia University, 1961. 

MYERS, J. L., & KATZ, L. Range of payoffs 
and feedback in risk-taking. Psychol. Rep., 
1962, 10, 483-486. 

MYERS, J. L., & SADLER, E. Effects of 
range of payoffs as a variable in risk taking. 
J. exp. Psychol., 1960, 60, 306-309. 

SUYDAM, M. M., & MYERS, J. L. Two 
parameters of risk-taking behavior. Psychol. 
Rep., 1962, 10, 559-562. 


(Received November 14, 1961) 


Journal of Experimental Psychology 
1962, Vol. 64, No. 5, 545-548 


SPATIAL S-R CONTIGUITY IN HUMAN 
DISCRIMINATION LEARNING 


C. D. STANDISH AND R. A. CHAMPION 


University of Sydney 


Though differing in purpose, both 
of the experiments to be reported are 
basically similar in design to an 
earlier study (Champion & Standish, 
1960) in which preliminary training 
was given with two pairs of stimuli in 
fixed spatial relations (spatial dis- 
crimination learning) followed by a 
test stage involving the two pairs and 
their transposes (nonspatial learning). 
Under these conditions negative trans- 
fer from training to test occurred when 
between-pair differences among the 
stimuli were greater than within-pair 
differences, but positive transfer oc- 
curred when within-pair differences 
dominated. These findings were in- 
terpreted in terms of the type of S-R 
theory proposed by Spence (1960); 
in particular, the differential transfer 
was explained through the presence or 
absence of within-pair discriminations 
on the part of S in the training period, 
it being argued that the occurrence of 
these discriminations promoted posi- 
tive transfer while their absence 
caused interference. The basis of the 
present studies rested in the further 
assumption that spatial S-R con- 
tiguity would also promote within-pair 
discriminations, and it was predicted 
that groups trained and tested under 
such conditions would show positive 
transfer, even though between-pair 
differences were dominant. 

The first experiment consisted of a 
repetition of the 1960 study with the 
exception that S was required to 
respond by pressing a button adjacent 
to the stimulus rather than one some 
distance from it. In order to make a 
severe test of the effect of this form 
of S-R contiguity, the stimuli in each 
pair were presented close together in 
space (S-S proximity), but this al- 
lowed the possibility that the result 
of the experiment was due to prox- 
imity rather than contiguity, the two 
factors being confounded. The second 
experiment was therefore conducted 
in an attempt to separate these two 
factors and attention was concen- 
trated on the contiguity variable, 
with degree of proximity held constant. 


EXPERIMENT I 
Method 


Subjects.—The Ss were 44 undergraduates 
from courses in psychology at the University 
of Sydney, there being 13 Ss in Groups 1 and 
2, and 9 in Groups 3 and 4. Color-blind 
students were excluded. 

Apparatus and stimuli—The apparatus 
was identical with that used previously 
(Champion & Standish, 1960), allowing S to 
be presented with colored circles of light 1 in. 
in diameter, except that the lights were only 
2 in. apart in the horizontal midline of the 
5 X 2 in. milk-glass screen, and the response 
buttons were mounted as close to the stimuli 
as possible, one button being located 
immediately above each light. In an attempt 
to equate within-pair differences the stimulus 
settings were adjusted to the following values 
of red (R) and green (G) on the Munsell 
system, with brightness and saturation held 
approximately constant: Stimulus R1 = 5R, 
R2 = 5YR, G1 = 7.5GY, G2 = 2.5GY. The 
results of the 1960 study had already been 
confirmed with these settings (Standish, 
1960). 

Procedure.—The instructions to S were 
slightly modified to allow for the new location 
of the response buttons, and between 
responses S's hand rested on the table 
immediately below the vertical midline of the 
screen, but otherwise the procedure was 
identical with that reported earlier, the 
control groups being denied preliminary training 
so as to allow an assessment of the nature and 
amount of transfer. A summary of the 
procedure and stimulus pairs is given in 
Table 1. 


TABLE 1 
STIMULUS PAIRS FOR Ss IN EXP. I 

             Experimental Groups        Control Groups 
Stage        1 (b > w)    3 (w > b)     2 (b > w)    4 (w > b) 
Training     R1+:R2-      G1+:R2-       —            — 
             G2-:G1+      G2-:R1+       —            — 
Test         R1+:R2-      G1+:R2-       R1+:R2-      G1+:R2- 
             G2-:G1+      G2-:R1+       G2-:G1+      G2-:R1+ 
             R2-:R1+      R2-:G1+       R2-:R1+      R2-:G1+ 
             G1+:G2-      R1+:G2-       G1+:G2-      R1+:G2- 

Note.—The spatial relations of the symbols in each 
pair correspond to the actual left-right arrangements 
of the stimuli. The designations b > w etc. refer to 
relative magnitudes of stimulus differences between and 
within pairs. 


Results 


The results, in terms of mean trials 
to the criterion of eight successive 
correct trials, are summarized in 
Table 2. Because of the presence of 
some extreme scores, nonparametric 
statistical tests were used. The 
application of a U test to the data of 
the first stage for Groups 1 and 3 (two 
stimulus pairs in fixed spatial relations) 
confirmed the earlier finding of 
superior performance with between-pair 
differences dominant (U = 6 
with n1 = 13 and n2 = 9, P < .001). 


TABLE 2 
TRIALS TO CRITERION IN EXP. I 

              Between-Pair Differences    Within-Pair Differences 
              Dominant                    Dominant 
Stage         Group 1     Group 2         Group 3     Group 4 
              (Exp.)      (Control)       (Exp.)      (Control) 
Training 
  Mean        14.4        —               25.9        — 
  SD          3.6         —               9.4         — 
Test 
  Mean        19.0        32.8            16.3        33.6 
  SD          22.8        29.4            4.8         19.8 



The use of a factorial median test 
(Sutcliffe, 1957) with the data of the 
test stage (two stimulus pairs and 
their transposes) failed to reveal any 
differential transfer, there being no 
significant interaction between the 
experimental-control variable and 
type of stimulus difference (χ² = 3.38 
for 1 df). Subsequent U tests showed 
that significant positive transfer 
occurred with both between-pair and 
within-pair differences dominant 
(U = 38.5 with n1 and n2 = 13, 
P < .02, and U = 11.5 with n1 and 
n2 = 9, P < .02, respectively). 
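
For readers unfamiliar with the U statistic, the sketch below shows how it is counted from two sets of scores. The scores are hypothetical values chosen only for illustration (the individual trials-to-criterion scores are not given in the text), and the function name is likewise an assumption. U counts the pairwise comparisons that go one way, so it can range from 0 to n1 × n2; with n1 = 13 and n2 = 9 there are 117 comparisons, which is why a U as small as 6 indicates almost complete separation of the groups.

```python
def mann_whitney_u(x, y):
    """Return (U_x, U_y) for two samples.

    U_x counts the (x, y) pairs in which the x score is larger;
    tied pairs count one half. U_x + U_y always equals len(x) * len(y).
    """
    u_x = 0.0
    for a in x:
        for b in y:
            if a > b:
                u_x += 1.0
            elif a == b:
                u_x += 0.5
    u_y = len(x) * len(y) - u_x
    return u_x, u_y


# Hypothetical trials-to-criterion scores, not data from the experiment.
group_a = [12, 14, 15, 11, 13]
group_b = [20, 25, 14, 30]

u_a, u_b = mann_whitney_u(group_a, group_b)
u = min(u_a, u_b)  # the smaller value is the one referred to tables
print(u_a, u_b, u)  # 1.5 18.5 1.5
print(13 * 9)       # 117 comparisons for n1 = 13, n2 = 9
```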


EXPERIMENT II 
Method 


Subjects.—The Ss were 60 undergraduates 
from courses in psychology, divided into four 
groups of 15. Color-blind students were not 
excluded. 

Apparatus.—The apparatus consisted of a 
sheet of building board 72 in. wide and 36 in. 
high, mounted vertically on a table in front 
of S and containing two 2} X 2} in. pearl- 
perspex squares 4 in. apart in the horizontal 
midline. An attempt was made to locate S 
equidistant from the two squares so that the 
centers subtended the same visual angle as 
did the centers of the circles in the 1960 study. 
Housed behind the perspex squares were 15-w. 
lamps whose intensity was controlled by two 
variacs. The duration of illumination of the 
stimuli on any trial was 200 msec. In order 
to achieve spatial contiguity of stimulus and 
response S was instructed to use response 
buttons mounted on each perspex square, 
one in each top corner nearer to the vertical 
midline of the building board. To obtain 
noncontiguity conditions, S was told to use 
two buttons mounted side by side in a small 
metal box located 3 in. in front of the board, 
the box being centered 16 in. to the right of 
the midline of the board. The response 
buttons activated indicator lamps as a signal 
to E. 

Stimuli.—The chief basis of discrimination 
in Exp. II was intensity of illumination, and 
the four values of the stimuli in foot-Lambert 
units were as follows: B1 = 125, B2 = 100, 
D1 = 2.5, D2 = 3.2. These illumination 
intensities were measured with a photometer at 
the usual location of S’s head. Lights B1 and 
B2 were similar bright stimuli, whereas D1 
and D2 were similar dull stimuli. The change 
from bright to dull illumination of the filament 
lamps was accompanied by the usual change 
in hue. 

Procedure.—The instructions to S were 
again modified to the extent demanded by the 
changes in the apparatus and the conditions 
of training. The independent variable was 
degree of spatial S-R contiguity (buttons 
near to or far from lights) and the stimulus 
conditions were limited to dominance of 
between-pair differences, the required 
discriminations being between B1 and B2, and 
between D1 and D2. Otherwise the procedure 
was as before, with training and test stages 
for the two experimental groups (Group 1, 
contiguity, and Group 3, noncontiguity), 
and test stages alone for the corresponding 
control groups (Groups 2 and 4, respectively). 
The criterion of learning was increased to 12 
successive correct trials. 


Results 


The trials-to-criterion scores for the 
training and test stages are presented 
in Table 3. For these data it was 
possible to use a parametric statistic, 
and the application of a t test to the 
training scores of Groups 1 and 3 
revealed no significant difference. An 
analysis of the variance for the test 
data of the second stage showed that 
while the pretraining had no consistent 
effect (experimental vs. control 
groups), the influence of spatial S-R 
contiguity and degree of differential 
transfer (interaction) were both 
statistically significant. Follow-up t 
tests showed that the differences 
between Groups 1 and 2, and Groups 
1 and 3 were significant (t = 2.56, 
P < .02, and t = 4.74, P < .01, 
respectively, for 28 df) but that the 
differences between Groups 2 and 4, 
and Groups 3 and 4 were not significant. 
It thus appears that positive 
transfer occurred only under contiguity 
conditions, but that there was 
no negative transfer with noncontiguity 
of S and R. 


TABLE 3 
TRIALS TO CRITERION IN EXP. II 

[Table body not recoverable. Columns: Contiguity and Noncontiguity 
groups (experimental and control); rows: Training and Test, Mean 
and SD. Surviving entries: a Test mean of 57.6 and SD of 29.9, 
column not identifiable.] 


Discussion 


The results of the two experiments 
lead to the conclusion that spatial S-R 
contiguity, like between-pair similarity, 
leads to positive transfer from the spatial 
to the nonspatial learning situation 
(training to test), and if the theoretical 
interpretation given in terms of S-R 
theory is correct, then this is because 
the contiguity promotes more effective 
within-pair discrimination in the first 
stage. The chief finding of Exp. I was 
that positive transfer from training to 
test may be obtained even when between- 
pair differences are dominant, provided 
that S be required literally to approach 
the positive stimulus cue in the course 
of the response. This result is to be 
contrasted with that of the 1960 study, 
where the dominance of between-pair 
differences produced negative transfer. 
That the present outcome cannot be 
attributed to the greater proximity of the 
stimulus cues in Exp. I is demonstrated 
by the similar result in Exp. II with a 
return to the same degree of non- 
proximity as obtained in the 1960 study. 

The data of Exp. II suggest that the 
direct effects of contiguity may not be 
as powerful with adult humans as with 
children and lower animals, by com- 
parison with the marked effects obtained 
by Murphy and Miller (1958, 1959). 
Learning was more efficient under con- 
tiguity conditions both in the training of 
Groups 1 and 3 and in the test periods of 
Groups 2 and 4 (Table 3), but in neither 
case was the difference statistically 
significant. However, when the two
scores for each S in the experimental
groups of Exp. II (Groups 1 and 3) on
the training and test stages were com- 
bined, so as to make the two tasks one, 
a significant difference emerged (t = 4.32 
for 28 df, P < .01), possibly due to the 
greater stability of the scores. It will 
also be noted that the significant nega- 
tive transfer obtained in the 1960 study 
with noncontiguity and between-pair 
differences dominant was not duplicated 
in Groups 3 and 4 of Exp. II, although 
the result was in the same direction. 
Spence’s recent elaboration of an S-R 
theory of selective learning (Spence, 
1960) points to some of the complexities 
to be coped with in any detailed formula- 
tion, but brief consideration may profit- 
ably be given to the broader theoretical 
significance of spatial S-R contiguity. 
In the present context this variable has 
been assumed to act through within-pair 
discrimination, and in S-R terms the 
efficiency of this latter discrimination 
depends in turn upon the relative 
strengths of the orienting and approach 
tendencies to the positive and negative 
discriminanda. The necessary link be- 
tween spatial contiguity and the relative 
strengths of the correct and incorrect 
responses may now be provided if it be 
allowed that spatial contiguity amounts 
to temporal contiguity of stimulus and 
response, for the importance of the latter 
variable in simple learning seems un- 
doubted (e.g., Champion, 1962). The 
equation of spatial and temporal con- 
tiguity in the discrimination-learning 
situation follows from the fact that when 
S is required to respond by pressing a 
button in or near the positive stimulus 
(contiguity) then there is more likely to 
be a short time interval between exposure 
to the stimulus and the occurrence of the 
response than if S has to look away from 
the stimulus to locate the appropriate 
response button (noncontiguity). This
effect might show up more clearly if the 
stimuli were presented for a long time 
interval, but the argument applies even 
with intervals as short as 100 msec., for
there is no requirement that stimulus and
response overlap in time, the terms 
“contiguity” and “noncontiguity” being 
relative rather than absolute. 


SUMMARY 


Two experiments on discrimination learn- 
ing were conducted under conditions in which 
preliminary training was given with two 
pairs of stimuli in fixed spatial relations, 
followed by test learning involving the two 
pairs and their transposes. The aim was to 
test the effects of spatial S-R contiguity, for it 
was predicted that contiguity would cause 
positive transfer from training to test. The 
prediction was confirmed, and the result was 
interpreted in S-R terms with the hypothesis 
that contiguity, like between-pair similarity, 
promotes within-pair discriminations on the 
part of the learner, it being supposed that 
spatial contiguity has this effect through the 
more basic variable of temporal contiguity. 


REFERENCES 


CHAMPION, R. A. Stimulus-response contiguity
in classical aversive conditioning.
J. exp. Psychol., 1962, 64, 35-39.

CHAMPION, R. A., & STANDISH, C. D. Stimulus
differences in discrimination learning.
J. exp. Psychol., 1960, 60, 78-82.

MURPHY, J. V., & MILLER, R. E. The effect
of the spatial relationship between the cue,
reward, and response in simple discrimination
learning. J. exp. Psychol., 1958, 56,
26-31.

MURPHY, J. V., & MILLER, R. E. Spatial
contiguity of cue, reward, and response in
discrimination learning by children. J. exp.
Psychol., 1959, 58, 485-489.

SPENCE, K. W. Behavior theory and learning.
Englewood Cliffs, N. J.: Prentice-Hall,
1960.

STANDISH, C. D. Stimulus differences and
response tendencies in discrimination learning.
Unpublished master's thesis, University
of Sydney, 1960.

SUTCLIFFE, J. P. A general method of
analysis of frequency data for multiple
classification designs. Psychol. Bull., 1957,
54, 134-137.


(Received December 4, 1961) 


Journal of Experimental Psychology 
1962, Vol. 64, No. 5, 549-550 


SUPPLEMENTARY REPORT: STIMULUS FAMILIARIZATION 
IN PAIRED-ASSOCIATE LEARNING 


RUDOLPH W. SCHULZ AND IRVING F. TUCKER


State University of Iowa


When Ss are familiarized with the stimulus 
units but not the response units of a list prior 
to paired-associate (PA) learning, it has 
generally been found that stimulus familiar- 
ization (SF) has a slight inhibitory effect or 
no effect at all on PA performance (Under- 
wood & Schulz, 1960). Recently, Gannon and 
Noble (1961) have reported significant 
facilitation of performance on a list of paired 
dissyllables following 20 trials of SF. Since 
the latter result is the one to be expected if 
frequency of prior experience is the vehicle 
through which stimulus meaningfulness has 
its effect on PA performance, it is a result 
with considerable theoretical significance (e.g., 
Cieutat, Stockwell, & Noble, 1958; Under- 
wood & Schulz, 1960). However, the finding 
of a positive effect from SF is also conspicu- 
ously inconsistent with the results of previous 
studies. Therefore it seemed especially im- 
portant to further assess the reliability of this 
result and consider potential alternative 
explanations for it. 

One such alternative is that Gannon and 
Noble’s procedure of having S articulate the 
stimulus unit during the PA anticipation 
interval, when combined with variation in 
amount of SF, may have inadvertently pro- 
duced simultaneous variation in the effective 
length of the anticipation interval. Since 
practice in articulation of stimulus units is 
directly related to amount of SF, familiarized 
Ss might spend relatively less of the 2-sec. 
anticipation interval for stimulus articulation 
than nonfamiliarized Ss. On the basis of the 
presumed direct relationship between PA 
performance and length of the anticipation 
interval Gannon and Noble's results would 
then be expected. The present experiment 
tested this hypothesis by comparing the 
performance of Ss instructed to pronounce the 
stimulus units during PA anticipation with 
performance of Ss instructed not to pronounce. 
A significant interaction of PA instruction and 
amount of SF will be required to support the 
present contention. 

Method.—A 2 × 3 factorial design with 2
levels of PA instruction—articulation (A) vs.
nonarticulation (NA) of stimulus units—and
3 amounts of SF (0, 20, and 60 trials) was
used. The six respective conditions will be
referred to in terms of the values of the
independent variable associated with them
(e.g., Cond. A0 = articulation instructions and
0 familiarization, Cond. NA60 = nonarticulation
instructions and 60 trials of familiarization,
etc.).

The materials and procedures were iden- 
tical to those used by Gannon and Noble 
(1961) with the following exceptions: (a) A 
'85-sec. rate of presentation was used during 
familiarization; (b) PA performance consisted 
of 17 anticipation trials. 

A total of 144 Ss, 24 per condition, taking 
introductory psychology at the University of 
Iowa were randomly assigned to conditions as
they appeared at the laboratory. The Ss had 
not served in prior verbal learning ex- 
periments. 

Results and discussion.—Performance on
the PA list under the six conditions, in terms
of mean total number of correct responses
during 17 anticipation trials, is shown in
Fig. 1. The predicted interaction between
PA instructions and amount of SF was obtained,
and is shown by an analysis of
variance to be the only significant effect
(F = 3.78, df = 2/138, P < .05). In agreement
with the results of Gannon and Noble
(1961), performance was a monotonic increasing
function of number of familiarization
trials when Ss were required to pronounce the
stimulus terms of a PA list prior to anticipation
of the response terms. However, with
nonarticulation PA instructions, performance
was inversely related to amount of familiarization.
It can also be seen from Fig. 1 that,
even though the facilitating effects of familiarization
appear to be approaching an
asymptotic level under Cond. A60, performance
under Cond. NA0 was slightly better
than under Cond. A60. Irrespective of how
proficient S becomes at pronouncing the
stimulus units, it takes longer to pronounce
than not to pronounce; it takes longer to say
something than to say nothing.

FIG. 1. Mean total number of correct responses
during 17 anticipation trials as a function of PA
instructions and number of stimulus familiarization trials.
(Ordinate: mean number of correct responses; abscissa:
amount of familiarization. The standard error of the
means in Fig. 1, as estimated from the within-groups MS
of the overall analysis of variance, was 2.71.)

Intercomparison of the various conditions
via the critical difference technique (Lindquist,
1953) revealed two significant (P < .05)
differences, Cond. NA0 vs. Cond. A0 and
Cond. NAs vs. Cond. NAs. Inspection of
acquisition as a function of trials for each of
the conditions did not reveal evidence of
interaction between treatments and trials.
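
The degrees of freedom reported for the interaction (2/138) and the approximate size of a critical difference can be reproduced from figures given in the text. What follows is a rough sketch under stated assumptions: the critical difference is taken to have the usual form t × sqrt(2 × MS within / n), which equals t × sqrt(2) × the standard error of a mean, and the tabled t of 1.98 for 138 df at the .05 level is supplied here rather than taken from the report. The function names are illustrative.

```python
import math


def factorial_df(levels_a, levels_b, n_per_cell):
    """Degrees of freedom for a two-factor between-groups design."""
    cells = levels_a * levels_b
    total_n = cells * n_per_cell
    return {
        "A": levels_a - 1,
        "B": levels_b - 1,
        "AxB": (levels_a - 1) * (levels_b - 1),
        "within": total_n - cells,
    }


def critical_difference(se_mean, t_crit):
    """Smallest difference between two cell means reaching significance.

    Assumes equal n per cell, so that sqrt(2 * MS_within / n)
    equals sqrt(2) * SE of a single mean.
    """
    return t_crit * math.sqrt(2.0) * se_mean


df = factorial_df(levels_a=2, levels_b=3, n_per_cell=24)  # 2 x 3 design, 24 Ss per cell
cd = critical_difference(se_mean=2.71, t_crit=1.98)       # t_crit is an assumed tabled value
print(df["AxB"], df["within"], round(cd, 2))
```

Under these assumptions two condition means would need to differ by roughly 7.6 correct responses to reach the .05 level.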

In conclusion, it is apparent that PA instructions
regarding S's response to the
stimulus term during the anticipation interval
can determine whether SF facilitates or inhibits
PA performance. We believe that these
effects are attributable to covariation of the
effective length of the anticipation interval
with amount of familiarization. Furthermore,
it may be expected that such factors as word
length, pronunciability, and meaningfulness
will also, depending on the length of the
anticipation interval, interact with PA instructions
and amount of familiarization.


REFERENCES


CIEUTAT, V. J., STOCKWELL, F. E., & NOBLE, C. E.
The interaction of ability and amount of practice
with stimulus and response meaningfulness (m, m')
in paired-associate learning. J. exp. Psychol., 1958,
56, 293-302.

GANNON, D. R., & NOBLE, C. E. Familiarization (n)
as a stimulus factor in paired-associate verbal
learning. J. exp. Psychol., 1961, 62, 14-23.

LINDQUIST, E. F. Design and analysis of experiments in
psychology and education. Boston: Houghton
Mifflin, 1953.

UNDERWOOD, B. J., & SCHULZ, R. W. Meaningfulness
and verbal learning. Chicago: Lippincott, 1960.


(Received September 14, 1961)


Journal of Experimental Psychology
1962, Vol. 64, No. 5, 550-551


SUPPLEMENTARY REPORT: TIME BETWEEN PAIRINGS AND
SHORT-TERM RETENTION 1


LLOYD R. PETERSON, KENNETH HILLNER, AND DOROTHY SALTZMAN


Indiana University 


Peterson, Saltzman, Hillner, and Land 
(1962) found marked forgetting within 8 sec. 
after a single paired-associate presentation, 
when other presentations filled the interval. 
The present study investigates the effect of 
an 8-sec, interval similarly filled which is 
inserted between the first and second pres- 
entations of an individual pair later tested for 
retention. 

1 This research was supported by Grant G 12917
from the National Science Foundation to Indiana
University, a grant for which the senior author is
principal investigator.

Method.—The technique of the previous
experiment was used. The first as well as the
second pairing consisted of S reading aloud
the stimulus and the response from the drum.
Either 0 or 8 sec. separated the two pairings.
The retention interval, measured from removal
of the second presentation from the
drum, was 2, 4, 8, or 16 sec. There were
42 Ss who were tested 16 times in each of the
eight conditions. Stimuli were three- and
four-letter words. The responses were indicated
in the instructions to be the numbers
1-10. Twelve seconds rest separated 16
blocks of 31-41 exposures, save for a J-min.
rest between Blocks 8 and 9.

Results.—Table 1 shows that massed pairings
resulted in superior recall at the 2- and
4-sec. retention intervals, while 8-sec. spacing
was superior at the 8- and 16-sec. retention
intervals. An analysis of variance found
significance for both Spacing (F = 10.72,
df = 1/41, P < .01) and Retention (F = 71.80,
df = 3/123, P < .01). Their interaction was
also significant (F = 16.47, df = 3/123,
P < .01). The two halves of the session did
not differ significantly (F = .73, df = 1/41,
P > .05). After pooling data from the two
halves of the session, individual t tests showed
that differences between the two spacing
conditions were significant at the 4-, 8-, and
16-sec. retention intervals (t = 3.07, 5.23,
4.84; df = 41, P < .01).


TABLE 1
PROPORTIONS CORRECTLY RECALLED

             First Half                Second Half
             Retention Interval (Sec.) Retention Interval (Sec.)
Spacing      2     4     8     16      2     4     8     16

0 Sec.      .82   .69   [?]   .48     .85   .74   .45   .48
8 Sec.      .81   .66   .56   .56     .82   .63   .60   .40

([?] marks an entry that is illegible in the source.)
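
The degrees of freedom quoted in the analysis of variance follow from the design if every S served in every condition. The sketch below assumes a fully within-subject Spacing × Retention layout in which each effect is tested against its own interaction with Ss; that error structure and the function name are assumptions, since the report does not describe the error terms.

```python
def within_subject_df(n_subjects, levels_a, levels_b):
    """(effect df, error df) for each source in an S x A x B design.

    Each effect is assumed to be tested against its interaction with Ss.
    """
    s = n_subjects - 1
    a = levels_a - 1
    b = levels_b - 1
    return {
        "A": (a, a * s),
        "B": (b, b * s),
        "AxB": (a * b, a * b * s),
    }


# 42 Ss, 2 spacings (0 and 8 sec.), 4 retention intervals (2, 4, 8, 16 sec.)
table = within_subject_df(n_subjects=42, levels_a=2, levels_b=4)
for source, (effect_df, error_df) in table.items():
    print(source, f"{effect_df}/{error_df}")
```

On this reading Spacing is tested on 1/41 df and both Retention and the interaction on 3/123 df, as reported; a two-level factor such as halves of the session would likewise give 1/41.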

A paradox is presented by the finding that
when an interval during which marked
forgetting can be shown to occur is introduced
between pairings, there is improvement in
retention at long intervals. Underwood's
(1961) explanation of distributed practice of
lists does not seem appropriate here, since he
concludes that distribution is superior to
massing only when response learning is involved.
Response learning was minimized
by the instructions in the present study of
spacing between individual pairings.


REFERENCES


PETERSON, L. R., SALTZMAN, D., HILLNER, K., &
LAND, V. Recency and frequency in paired-associate
learning. J. exp. Psychol., 1962, 63, 396-403.

UNDERWOOD, B. J. Ten years of massed practice on
distributed practice. Psychol. Rev., 1961, 68, 229-247.

(Received August 18, 1961)


Journal of Experimental Psychology
1962, Vol. 64, No. 5, 551-552


SUPPLEMENTARY REPORT: YOKED COMPARISONS OF CLASSICAL
AND AVOIDANCE EYELID CONDITIONING
UNDER THREE UCS INTENSITIES 1


I. GORMEZANO, JOHN W. MOORE, AND EDWARD DEAUX


Indiana University


Moore and Gormezano (1961) observed
that classical conditioning, when experimentally
equated in terms of partial reinforce-
ment pattern and number of UCS occurrences 
by a yoking procedure, was inferior to avoid- 
ance conditioning. The present investiga- 
tion was conducted to determine the effects 
of UCS intensity on such yoked comparisons 
of classical and avoidance conditioning. 

Method.—The general apparatus, proce- 
dure, and stimuli were the same as in the 
earlier experiment. The only variation was in 
the intensities of the UCS employed. Twenty 
Ss were assigned to each of the six cells of a 
2 × 3 factorial design in which classical and
avoidance conditions were made orthogonal to 
three UCS puffs of nitrogen which had in- 
tensities sufficient to support 40-, 80-, and 160- 
mm. columns of mercury. In addition 2 male 
Ss were lost because of apparatus failure. The 
two recording systems and sex were also made 
orthogonal to the classical-avoidance and UCS 
intensity dimensions. 

1 This research was supported by Grant G 16030
from the National Science Foundation. A report of
this experiment was presented at the Psychonomic
Society, New York, September 1961.

Results.—The distributions of response
latencies for all six groups in acquisition and
extinction were recorded. The distributions
(not shown) revealed that aside from the
higher frequency of responses in the CR range
for the avoidance groups, the only discernible
difference was the tendency for the modal
responses to decrease in latency, under both
conditioning procedures, as UCS intensity
increased.

Figure 1 presents the results of plotting 
percentage CRs for the six experimental 
groups in acquisition and extinction. The 
initial points on all acquisition curves are the 
mean percentage CRs on Trial 1 and the 
remaining points are for successive blocks of 
10 trials. The extinction curves are plotted
in 5-trial blocks. The figure indicates that the
acquisition performance of the avoidance
groups (A160, A80, A40), for each of the three
UCS intensities, was superior to that of their
respective yoked-classical groups (i.e., A160
vs. Y160, A80 vs. Y80, and A40 vs. Y40).
Though Groups A160 and A80 follow essentially
identical courses of acquisition,
Group Y160 was inferior to Y80. The extinction
curves reveal that the higher level of
responding of the avoidance groups in acquisition
also persisted in extinction. As was observed
in the previous study, CRs of the
avoidance groups demonstrated steeper decay
functions than those of the yoked-classical groups,
a difference that is probably in large part due
to their having started at higher levels of responding.

Split-plot analyses of variance (Snedecor,
1959) on the arc-sine transform of individual
percentage CRs for the 70 acquisition and 20
extinction trials revealed significant F values
for the classical vs. avoidance comparisons in
acquisition (F = 27.15, df = 1/48, P < .005)
and extinction (F = 22.55, df = 1/48, P < .005).
The UCS intensity dimension failed to produce
significant differences in either acquisition
or extinction. A significant F value was
obtained in acquisition for the Classical-Avoidance
× UCS Intensity interaction
(F = 14.39, df = 2/48, P < .005), reflecting
the fact that as UCS intensity increased,
performance of the avoidance groups increased
while performance of the yoked-classical
groups decreased. This interaction appears
to be simply a function of a negative correlation
between the performance levels of the
paired Ss. However, the significant interaction
is heavily weighted by the low level of
responding of Group Y160. If the performance
of Group Y160 is not sampling error, its
performance relative to Y80 suggests a Partial
Reinforcement × UCS Intensity interaction.
A significant F value was also obtained for the
Classical-Avoidance × Sex interaction in extinction
(F = 5.02, df = 1/48, P < .05); and
examination of the mean percentage CRs
revealed that under the avoidance procedure
males gave a higher percentage of CRs,
whereas under the yoked-classical procedure
the females were superior. In the remaining
sources of variation those of statistical significance
were without psychological import
(i.e., interactions involving the recording
system source of variation).

FIG. 1. The percentage CRs plotted in 10-trial blocks
during acquisition and 5-trial blocks in extinction.
(Ordinate: per cent CRs; abscissa: block of trials;
panels: acquisition, extinction.)
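
The arc-sine transform applied to the percentage CRs before the analyses of variance can be illustrated briefly. The report does not state which tabled form was used; the sketch assumes the common version, angle = arcsin of the square root of the proportion, expressed in degrees, and the function name is illustrative.

```python
import math


def arcsine_transform(percentage):
    """Angular transform of a percentage: arcsin(sqrt(p)) in degrees.

    The form of the transform is an assumption; the source names only
    'the arc-sine transform'.
    """
    if not 0.0 <= percentage <= 100.0:
        raise ValueError("percentage must lie between 0 and 100")
    proportion = percentage / 100.0
    return math.degrees(math.asin(math.sqrt(proportion)))


# Equal steps in percentage are stretched near the ends of the scale.
for pct in (50.0, 55.0, 90.0, 95.0):
    print(pct, round(arcsine_transform(pct), 2))
```

The transform spreads out percentages near 0 and 100, where the variance of a percentage is smallest, which is the usual reason for applying it before an analysis of variance.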


REFERENCES 


MOORE, J. W., & GORMEZANO, I. Yoked comparisons
of instrumental and classical eyelid conditioning.
J. exp. Psychol., 1961, 62, 552-559.

SNEDECOR, G. W. Statistical methods. (5th ed.) Ames,
Iowa: Iowa State Coll. Press, 1959.


(Received September 7, 1962) 


Journal of 


Experimental Psychology 


VOL. 64, No. 6


DECEMBER 1962 


EDITORIAL 


This issue of the Journal of Experi- 
mental Psychology marks the end of 
144 issues and 12 years of my editor- 
ship. During these 12 years there 
have been marked changes in experi- 
mental psychology—chiefly in the 
directions of more mathematical for- 
mulation and testing of theory, more 
elaborate (and more adequate) ex- 
perimental designs and statistical 
analyses, more emphasis on the human 
subject, and more emphasis on the 
“higher mental processes” of the 
human subject—whether in learning 
and problem solving or in “informa- 
tion processing” performance. The 
Journal has attempted to respond to 
and reflect these changes, even en- 
courage them, at the same time that 
it remained a sympathetic medium for 
more traditional problems and meth- 
ods and for the wide range of content 
properly described as the experimental 
analysis of the mental and behavioral 
processes of the individual organism— 
especially man. 

Provided that the problem of an 
experimental study fit within the 
content boundaries prescribed for the 
Journal, the criterion for acceptance 
of an article has been at all times the 
question whether it warranted space 
in the ever-more-crowded archives of 
our science. This is, of course, a 
multidimensional criterion, some di- 


mensions being quite objective and 
some being quite subjective. Objec- 
tive, or at least rational, dimensions 
were matters pertaining to the ade- 
quacy of the experimental design for 
the collection of data on the problem 
as stated, the adequacy and appro- 
priateness of the measures extracted 
from the data and the statistical tests 
employed, and the logical relationship 
between the data exhibited and the 
conclusions drawn. Criticism and 
rejection on the basis of these char- 
acteristics of the experiment may be 
considered as the application of “internal”
criteria, since the emphasis is
on internal consistency. Here the
question was usually formulated as
“Is this a valid experiment?” Many
times the answer has been “No.” 
Proper control groups were not tested; 
the design confounded variables that 
made the results and conclusions 
trivial, rather than important; the 
chosen method of summarizing the 
data led to conclusions that did not 
hold up if the data were summarized 
in another equally appropriate, or 
more appropriate, way; etc.

The next step in the assessment of 
an article involved a judgment with 
respect to the confidence to be placed 
in the findings—confidence that the 
results of the experiment would be 
repeatable under the conditions described.
In editing the Journal there
has been a strong reluctance to accept 
and publish results related to the 
principal concern of the research when 
those results were significant at the .05 
level, whether by one- or two-tailed 
test. This has not implied a slavish
worship of the .01 level or any other 
level, as some critics may have 
implied. Rather, it reflects a belief 
that it is the responsibility of the 
investigator in a science to reveal his 
effect in such a way that no reasonable 
man would be in a position to dis- 
credit the results by saying that they 
were the product of the way the ball 
bounced. At least, it was believed 
that such findings do not deserve a 
place in an archival journal, even 
though they may be proper fare for 
symposia, scientific meetings, and 
dittoed handouts. The P level of a 
finding which was the major purpose 
of the investigation (anyone can find 
a significant practice effect) is only 
one element in the persuasion, others 
being the relation of necessity be- 
tween the predicted relationship and 
other previously or concurrently de- 
monstrated effects, and the consist- 
ency of the relationship across a 
sequence of experiments. But an 
isolated finding, especially when em- 
bodied in a 2 X 2 design, at the .05 
level or even the .01 level was 
frequently judged not sufficiently 
impressive to warrant archival publication.
The same philosophy applied
when negative results were submitted 
for publication, but here rejection 
frequently followed the decision that 
the investigator had not given the 
data an opportunity to disprove the 
null hypothesis, i.e., the sensitivity of 
the experiment was substandard for 
the type of investigation in question 
and was therefore not sufficient to 
persuade an expert in the area that 
the variable in question did not have
an effect as great as other variables of
known significant effect. 

Even if a proffered experimental 
study passed the hurdle of design 
adequacy and judged repeatability, 
it still needed to pass another hurdle, 
and an increasing number of articles 
failed this third hurdle as we moved 
into the late 1950s and early 1960s. 
Increasingly, we applied a criterion of 
substantiality to experimental studies. 
By this was meant that the investiga- 
tion should not merely identify the 
effect of a variable, but should move 
beyond that simple demonstration to 
either the determination of a function 
relating levels of the variable to levels 
of the effect or the assembly of further 
information about the demonstrated 
effect—the composition of the vari- 
able, the range of tasks in which it had 
an effect, the variation of the effects 
as a function of a second or third 
variable, etc. We believed that the 
day of the archival report based on a 
simple experiment with an experi- 
mental and control group or with a 
2X2 design was past in many 
mature areas of psychological research,
and that each published report
should make a more substantial
contribution to the problem. In
particular, it seemed desirable for
experimental psychology to move
toward the determination of quantitative
functional relationships between
independent and dependent variables,
especially since so many of these
quantitative relationships in behavior
turn out to be nonmonotonic. Failure
to make a serious effort to understand
the variable, either through plotting
the effects of several levels of it or
through follow-up experiments with
the intent of determining the generality
or other contingencies of its
effect, was considered sufficient reason
to choose not to publish until such
additional work had been done.

These, then, have been the guiding 
criteria for acceptance of articles for 
the Journal. When an article was 
rejected, an attempt was made to 
state the basis for rejection in terms 
of those criteria. But the same 
criteria were employed as the basis 
for required revisions prior to publica- 
tion. Only about 10% of all articles 
received by the Journal were pub- 
lished without substantial revision or 
rejection, which is to say that four out 
of five of the published articles (which 
were, in turn, 50% of those received) 
have suffered substantial revision be- 
fore publication. Very often the revi- 
sion was required for the purpose of 
condensing the article, eliminating du- 
plicated data in figures and tables, and 
in other ways decreasing the length of 
an article without reducing its es- 
sential content. But with a frequency 
that would surprise some readers, 
required revisions have consisted of 
adding detailed description of pro- 
cedures, making explicit some design 
factor, adding data in tables or 
figures, and even urgings to the 
author to add words in order to make
more explicit his analysis and interpretations.
In short, the philosophy
of acceptance has been that an article 
should not be rejected if the experi- 
ments were acceptable, even though 
major revision of the data analysis or 
the article as a whole was required. 
In fairness to contributors who sub- 
mitted completely acceptable articles, 
those who were required to revise 
were given a limited period of time 
(usually 30 days) in which to re- 
submit the revision without loss of 
position in the publication order. 
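
The proportions quoted in this paragraph are mutually consistent, as a short calculation shows. The sketch uses only the two figures given above (about 10% of received articles published without substantial revision, and about 50% of received articles published); the variable names are illustrative.

```python
from fractions import Fraction

received = Fraction(1)                               # all articles received
published = Fraction(50, 100) * received             # about half were published
published_unrevised = Fraction(10, 100) * received   # published without substantial revision

# Share of published articles that underwent substantial revision first
revised_share = (published - published_unrevised) / published
print(revised_share)
```

The result is four-fifths, matching the statement that four out of five published articles were substantially revised before publication.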

The intent of these criteria as 
applied either to acceptance or re- 
vision before publication has been to 
get more scientific mileage from the 
pages of the Journal. This was done 
by excluding questionable data, by 


555 


encouraging the reporting of research 
in larger, more substantial chunks, 
and by reporting research as com- 
pletely as necessary but also suc- 
cinctly. All of this stems from the 
conception of the Journal as an 
archive of our science, not as a news- 
sheet filled with a heavy load of 
transient, undigested, or fallible in- 
formation. However, this screening 
of information for such archival 
records is not an infallible procedure. 
The criteria are not reducible to 
formula, and the final judgment is 
intrinsically subjective. Therefore, 
studies have been accepted and pub- 
lished, only later to be judged inade- 
quate by the criteria; others have been 
rejected and published elsewhere, later 
to be widely acclaimed as containing 
important data. (I do not include in
the latter class those experimental 
articles that deserved to be rejected 
because of grievous faults in the 
design, but which, when published 
elsewhere, stimulated research on a 
problem owing to the very inade- 
quacies of the original experiment.) 

It should be clear from what has 
been said about criteria that heavy 
demands were placed on the editorial 
staff for detailed information about 
what is already known and for 
methodological sophistication, and 
this demand applied to a myriad of 
special problem areas in the case of an 
omnibus journal such as the Journal 
of Experimental Psychology. The 
heart of the editorial system is now 
and has been the board of Consulting 
Editors of the Journal, some of whom 
served for the entire 12 years and all 
of whom have made multiple, essential 
contributions to the implementation 
of the criteria and standards that all 
of us considered to be in the best 
interest of the science. It is fitting,
therefore, that all of the Consulting
Editors of the Journal be given this
public repetition of my private expressions
of appreciation and indebtedness
for making my editorship
of the Journal possible. The list
follows:


Norman H. Anderson (1959-62) Harold W. Hake (1957-62)
E. James Archer (1957-62) Lloyd G. Humphreys (1951-59)
Fred Attneave, III (1959-62) Arthur L. Irion (1954-62) 
Judson S. Brown (1951-58) Howard H. Kendler (1957-62) 
Cletus J. Burke (1953-62) Herschel W. Leibowitz (1962) 
James Deese (1960-62) Donald B. Lindsley (1951-62) 
Paul M. Fitts (1957-62) Kenneth MacCorquodale (1957-62) 
Frederick C. Frick (1957-59) Quinn McNemar (1957-62)
Robert M. Gagné (1953-56) Neal E. Miller (1951-57) 
Wendell R. Garner (1954-56) Edwin B. Newman (1951-54) 
Frank A. Geldard (1951-62) Leo Postman (1961-62) 
James J. Gibson (1951-62) L. Starling Reid (1958-62) 
Clarence H. Graham (1951-61) Kenneth W. Spence (1953-62) 
David A. Grant (1951-56) Benton J. Underwood (1951-56) 
Delos D. Wickens (1951-56, 1959-62) 


In addition to those listed, many
other psychologists have made important
contributions to the Journal
through their reviews of specific
articles where their competence was
deemed necessary for knowledgeable
evaluations. Each of them, if he
reads this, will, I hope, consider himself
again privately thanked for his
contribution.

This tribute to the Consulting
Editors of the Journal must not be
interpreted as shifting to their shoulders
responsibility for the Type I and
Type II errors we have made in
accepting or rejecting articles. Their
relationship to the final decision,
which was always made by myself or
by an Associate Editor, was understood
at all times to be advisory.
There were times—not very many—
when Consulting Editors did not
agree in their evaluation, and there
were times—again, not very many—
when we accepted even though the
Consulting Editor recommended rejection,
or rejected even though he
recommended acceptance. In any
event, he was informed of the action
taken on his advice, since he received
copies of the letters to authors which
indicated acceptance, rejection, or
requirements for revision. Perhaps
we should take this opportunity to
thank all those Consulting Editors
who tolerated our bad judgment when
we failed to follow their advice, and
who did not resign forthwith—none
did, to my knowledge.

Penultimately, I wish to express my
deep appreciation to the three who
served as Associate Editors of the
Journal—David A. Grant (1957-62),
Delos D. Wickens (1957-58), and
William K. Estes (1959-62). Editing
the Journal during these last 6 years
would have been intolerable, if not
impossible, without each of them
assuming roughly one-third of the
responsibility for deciding what should
and should not be in the Journal. As
many who have contributed to the
Journal know, their roles were those
of [words illegible] complete responsibility
concerning the relationship
with authors up through the
decision to accept or reject. The
ability of the Journal to judge appropriately
some of the technical and
theoretical innovations of recent years
is largely a consequence of their
participation in the editing of the
Journal.

Finally, it would be thoughtless of 
me to bring this swan song to a close 
without expressing our appreciation 
of the tolerance (sometimes requiring 
incubation) that the vast majority of 
contributors to the Journal have 
shown toward the editorial mayhem 
performed, sometimes repeatedly, on 
the products of their thought and 
work. Experimental work in psy- 
chology is terribly laborious business, 
as every one who has done it knows.
When, after all is done and a report of 
it is written, it is nothing short of 
mental cruelty to have an editor 
require that it be cut in half, and even 
worse to have him recommend that it 
serve as a lesson in how to do a better 
job next time. While I have no 
illusion that this editorial role has
increased the quantity of warm sentiments
that come my way, and I know
that I have been hung in effigy in 
some laboratories and offices, it is still 
my hope that producing experimental 
psychologists who make our science 
grow apace, and who make the Jour- 
nal possible, recognize the attempt to 
be fair and explicit, if not the wisdom, 
in the decisions they have suffered. 
Some authors have even been so kind 
as to say that this is so. 

I feel no reluctance, only gratifica- 
tion and confidence, as I relinquish 
the editorship to my able friend and 
colleague, David A. Grant. These 
sentiments relate not only to his 
editorship, but also to the vigorous, 
sometimes combative, state of experimental
psychology and experimental psychologists.

ARTHUR W. MELTON


Journal of Experimental Psychology 
1962, Vol. 64, No. 6, 558-564 


FREE RECALL LEARNING OF VISUAL FIGURES AS A 
FUNCTION OF FORM OF INTERNAL STRUCTURE 


JAMES R. WHITMAN 
Veterans Administration Hospital, Perry Point, Maryland 
AND W. R. GARNER
Johns Hopkins University 


The literature on factors affecting 
free recall learning is voluminous. 
The kinds of factors which have been 
investigated involve such things as the 
number of items to be learned, prior 
experience with the items, meaning- 
fulness of the items, interitem simi- 
larity, etc. Most of the experiments 
reported have had as an assumption, 
either explicit or implicit, that char- 
acteristics of the individual items or 
stimuli are critical. 

Garner (1962) has emphasized an- 
other aspect of free recall learning, 
namely, the internal structure in- 
herent in groups of stimuli to be 
learned. A set of stimuli can be 
considered to be generated by a series 
of variables, and these variables can 
be interrelated in any subset of 
stimuli actually used in the learning 
experiment.¹ This relatedness constitutes
the internal structure of the
stimuli or the variables which make up 
the stimuli. The amount of internal 
structure is the same as the redun- 
dancy of the subset of stimuli and is 
determined by the number of stimuli 
used in an actual subset compared to 
the number which could have been 
generated with the same number (and 
levels) of the variables. 

If the number of stimuli used in an experiment is the same as the total
number which could be generated from the given variables, then all
variables are orthogonal to each other and there is no internal
structure to be learned. Knowing these variables, S can reproduce the
set of stimuli without practice. But if the number of stimuli used is
less than the number which could have been used, then internal structure
exists; and both the amount and the form of this structure will affect
ease of free recall learning.

¹ In this paper we shall use the term set to indicate the group of all
possible stimuli generated by the specified variables and levels; we
shall use the term subset for any group of stimuli which does not
include all possible stimuli.

The internal structure is not de- 
termined by the relations between 
elements of any particular stimulus. 
Rather, it is determined by the rela- 
tions between the variables making up 
the stimuli across the subset of 
stimuli actually used. Thus the 
amount and form of internal structure 
cannot be specified without knowing 
the exact subset of stimuli used in the 
learning experiment. 

These considerations led Garner (1962) to state two specific
hypotheses, the tests of which are the purpose of this experiment.
First, the ease of free recall learning is not a question of the
characteristics of the individual stimuli which make up the subset but
is a question of the characteristics of the entire subset of stimuli.
Thus the same stimuli imbedded in two different subsets of stimuli will
be learned according to the characteristics of the subset within which
they are imbedded, and the nature of the unique stimuli is irrelevant.




Second, the form of the internal 
structure is a critical factor in learning 
even with the same total amount of 
internal structure. Specifically, those 
forms of structure which involve 
direct contingencies between pairs of 
variables will produce easier learning 
than will forms of structure involving 
complex relations among three or 
more variables, i.e., interactions. 

While these hypotheses are relevant 
to free recall learning of any stimuli, 
the present experiment tests them 
specifically with free recall learning of 
visual figures. 


METHOD 
The Stimuli 


In carrying out an experiment to test the 
importance of the form of internal structure 
in free recall learning, it is important that the 
amount of internal structure be held constant. 
In more specific terms, it is important to 
specify not only the subsets of stimuli 
actually used but also the total set of stimuli 
which could have been used, since the ratio 
between these determines the amount of 
internal structure. 

Total set of potential stimuli.—The po- 
tential stimuli, or the complete set of stimuli 
from which the subsets were selected, were 
formed by using three levels or values of each 
of four variables. If all possible combinations 
of these variables are generated, the total 
number of possible stimuli is 81. The four 
variables and their values are: (a) Shape, with 
squares, triangles, or circles constituting the 
levels; (b) Lines, with two, one, or zero lines 
bisecting the shape; (c) Spaces, with a space 
on the left, on the right, or no space; and 
(d) Dots, with a dot above the shape, below it, 
or none. 

Subsets of actual stimuli —Three different 
subsets of stimuli to be used in the experiment 
were chosen from this total set. Each subset 
contained 9 different stimuli from the 81 
possible and differed only with regard to the 
form of the internal constraint. In selecting 
these three subsets, it is important that each 
subset demonstrate all four variables of the 
total set and, furthermore, that each level 
of each variable occur equally often. This 
precaution is necessary to ensure that the 
factor of total amount of internal structure 
not be confounded with the form of internal 
structure. All three subsets of actual stimuli are shown in Fig. 1,
along with coded values of the four variables.

[Figure 1 omitted: drawings of the nine figures in each of Subsets A,
B, and C, each with its four-digit code.]

FIG. 1. The three subsets of actual stimuli used. (The number to the
right of each figure provides the coded values for the four variables,
in the order, shape, space, line, and dot. The underlined coded values
for three figures each in Subsets B and C represent the three identical
figures for these subsets.)

Subset A: In the first subset of nine visual 
figures each of the four variables occurs three 
times at each level, but no two of the variables 
are directly correlated. Since there are four 
variables, there are six pairs of variables; 
and none of these pairs has a contingency 
greater than zero. This subset of stimuli is 
equivalent to a Graeco-Latin square, in which 
four variables are all orthogonal to each other. 

Subset B: The second set of figures was 
selected so that one of the six pairs of vari- 
ables was perfectly correlated and the other 
five were orthogonal. This subset provides a 
condition intermediate between subsets with 
minimum and maximum contingencies be- 
tween variables. 

Subset C: The last subset of figures was 
selected so that three of the six pairs of vari- 
ables were perfectly correlated while the 
other three were uncorrelated. This subset
provides the maximum contingency between 
pairs which can exist. Since nine different 
stimuli are required, at least one pair of 
variables must be orthogonal, a restriction 
which also means that no more than three 
of the pairs can be correlated. 


TABLE 1
UNCERTAINTY ANALYSES OF THE FORM OF INTERNAL STRUCTURE OF THE THREE
SUBSETS OF STIMULI, WITH SHAPE (W), SPACE (X), LINE (Y), AND DOT (Z)
AS VARIABLES

                          Subsets
Contingencies        A        B        C
Simple
  W:X                0       1.58     1.58
  W:Y                0       0        1.58
  W:Z                0       0        0
  X:Y                0       0        1.58
  X:Z                0       0        0
  Y:Z                0       0        0
Interaction
  WXY                1.58    0       -1.58
  WXZ                1.58    0        0
  WYZ                1.58    1.58     0
  XYZ                1.58    1.58     0
  WXYZ              -3.16   -1.58     0
Total
  W:X:Y:Z            3.16    3.16     3.16


Three of the stimuli in Subset C are iden- 
tical to three in Subset B. These identical 
stimuli were included to allow comparison of 
learning rates for these particular stimuli, to 
determine the extent to which learning is 
affected by the particular stimuli rather than 
by the characteristics of the subset. 

To summarize the characteristics of these 
three subsets of stimuli, each subset contains 
nine different stimuli; in addition, each subset 
contains exactly three occasions of each of the 
three levels of each of the four variables. 
Thus the number of specific elements which 
compose the different subsets is identical in 
all cases.

These subsets differ only in regard to the form of the internal
structure, and an uncertainty analysis (in bits) of these three subsets
of stimuli is shown in Table 1. The total amount of internal structure
(W:X:Y:Z), shown at the bottom, is the same in all three cases, 3.16
bits. In Subset A, however, all of this structure is in the form of
interactions, and none of the simple contingencies (between pairs of
variables) is greater than zero. In Subset B, one simple contingency
exists, and the rest of the structure is in the form of interactions.
The pattern here is somewhat more complex. Again the interaction
involving all four variables is negative and serves to correct the
three-variable interactions. In Subset C, the maximum amount of simple
contingency exists since three of the pairs are perfectly correlated.
Since this amount of structure is greater than the total structure,
again the negative interaction term occurs to correct the total.

Our hypothesis concerning form of struc- 
ture concerns the amount of structure which 
exists in the form of simple contingencies. In 
Subset A, none does; in Subset B, 1.58 bits 
does; and in Subset C, 4.74 bits does. 
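
The uncertainty analysis described above can be sketched in a few lines of Python. This is an illustrative sketch only, not the authors' computing procedure. The three subsets below are hypothetical constructions that satisfy the stated properties (a Graeco-Latin arrangement for A, one perfectly correlated pair for B, three for C); they are not the actual figures of Fig. 1, and the function names are assumptions. Each interaction term is taken as the total constraint among a group of variables minus all lower-order terms within that group. Because the code rounds exact values of log2(3), it gives 3.17 for the total where the text, which works from the rounded 1.58, gives 3.16.

```python
from collections import Counter
from itertools import combinations
from math import log2


def entropy(rows, cols):
    """Uncertainty (bits) of the joint distribution of the chosen columns."""
    counts = Counter(tuple(r[c] for c in cols) for r in rows)
    n = len(rows)
    return -sum(k / n * log2(k / n) for k in counts.values())


def total_constraint(rows, cols):
    """Sum of the single-variable uncertainties minus the joint uncertainty."""
    return sum(entropy(rows, [c]) for c in cols) - entropy(rows, cols)


def analyse(rows, names="WXYZ"):
    """Simple contingencies, interactions, and the four-variable term, in bits."""
    terms = {}
    for size in (2, 3, 4):
        for cols in combinations(range(len(names)), size):
            # Subtract every lower-order term contained in this group.
            lower = sum(v for k, v in terms.items() if set(k) < set(cols))
            terms[cols] = total_constraint(rows, cols) - lower
    # "+ 0.0" turns a rounded -0.0 into 0.0 for cleaner printing.
    return {"".join(names[c] for c in k): round(v, 2) + 0.0 for k, v in terms.items()}


# Hypothetical subsets with levels coded 0, 1, 2 (assumed, not from Fig. 1).
SUBSET_A = [(w, x, (w + x) % 3, (w + 2 * x) % 3) for w in range(3) for x in range(3)]
SUBSET_B = [(w, w, y, (w + y) % 3) for w in range(3) for y in range(3)]
SUBSET_C = [(w, w, w, z) for w in range(3) for z in range(3)]

if __name__ == "__main__":
    for label, subset in (("A", SUBSET_A), ("B", SUBSET_B), ("C", SUBSET_C)):
        result = analyse(subset)
        print(label, result, "total:", round(sum(result.values()), 2))
```

The pattern of zero and nonzero entries printed for the three hypothetical subsets can be compared row by row with Table 1.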


Subjects 


All Ss were personnel associated with a 
large VA hospital and included 16 summer 
students and 9 staff members with professional 
degrees. They ranged in age from 15 to 58 
yr. There was a total of 39 Ss, and they were 
assigned randomly to each of three groups 
with the restriction that each group contain 
an equal number of professional staff and 
students, insofar as possible. (A median test 
showed no difference in performance between 
the different kinds of Ss.) Each group of Ss 
was required to learn just one of the three 
subsets of stimuli.


Materials 


The stimulus figures were drawn with black India ink on individual
white pasteboard sheets, 8½ × 11 in. The diameter of the circles and
the sides of both triangles and squares were 6 in. All spaces were 2 in.
in width and were centered. Dots were solid, 3 in. in diameter, and
centered 4 in. below or above an edge. All lines were solid, about
[fraction illegible] in. wide. Lines within the patterns were centered,
and when two were present they were [fraction illegible] in. apart.


Learning Trials 


The Ss were tested either individually or in small groups, depending on
availability. The E stood in front of Ss and held each stimulus card
from a subset for 5 sec., with one stimulus immediately following
another. The order in which the stimuli were presented was
predetermined so that no figure on any trial followed the same figure
that it had on the preceding trial, and each figure was present once as
the first and once as the last in a series of nine trials.
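
One way such presentation orders could be prepared is sketched below. The report does not say how the orders were actually drawn up, so the rejection-sampling approach, the function names, and the seed are assumptions made purely for illustration. The sketch reads the first constraint as "no ordered pair of adjacent figures is repeated from the immediately preceding trial."

```python
import random


def adjacent_pairs(order):
    """Set of (preceding figure, following figure) pairs in one trial's order."""
    return set(zip(order, order[1:]))


def make_orders(n_items=9, seed=0):
    """Build n_items presentation orders, one per trial in a series.

    Each figure is first exactly once and last exactly once across the
    series, and no figure follows the same figure it followed on the
    preceding trial.
    """
    rng = random.Random(seed)
    items = list(range(n_items))
    firsts = items[:]
    rng.shuffle(firsts)
    # Rotating the list of firsts guarantees first != last on every trial.
    lasts = firsts[1:] + firsts[:1]
    orders = []
    for first, last in zip(firsts, lasts):
        middle = [i for i in items if i not in (first, last)]
        while True:
            rng.shuffle(middle)
            order = [first] + middle + [last]
            # Accept only if no adjacency is shared with the preceding trial.
            if not orders or not (adjacent_pairs(order) & adjacent_pairs(orders[-1])):
                orders.append(order)
                break
    return orders


if __name__ == "__main__":
    for trial, order in enumerate(make_orders(), start=1):
        print(trial, order)
```

For practice beyond nine trials, a further series could be generated in the same way, with a check that its first order shares no adjacency with the last order of the preceding series.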

The Ss were told that they were participating in an experiment to see
how fast they could learn to reproduce from memory nine different
diagrams or figures which would be shown to them. The E then described
the four characteristics of the figures and the levels of each, giving
illustrations by using figures not later used in the experiment. The
words “angles,” “spaces,” “lines,” and “dots” were then written on a
blackboard as a reminder of the four characteristics, and these
remained in view throughout the experiment.

The Ss were then told that (a) the figures 
were going to be presented one at a time; 
(b) the order in which they were presented 
would vary; and (c) after they had seen all 
nine on a given trial they were to draw the 
figures from memory in any order. For this 
purpose they were given answer sheets con- 
taining nine blank spaces in three rows of 
three each. The Ss were instructed to draw 
nine different figures each time, guessing if 
necessary. 

A trial consisted of the presentation of a 
complete subset of stimuli followed im- 
mediately by an answer period of 2 to 3 min. 
for Trial 1 and 1.5 to 2.5 min. for subsequent 
trials. At the end of 1.5 min. (2 min. for 
Trial 1) S was urged to complete nine different 
reproductions. At the end of the answer 
period, S covered his answers and was in- 
structed not to look at them again. During 
Trials 1-5 E described each stimulus in terms 
of the four variables as it was presented. 
Practice continued for 20 trials or until S had 
correctly reproduced all nine figures on a 
single trial. 


Measures 


The reproductions of S were scored in the 
following ways: (a) number of trials in order 
to reproduce correctly all nine figures; (b) 
number of correct responses on each trial; and 
(c) the amount of simple contingency be- 
tween pairs of variables in the reproductions, 
without regard to correctness of response. 
This latter measure is simply a matter of 
determining the form of the internal structure 
in each set of nine reproductions in the same 
way that the stimuli themselves are described. 
In determining contingent uncertainties from 
the reproductions, an approximation pro- 
cedure was used to facilitate computation. 
The number of pair coincidences was counted, 
and the total of these was translated into 
contingent uncertainty by a computed
graphical function. 


RESULTS 


Form of structure. —The main re- 
sults pertain to the hypothesis con- 
cerning the effect of form of structure 
on free recall learning. Table 2 shows 
the number of trials required for a 
criterion of nine correct reproductions
for the three groups. Median trials
were used, rather than means, because 
6 of the 13 Ss learning Subset A had 
not learned the stimuli to the criterion 
within the 20 trials allowed. Subset 
A had stimuli with no pair contin- 
gencies, and it is clear that such a 
subset of stimuli is very difficult to 
learn. By contrast, Subset C, with 
maximum simple contingencies, was 
extremely easy to learn, with a median 
of just two trials. In fact, 5 of the 13 
Ss learning Subset C correctly re- 
produced all nine stimuli on Trial 1. 

Analysis of the data in terms of 
number of correct reproductions per 
trial shows equally clearly the great 
difference between these subsets of 
stimuli. This analysis, shown in Fig. 
2, indicates how rapidly learning 
occurs with the high simple contin- 
gencies and how far from complete it 
is even after 20 trials with the zero 
contingencies (Subset A). The evi- 
dence in favor of the hypothesis could 
not be much clearer. 

Individual stimuli.—Three stimuli 
in Subset B were identical to three stimuli in Subset C. If the
characteristics of the individual stimuli are important in free recall
learning, then these three stimuli should have been learned at the same
rate regardless of the subset within which they were imbedded. Analysis
of number of correct reproductions of just these three stimuli was
carried out for each subset separately, and the data obtained are
plotted in Fig. 2 as the open squares and triangles. Since these data
are plotted in terms of percentages of correct responses, direct
comparisons of the learning curves are possible.

TABLE 2
NUMBER OF LEARNING TRIALS TO CRITERION FOR THE THREE SUBSETS OF STIMULI

Subsets   N    Range         Median   Comparison   [statistic illegible]
A         13   9-20+         19       A vs. B      135.5*
B         13   7-19          12       B vs. C      [illegible]
C         13   [illegible]   2

* p = .05.
** p = .01.

[Figure 2 omitted: learning curves for the three subsets over Trials
1-20.]

FIG. 2. Percentages of correct responses as a function of trial for the
various subsets of figures. (The filled points are data for all nine
figures of each subset. The open points are data from just those three
figures in Subsets B and C which were identical. Each point is the mean
for 13 Ss.)

The curves for the three particular 
stimuli follow almost exactly the 
learning curves for the subsets within 
which they were contained and bear 
little relation to each other. The 
evidence could not be much clearer 
that the characteristics of the in- 
dividual stimuli are of little relevance 
in free recall learning of subsets of 
stimuli but rather that the character- 
istics of the entire subset of stimuli are 
important. 

In fact, when it is recalled that one- 
third of the stimuli in Subsets B and 
C were identical, the large difference 
in learning rates for these two subsets 
is even more impressive. Apparently 
what is learned is not the individual 
stimulus but a total set of relations 
between variables which make up the 
stimuli. 

Internal structure in reproductions. — 
These data show that subsets of
stimuli in which there are high simple
contingencies are easy to learn. In 
order to provide some additional 
understanding of the role of this 
factor in free recall learning, the 
amount of simple contingencies (in 
bits) was determined for Ss’ repro- 
ductions on each successive trial; and 
these contingency results are shown 
in Fig. 3. There are several factors of 
interest in these curves. 

First, in order to reproduce cor- 
rectly all nine figures, the reproduc- 
tions must contain the same pattern 


of contingencies as the stimuli themselves
had. But this pattern is simply
a prerequisite condition since it is 
possible to have the same total 
amount of simple contingencies but 
not to have the correct pairing of 
variables. Thus part of the learning 
process involves learning to reproduce 
the correct pattern of contingencies. 

Second, analysis of the contingencies in the reproductions can give us
some idea of what seems natural to Ss, and the data in Fig. 3 clearly
show that Ss produce a very high level of contingencies in their first
few trials. This level, close to the maximum possible, is so high that
there is actually very little learning for Subset C, in which the
maximum possible contingencies are found. But for the other two
subsets, this level is much too high, and in effect Ss must learn to
undo this apparently natural tendency to produce high contingencies
before they can reproduce the stimuli correctly.

[Figure 3 omitted: amount of simple contingencies in the reproduced
subsets, plotted against trial.]

FIG. 3. Simple contingencies in reproduced subsets as a function of
trial. (Each point is the average of the total amount of the simple
contingencies in bits in the subsets as reproduced by S without regard
to correctness of the reproductions. For Subsets B and C, each plotted
point is the average of the 13 Ss' scores. The Ss for Subset A are
divided into seven learners and six nonlearners.)

In order to obtain some idea of 
whether these high contingency values 
are simply the result of random pair- 
ings of variables, we produced 10 
random sets of stimuli, in which it was 
only required that each level of each 
variable occur equally often and that 
all 9 patterns be different. The mean 
contingency value for these 10 ran- 
domly selected patterns was 3.23 bits, 
with a standard error of .18 bits. This 
value is so far below those for early 
trials that it is clear that Ss do not 
just produce random amounts of 
simple contingencies but produce close 
to the maximum possible value. And 
these high values are produced even 
when the stimulus subsets themselves 
contained much lower values. Ap- 
parently these low values of simple 
contingency are contrary to Ss’ normal 
expectations. 
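
The random-pairing baseline described above can be illustrated with a short simulation. This is a sketch under stated assumptions, not a reconstruction of the authors' procedure: the sampling scheme (independent shuffles of each variable, redrawn until all nine patterns differ) and the function names are assumed, and the contingencies are computed exactly rather than by the graphical approximation used for the reproductions. The resulting mean therefore need not equal the 3.23 bits reported.

```python
import random
from collections import Counter
from itertools import combinations
from math import log2


def contingent_uncertainty(xs, ys):
    """Contingent uncertainty U(x:y) in bits for two equal-length columns."""
    n = len(xs)

    def h(values):
        return -sum(k / n * log2(k / n) for k in Counter(values).values())

    return h(xs) + h(ys) - h(list(zip(xs, ys)))


def random_balanced_subset(rng, n_vars=4, n_levels=3, reps=3):
    """Random patterns with each level of each variable occurring equally
    often, redrawn until all patterns are different."""
    base = [level for level in range(n_levels) for _ in range(reps)]
    while True:
        columns = []
        for _ in range(n_vars):
            column = base[:]
            rng.shuffle(column)
            columns.append(column)
        rows = list(zip(*columns))
        if len(set(rows)) == len(rows):
            return rows


def simple_contingency_total(rows):
    """Sum of the contingent uncertainties over all pairs of variables."""
    columns = list(zip(*rows))
    return sum(contingent_uncertainty(a, b) for a, b in combinations(columns, 2))


if __name__ == "__main__":
    rng = random.Random(0)  # arbitrary seed, for repeatability only
    values = [simple_contingency_total(random_balanced_subset(rng)) for _ in range(10)]
    print("mean simple contingency (bits):", sum(values) / len(values))
```

Increasing the number of random subsets well beyond 10 would give a more stable estimate of the chance level against which the early-trial reproductions can be judged.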

The data for Subset A are plotted 
separately for Ss who learned and 
those who did not learn the figures 
within the 20 trials. The nonlearners 
show very little evidence of learning 
at all, and there is the strong sugges- 
tion that some Ss never would learn 
the figures. Actually, while the ex- 
periment was cut off at 20 trials, an 
attempt had been made to continue 
it for these nonlearners. Two of the 
six nonlearners were continued to 30 
trials and had not yet learned. Two 
others refused to continue the experiment
shortly after 20 trials, because
they felt they never would learn the
figures. These stimuli are very diffi- 
cult indeed to learn, and there is the 
suggestion that some Ss cannot deal 
with or conceptualize completely un- 
correlated stimulus variables. 


Discussion 


The results of this experiment leave 
little doubt that the context of inter- 
relationships between variables within a 
subset (internal structure) is critical for 
free recall learning, and that the two hy- 
potheses initially stated are valid: Free 
recall learning is a function of the struc- 
tural characteristics of the entire subset 
of stimuli, not of the individual stimuli; 
and internal structure which exists in the 
form of simple contingencies between 
variables is better for free recall learning 
than are more complex forms of structure. 

Each of these points deserves some 
comment, and we shall do so in reverse 
order. Miller (1958) showed that free 
recall learning is easier for what he called 
redundant strings of letters. He gen- 
erated nonsense words by different 
statistical rules which affected the se- 
quential dependencies between successive 
letters in the words and found that high 
sequential dependencies gave better 
learning. Since the lists which he com- 
pared, however, were of the same length, 
and since the number of different letters 
possible was the same for each list, it is 
clear that his experiment concerned not 
the amount of redundancy but rather its 
form. It is more difficult to state the 
amount of simple contingency in his lists 
since all words were not of the same 
length, but the nature of the differences 
was certainly similar to the differences 
used in the present experiment. 

In this experiment, the amount of 
redundancy was the same in all three sets 
of stimuli, but the amount itself should 
be an important variable for free recall 
learning. Horowitz (1961) compared 
lists of letter trigrams differing in simi- 
larity by Underwood's (1954) definition 
of similarity as the extent to which words 
on the list share the same items. By this 
definition, low similarity is equivalent to 


564 


high redundancy or internal structure 
and vice versa since low similarity lists 
have many different levels or values per 
variable, a fact which means that the set 
of potential stimulus words is very high 
compared to the number of words 
actually used. He found better free 
recall learning in early trials for the high 
similarity (low redundancy) lists. 

In generating his lists, Horowitz used, 
for the low redundancy lists, a form of 
redundancy in which pairs of variables 
(letter positions) were very nearly un- 
correlated. As the present results show, 
such a form is poor for free recall learn- 
ing. His high redundancy lists, on the 
other hand, had 12 different letters; and 
he used all of them in each of his three 
letter positions. Such a procedure means 
that each letter in each position is paired 
uniquely with a letter in each other 
position so that pair contingencies are 
necessarily high. With so many letters 
per position no other relation is possible. 
Yet the net effect is that Horowitz used 
a good form of structure with his high 
redundancy and a poor form for his low 
redundancy. It is almost certain that if 
Horowitz had used a good form of 
structure with his low redundancy lists, 
or even random pairings of letters, he 
would have obtained much larger differ- 
ences between his low and his high 
redundancy lists. 

There is another point that stems from 
Horowitz’ experiment that needs em- 
phasis here. He showed that the rela- 
tions between similarity and learning 
depended on the kind of learning re- 
quired. In the present context, it is 
almost certain that the results we have 
obtained are true for free recall learning, 
but they will probably not be true for 
kinds of learning which involve dis- 
crimination between items. Garner 
(1962) has discussed this problem in 
detail. 

Many other experiments, summarized 
by Garner (1962), have shown that 
discrimination processes depend on the 
total set of stimuli rather than the in- 
dividual stimuli. Klemmer and Loftus 
(1958), for example, showed that identification
of numerals with brief visual
exposures depends on the total set of
forms within which the numerals are 
imbedded. Our experiment shows that 
similar considerations hold for learning 
processes.

It should be emphasized that the 
characteristics of a group of stimuli are 
not simply the sum of the characteristics 
of the individual stimuli but are char- 
acteristics which can exist and be 
specified only for the total subset. Thus 
what is learned is the entire subset. In 
actual fact, the problem must really be 
put in reverse: We cannot specify the 
characteristics of the individual stimulus 
until we know the characteristics of the 
entire subset since the nature of the required
differentiations depends on the
alternative stimuli within the subset. 


SUMMARY 


This experiment tested two hypotheses 
relating free recall learning to the form of the 
internal structure: (a) the ease of free recall 
learning depends not on the characteristics of 
the individual stimuli but on the character- 
istics of the entire subset to be learned; 
(b) when a subset of stimuli is characterized
by simple contingencies between pairs of
variables generating the set, free recall learning
will be easier than when the subset is
characterized by interactions involving three
or more variables.

Three different forms of internal structure 
in subsets of visual figures were compared.
The results showed clear differences in the 
predicted direction and both hypotheses were 
substantiated. 

REFERENCES 

GARNER, W. R. Uncertainty and structure as psychological concepts.
New York: Wiley, 1962.

HOROWITZ, L. M. Free recall and ordering of trigrams. J. exp. Psychol.,
1961, 62, 51-57.

KLEMMER, E. T., & LOFTUS, J. P. Numerals, nonsense forms, and
information. AF Cambridge Res. Cent. tech. Rep., 1958, No. 57-2.
(ASTIA No. AD110063)

MILLER, G. A. Free recall of redundant strings of letters. J. exp.
Psychol., 1958, 56, 485-491.

UNDERWOOD, B. J. Intralist similarity in verbal learning and retention.
Psychol. Rev., 1954, 61, 160-166.

(Received December 27, 1961)


Journal of Experimental Psychology 
1962, Vol. 64, No. 6, 565-571 


MUSCLE TENSION DURING MENTAL WORK 
UNDER SLEEP DEPRIVATION ¹
ROBERT T. WILKINSON
Applied Psychology Research Unit, Cambridge, England 


It has been shown (Wilkinson, 
1958) that there are some tasks which 
most people can perform as well as 
normally under 30 hr. sleep depriva- 
tion and also that there are some 
individuals whose performance seems 
quite unaffected by the stress what- 
ever the task. Is lack of sleep com- 
pletely without effect in these situa- 
tions or is the effect appearing in some 
form which is not being measured? 
It has been suggested (Wilkinson, 
1961) that motivational factors are 
important in deciding whether per- 
formance will be impaired; a man 
appears capable of performing nor- 
mally in spite of loss of sleep if the 
rewards for doing so or the penalties 
for failing to do so are sufficiently 
great. The present hypothesis is that 
this will only be done at the expense 
of extra effort and that electro- 
myographic (EMG) records of mus- 
cular responses may provide some 
indication of this. In this experiment, 
therefore, EMG has been measured 
concurrently with an assessment of 
the effect of loss of sleep on per- 
formance. 


¹ The British Medical Research Council
provided financial support for this research. 
The British Royal Navy contributed Ss and 
technical research assistants. In particular 
R. C. Collecott rendered valuable assistance. 
The work was carried out under the general 
direction of D. E. Broadbent, and P. E. 
Donaldson gave advice on electronic matters. 
All these contributions are gratefully
acknowledged.

A brief review of the main finding of this 
experiment has formed part of a paper in the 
“CIBA Symposium on the Nature of Sleep,” 
the proceedings of which have been published 
by Churchill, London. 


METHOD 


Procedure.—Twelve Ss, enlisted men be- 
tween the ages of 18 and 30, carried out a 
20-min. test of addition twice at an interval 
of 2-4 days, once with sleep and once without. 
The design was balanced for practice effects 
and for the possibility of the two test papers 
being of unequal difficulty. While doing the 
test, and for 2 min. before and after it,
records of muscle tension (EMG) were taken. 

The test—Sitting alone in a cubicle Ss 
were given a sheet of 100 sums and required 
to complete as many as possible in 20 min. 
Each sum comprised five two-digit numbers 
to be added, the total to be written down 
and also spoken into a microphone. At the 
15-min. point of the test E intervened, speak- 
ing to S through a loudspeaker in the cubicle. 
He said, “Now I want you to work faster and 
more accurately, and to help you I will tell 
you the time you take for each sum and 
whether you get it right or wrong.” This 
knowledge of results (KR) was given through- 
out the last 5 min. of the test. On the previ- 
ous day Ss were given a practice run, the 
procedure, including the recording of EMG,
being exactly the same as in the main tests 
except that the run lasted only 10 min. and E 
did not intervene with KR. 

Sleep deprivation.—In their experimental 
test 6 Ss had been without sleep for some 
56 hr. and the other 6 for about 32 hr. All 
were tested in the afternoon and this applied 
also to the control tests after normal sleep. 
The Ss carried out routine duties and some 
other tests while staying awake but were in no 
way overworked apart from the stress im- 
posed by enforced wakefulness. 

EMG recording.—EMG records were taken
from a placement over the pronator teres 
muscle of the left (inactive) forearm, Ss being 
asked to allow the arm to hang loosely by 
their side as they sat at the table doing the 
sums or relaxing. A single-channel machine
of private design was used having an input
impedance of 250 KΩ; pulses reflecting the
integrated output were recorded on one
channel of a tape recorder while the other
channel recorded by microphone the proceedings
in the test cubicle. This record comprised
mainly S's answers to the sums and E's
encouragement and KR.

FIG. 1. Speed of adding with and without sleep. (Abscissa: 5-min. periods of the test: 1st, 2nd, 3rd, and 4th [KR].)

Bipolar sponge
electrodes were used and the skin was abraded 
to give reasonably equal and low resistance 
(between 10 KΩ and 3 KΩ) from each electrode
to the reference electrode on the active fore- 
arm. As each S was tested twice and com- 
parisons made between the levels of EMG on 
each occasion it was essential that the place- 
ment and recording sensitivity should be 
approximately the same each time. To 
achieve this a patch of adhesive tape was 
placed over the proposed site of the electrodes. 
There were two holes in this patch, 1} in. 
apart and yẹ in. in diameter. In preparation 
for the first test the skin was abraded through 
these holes and the sponge electrodes placed 
immediately over them. The adhesive patch 
remained in place until the second test 2 or 4 
days later when the electrodes were again 
placed over the holes and the skin abraded 
where necessary to achieve an electrode-to- 
electrode resistance stabilizing at approxi- 
mately the same level as preceded the first 
test. Each test was preceded and followed 
by 2 min. relaxation when S sat back in his 
chair and rested. EMG records were taken 
throughout and the score of “level of EMG" 
is the ratio of the average EMG during the 
test to the average during the preliminary 
period of relaxation. A further index of 
EMG is that of its variability in any given S
during a test. This score of EMG variability 
reflects the variance (calculated as the 
coefficient of variability) of the minute to 


ROBERT T. WILKINSON 


minute counts of EMG in each of the four 
5-min. periods of the test. 
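
The two EMG indices defined above can be sketched in a few lines. This is only an illustration: the per-minute counts are hypothetical, the function names are not from the report, and the use of the population standard deviation in the coefficient of variability is an assumption, since the report does not say which form was used.

```python
from statistics import mean, pstdev


def emg_level(test_counts, rest_counts):
    """Average EMG count during a test period divided by the average at rest."""
    return mean(test_counts) / mean(rest_counts)


def emg_variability(minute_counts):
    """Coefficient of variability (SD / mean) of the minute-to-minute counts.

    Assumption: population SD; the report does not state which SD was used.
    """
    return pstdev(minute_counts) / mean(minute_counts)


# Hypothetical per-minute EMG pulse counts, for illustration only.
rest = [100, 110]                    # 2-min. preliminary relaxation
period = [180, 200, 210, 190, 220]   # one 5-min. period of the test

level = emg_level(period, rest)
variability = emg_variability(period)
print(round(level, 3), round(variability, 3))
```

A level above 1 means that EMG during work exceeded the resting value; the variability score is unit-free, so it can be compared between Ss whose absolute counts differ.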

Statistical treatment.—Significance of single
means was tested by Wilcoxon’s matched-pairs
signed-ranks test, and differences between
means by the Mann-Whitney U test.
Kendall’s rank correlation coefficient (τ) was
used for all correlations. All these procedures 
are described by Siegel (1956). All signifi- 
cance levels refer to two-tail assessments 
except where otherwise stated. 


RESULTS 


This section will give the results and 
their immediate implications; in the 
following section more general im- 
plications will be considered. The 
analysis that follows will be concerned 
mainly with the period of No KR (the 
first 15 min. of the test), but in addi- 
tion we shall consider the changes 
that occurred when this feedback was 
added in the last 5 min. Finally, 
attention will be drawn to a possible 
predictor of the degree to which 
individual performance will be im- 
paired under sleep deprivation. 

Period of No KR.—During the first 
15 min. of No KR sleep deprivation 
had no effect upon errors but it reduced
the number of sums done
(Fig. 1). This result was significant
(P < .01) when the Practice × Order
interactions were corrected for as
follows: half the Ss carried out their


TABLE 1

EMG LEVEL AND EMG VARIABILITY
WITH AND WITHOUT SLEEP

5-Min. Test     EMG Level a          EMG Variability b
Periods         Sleep   No Sleep     Sleep   No Sleep
1 (No KR)       1.88    1.95         .119    (illegible)
2 (No KR)       1.77    1.56         .205    (illegible)
3 (No KR)       1.63    1.82         .137    (illegible)
4 (KR)          2.88    2.01         .244    (illegible)

a EMG level is the average EMG count during a
given 5-min. test period divided by the average EMG
during the preliminary 2-min. relaxation.
b EMG variability is the coefficient of variability
of the minute to minute counts of EMG.


MUSCLE TENSION DURING MENTAL WORK 


first test under sleep deprivation and 
the second with normal sleep; for the 
other half the order of the conditions 
was reversed. All improved with 
practice from first to second test and 
the null hypothesis was that if sleep 
deprivation had no effect the practice 
effects of the two groups of Ss would 
not differ. Lack of sleep had little 
effect upon the level of EMG but it 
increased EMG variability (P < .02). 
These trends are shown in Table 1. 

To examine concurrent trends of 
performance and EMG, Ss were 
ranked in order of impaired perform- 
ance and of increased EMG due to 
loss of sleep, and the correlation of the 
two rankings was assessed. This 
operation had to be performed sepa- 
rately on each of the four teams of 3 Ss 
treated alike with respect to order and 
degree of sleep deprivation. The 
combined significance of the correla- 
tions was then assessed on a permuta- 
tional basis to give an overall level of 
significance over all 12 Ss. To explain
this further, there are six possible
combinations of the rankings of two
sets of three scores. We have four
teams in each of which any of these six
combinations may occur. Over all
four teams there are then 6⁴ = 1296
possible combinations of rankings. If 
we emerge with a combination of 
rankings whose correlations are pre- 
dominantly negative, for example 
—1.0, —1.0, —0.33, and —0.33, in the 
four teams we can calculate the 
number of combinations out of the 
whole 1296 which are as negative as 
this or more negative. There are 41 
in this case. The one-tailed prob- 
ability of a negative combination as 
great or greater than this is then 
41/1296 or .031. 
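
The counting argument above can be checked by brute-force enumeration. The sketch below assumes that "as negative as this or more negative" means that the sum of the four team correlations is no greater than the observed sum; the text does not state the criterion explicitly, but this reading reproduces the figure of 41 out of 1296. The function name is illustrative only.

```python
from itertools import permutations, product


def kendall_s(ranking):
    """Concordant minus discordant pairs of `ranking` against the order 1, 2, 3."""
    n = len(ranking)
    s = 0
    for i in range(n):
        for j in range(i + 1, n):
            s += 1 if ranking[j] > ranking[i] else -1
    return s


# S for each of the six ways of ranking three Ss; for three scores tau = S / 3.
s_values = [kendall_s(p) for p in permutations((1, 2, 3))]

# Observed correlations -1.0, -1.0, -0.33, -0.33 correspond to S = -3, -3, -1, -1.
observed_total = -3 - 3 - 1 - 1

# Every combination of outcomes over the four teams of three Ss.
outcomes = list(product(s_values, repeat=4))
as_extreme = sum(1 for combo in outcomes if sum(combo) <= observed_total)

print(len(outcomes), as_extreme, round(as_extreme / len(outcomes), 3))
```

Working with the integer S rather than with τ itself avoids rounding problems when sums are compared with the observed total.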

FIG. 2. Increase in EMG level due to sleep deprivation, i.e., Log10 (No Sleep EMG level/Sleep EMG level), in three groups of Ss showing the least, the most, and an intermediate impairment of performance due to sleep deprivation. (Abscissa: 5-min. periods of the test, with knowledge of results given in the 4th; ordinate: increase in EMG level due to sleep deprivation.)

The negative correlations which
emerged were almost all significant.
Increased level of EMG due to loss of
sleep correlated negatively with impaired
performance in terms of both
speed (P = .031) and accuracy
(P = .094). Similarly increased variability
of EMG under sleep deprivation
correlated negatively with reduced
speed (P = .061) and reduced
accuracy (P = .007). Thus when
sleep was lost those Ss whose per- 
formance was impaired least were the 
ones whose EMG was raised most and 
this holds good whether we correlate 
speed or accuracy of performance with 
either level or variability of EMG. 
To illustrate this (in terms of speed 
only) Ss have been divided into three 
groups containing the members of 
each team showing the least, the most, 
and an intermediate impairment of 
performance due to lack of sleep in 
the first 15 min. of the test. The 
tendency for these groups to show 
increased EMG as a result of losing 
sleep can be seen in terms of level of 
EMG in Fig. 2 and its variability in 
Fig. 3. Clearly there is an almost
complete separation of the three performance
groups in the extent to
which their muscle tension rose as a
result of working without sleep.

FIG. 3. Increase in EMG variability due to sleep deprivation, i.e., Log10 (No Sleep EMG variability/Sleep EMG variability), in Ss with the least, the most, and an intermediate impairment of performance due to sleep deprivation. (Abscissa: 5-min. periods of the test, with knowledge of results given in the 4th; ordinate: increase in EMG variability due to sleep deprivation.)

These results seem very clear in this 
particular context, but they should be 
considered with due regard to the 
limitations of the experiment. They 
apply to only one form of activity. 
Only one physiological measure was 
taken, the EMG, and this was re- 
corded from only one site. The con- 
clusions which follow immediately and 
in later discussion should be regarded 
therefore as topics for confirmatory 
experiment rather than firm proposi- 
tions. There are two immediate con- 
clusions. The first is that although 
some men may be able to forego sleep 
and perform as well as normally on 
tasks of the present nature, this per- 
formance may be accompanied by 
abnormally high levels of muscle ten- 
sion. Secondly, we may recall that 
Edwards (1941) concluded from inci- 
dental observation that work under 
sleep deprivation is accompanied by 
abnormally high expenditure of effort.
If we can assume that higher and more 
variable EMG is a sign of such effort 
the present result may provide more 
direct experimental evidence for this. 
A possible corollary is that sleep de- 
prived men may be more uniformly 
inefficient than has been thought 
hitherto if we interpret efficiency in a
mechanical sense as being the ratio of 
output to input. Previous implica- 
tions have often been that where per- 
formance is maintained, so also is 
efficiency. This may only be true if 
effort remains the same also, and 
present results suggest that this is not 
always the case. Where output was 
maintained effort or input as judged 
by the EMG, was often higher. In 
such cases sleep deprivation may be 
reducing efficiency no less than when 
no extra effort is made and output 
falls. 

Period of KR.—Errors showed no 
important changes as a result either of 
adding KR, or of sleep deprivation 
when this feedback was present. 
Performance is discussed therefore in
terms of speed only. 

When KR was given in the last 5 
min. there ceased to be any difference 
between sleep deprived and normal 
performance (Fig. 1). Previous work 
(Wilkinson, 1961) has led us to expect 
this, but in the present experiment the 
result was brought about in an un- 
usual way. When KR was added 
under sleep deprivation it raised EMG 
moderately and improved perform- 
ance. When it was added after 
normal sleep, however, it raised EMG 
much more (P < .01) and this was 
accompanied by a deterioration in
performance. These changes can be
seen in Table 1 and Fig. 1. Now
Stennett (1957) has shown that, beyond
a certain point, increases in
EMG may lower performance rather
than improve it and it seems reasonable
to account in this way for the
decline in performance among the 
sleepers when KR was added. They 
became overtense. In the circum- 
stances it is not surprising to find that 
the negative correlation of the first 
15 min. between impaired perform- 
ance and increased EMG under sleep 
deprivation was reduced almost to 
zero when KR was added in the last 
5 min. The lesson from this is that 
we should be careful not to generalize 
too far from results obtained in 
a relatively unstimulating situation 
(like the first 15 min. of the present 
test) to one in which incentives make 
S anxious to do well. In terms of 
efficiency as defined above the non- 
sleepers were no longer at a dis- 
advantage when KR was added; in- 
deed it could be argued that they were 
more efficient than the sleepers, for 
they performed as well and their EMG 
was lower. Clearly research of the 
present nature must be extended to 
more stimulating tasks. 

Prediction of individual impairment 
from EMG under normal conditions.— 
Subjects may be ranked in terms of 
the ratio of their working level of 
EMG to that of their preliminary 
2-min. period of relaxation, which, 
indeed, has been the index of level of 
EMG throughout. Three independ- 
ent measures of this kind were ob- 
tained from each S, the first from the 
initial practice test and the second and 
third from the two main tests, one 
with and one without sleep. These 
three measures are in considerable 
agreement in their rankings of Ss, 
Kendall's coefficient of concordance 
being .67 and significant (P < .03).
This suggests that the extent to which 
EMG rises in the transition from 
resting to working varies consistently 
from person to person. Table 2 
summarizes the results of correlating 
these three assessments of this parameter
with impairment of performance
due to lack of sleep in the periods
of No KR, of KR, and in the two 
combined, that is the whole test. 

All the correlations involving speed 
of performance are negative, most of 
them significantly so. The main test 
after normal sleep is a reliable pre- 
dictor, but the data concerning the 
practice test are the most interesting 
in that this measure was a truly pre- 
dictive one, that is its results were 
quite independent of those to be 
predicted, namely the effect of lack of 
sleep on individuals. Unfortunately 
this value of the practice test as a 
predictive measure appeared only 
when all the results were analyzed. 
It was administered as no more than a 
practice run and with less care over 
EMG recording than was exercised in 
the main tests. But in spite of this it 
yields values of working-to-resting 
EMG ratio which predict in advance 
the impairment of speed of perform- 
ance under sleep deprivation with 
fair accuracy and at nearly the .05
level of significance. The rankings
also correlate (τ = .44, P = .023)
with those of the highly predictive
main test carried out under normal
sleep.

TABLE 2

CORRELATIONS (τ) OF THREE MEASURES OF
WORKING-TO-RESTING RATIO OF EMG
WITH THREE INDICES OF IMPAIRED
PERFORMANCE (SPEED) UNDER
SLEEP DEPRIVATION

                                   Impaired Performance
Working-to-Resting        No KR          KR             Whole Test
Ratio of EMG              τ      P       τ      P       τ      P
Main test (with sleep)    .55    .006    .53    .008    .67    (illegible)
Main test (no sleep)      (illegible)
Practice                  .24    .13     .33    .06     .36    (illegible)

Note.—All τ coefficients are negative.

In short there seems good
reason to believe that if a careful
preliminary assessment of the ratio of 
working-to-resting EMG is made this 
index should predict the degree to 
which performance will be impaired 
by lack of sleep in any subsequent 
performance of the task, the higher 
the ratio the less the impairment. 


DISCUSSION


Muscle tension (EMG) is one of a 
number of physiological measures which 
are sometimes (Malmo, 1959) thought to 
reflect the level of arousal of the body as 
defined by Duffy (1957), Lindsley (1951), 
and Hebb (1955). Other possible meas- 
ures include pulse, respiration and meta- 
bolic rates, skin conductance, urinary 
excretion of catechol amines, and alpha 
depression in the EEG. When these are 
recorded under sleep deprivation their 
levels are sometimes higher than normal 
(Freeman, 1932; Hasselman, Schaff, & 
Metz, 1960; Laird & Wheeler, 1926; 
Malmo & Surwillo, 1960; Tyler, Good- 
man, & Rothman, 1947) and sometimes 
lower (Armington & Mitnick, 1959; Ax 
& Luby, 1961; Bjerner, 1949). Similar 
contrasts occurred with EMG in the 
present experiment. The fact that increased
EMG under the stress correlated
positively with maintained performance
suggests that this, and perhaps other
physiological indices, may rise under
sleep deprivation as the experimental 
situation is stimulating and provokes 
effort. If we examine the conditions 
under which physiological measures were 
taken in previous experiments the im- 
pression is reinforced; where levels in- 
creased the Ss were usually engaged in 
relatively stimulating tasks; where they 
fell the tasks appear less stimulating or 
else the Ss were merely sitting passively.

If these physiological indices reflect 
the level of arousal we must conclude 
with Malmo and Surwillo (1960) that 
sleep deprivation can either raise or lower 
arousal according to the situation in 
which the S is placed during recording. 
But do they? With No KR in the 
present experiment performance was impaired
under sleep deprivation but the
level of EMG was unchanged (Fig. 1 and 
Table 1); if this implies unchanged 
arousal the relationship between arousal 
and performance is broken. Similarly 
with KR different levels of EMG accom- 
panied the same level of performance 
with and without sleep. Perhaps if we 
wish to retain the inverted-U relationship
between arousal and performance (Hebb, 
1955) we must sacrifice the notion that 
EMG level always reflects the level of 
arousal. In particular when sleep has 
been lost it seems likely that higher levels 
of EMG are required for given levels of 
arousal. This suggests an explanation of 
the abnormally high levels of the so- 
called arousal measures which occurred 
under certain circumstances in the 
present and other experiments: they may 
reflect, not raised arousal, but the effort 
associated with maintaining normal 
arousal and customary standards of per- 
formance in face of the influence of sleep 
deprivation per se which may be always 
towards lowered arousal. 


SUMMARY 


Twelve Ss performed a 20-min. test of 
addition, once after normal sleep and once 
under 32-56 hr. sleep deprivation. Records of 
muscle tension (EMG) were taken from the 
inactive arm. The Ss who maintained per- 
formance best under the stress showed the 
greatest rise in EMG over normal levels. 
Knowledge of results disturbed this relation- 
ship. An independent measure of EMG 
taken under normal conditions predicted 
those Ss whose performance was impaired. 
Sleep deprivation may cause inefficiency even 
in Ss who maintain performance if their 
raised EMG reflects greater effort or energy 
expenditure; this may be the cost of maintain- 
ing normal levels of arousal and performance 
in face of the depressing influence of sleep 
deprivation per se. 


REFERENCES 

ARMINGTON, J. C., & MITNICK, L. L. Electroencephalogram
and sleep deprivation.
J. appl. Physiol., 1959, 14, 247-250.

AX, A., & LUBY, E. D. Autonomic responses
to sleep deprivation. Arch. gen. Psychiat.,
1961, 4, 55-59.

BJERNER, B. Alpha depression and lowered
pulse rate during delayed actions in a serial
reaction test. Acta physiol. Scand., Stockholm,
1949, 19(Suppl. No. 65).

DUFFY, E. The psychological significance of
the concept of “arousal” or “activation.”
Psychol. Rev., 1957, 64, 265-275.

EDWARDS, A. S. Effects of the loss of 100
hours of sleep. Amer. J. Psychol., 1941,
54, 80-91.

FREEMAN, G. L. Compensatory reinforcements
of muscular tension subsequent to
sleep loss. J. exp. Psychol., 1932, 15, 267-283.

HASSELMAN, M., SCHAFF, G., & METZ, B.
Respective influences of work, ambient
temperature and sleep deprivation on the
urinary excretion of catechol amines in the
normal man. CR Soc. Biol., Paris, 1960,
154, 197-201.

HEBB, D. O. Drives and the C.N.S. (conceptual
nervous system). Psychol. Rev.,
1955, 62, 243-254.

LAIRD, D. A., & WHEELER, W. What it costs
to lose sleep. Industr. Psychol., 1926, 1,
694-696.

LINDSLEY, D. B. Emotion. In S. S. Stevens
(Ed.), Handbook of experimental psychology.
New York: Wiley, 1951. Pp. 473-516.

MALMO, R. B. Activation: A neuropsychological
dimension. Psychol. Rev., 1959,
66, 367-386.

MALMO, R. B., & SURWILLO, W. W. Sleep
deprivation: Changes in performance and
physiological indicants of activation. Psychol.
Monogr., 1960, 74(15, Whole No. 502).

SIEGEL, S. Nonparametric statistics. New
York: McGraw-Hill, 1956.

STENNETT, R. G. The relationship of performance
level to level of arousal. J. exp.
Psychol., 1957, 54, 54-61.

TYLER, D. B., GOODMAN, J., & ROTHMAN, T.
The effect of experimental insomnia on the


1961, 62, 263-271. 


(Received September 5, 1961) 


Journal of Experimental Psychology 
1962, Vol. 64, No. 6, 572-579 


THE PARASITIC REINFORCEMENT OF VERBAL 
ASSOCIATIVE RESPONSES ! 


W. D. KINCAID, JR.,2 W. A. BOUSFIELD, AND G. A. WHITMARSH 3


University of Connecticut 


In recent papers Bousfield, Cohen, 
and Whitmarsh (1958b), and Bous- 
field, Whitmarsh, and Danick (1958) 
have attempted to account for the 
phenomenon of verbal stimulus gen- 
eralization on the basis of the overlap 
of verbal associative responses elicited 
by the given words. Their basic ap- 
proach rests upon the assumption that 
the presentation of a meaningful stim- 
ulus word leads to the elicitation of a 
composite of implicit verbal associa- 
tive responses. They reason that the 
verbal conditioning of a response to a 
stimulus word involves not only the 
conditioning to the stimulus word, 
but also a simultaneous conditioning 
to the composite of verbal associative 
responses to that word. Thus, during 
conditioning trials, members of the 
associative response composite are 
involved in the learning process 
through higher-order conditioning.
The term parasitic reinforcement may 
be used to describe this concurrent 
conditioning of the members of the 
composite of verbal associative re- 
sponses. This term was introduced 
by Morgan and Underwood (1950) to 
explain the following phenomenon. 
After the learning of a given verbal 
response, B, to a stimulus word, A, the 
synonyms of B will have a greater 


1 This paper is based on Technical Report 
No. 32 under Contract Nonr-631 (00) between 
the Office of Naval Research and the Uni- 
versity of Connecticut. Reproduction in 
whole or in part is permitted for any purpose 
of the United States Government. 

2 Now at Androscoggin Mental Health
Clinic, Lewiston, Maine. 

3 Now at the Springfield State Hospital, 
Sykesville, Maryland. 


than chance probability of subse- 
quently being elicited by A. Bous- 
field, Whitmarsh, and Danick (1958) 
extended the concept of parasitic rein- 
forcement to include the aggregate of 
verbal associative responses to a given 
stimulus word. Support for the as- 
sumption underlying the concept of 
parasitic reinforcement was found in 
the fact that the degree to which an 
observable response which had been 
conditioned to one word was elicited 
by the presentation of a second word 
was a function of the verbal associa- 
tive responses common to the two 
stimulus words. Studies by Cohen 
(1958) and by Whitmarsh and Bous- 
field (1961) have replicated these find- 
ings, and have shown them to be 
independent of a specific technique 
used to measure generalization. While 
the theoretical rationale introduced to 
account for the contribution of asso- 
ciative responses to generalization as- 
sumed the conditioning of the implicit 
verbal associates of the first stimulus 
word to the observable response, 
these studies provide only indirect 
support for this assumption. 

The present study was undertaken 
to test the deduction that after paired- 
associate learning the associates of the 
learned response word may also be 
elicited by the stimulus item of the 
learned pair. Specifically we wished 
to test the following hypothesis: the 
paired-associate learning of a mean- 
ingful response word to a nonsense- 
syllable stimulus has the consequence 
of establishing connections between 
the nonsense syllable and the members 
of a group of verbal associative responses
to the learned response word.

TABLE 1

MEANINGFUL RESPONSE WORDS USED IN TRAINING
AND THEIR TESTED ASSOCIATES: EXP. I

Words Used as      Associates of the Learned Responses and their
Responses in       Cultural Frequencies of Occurrence
Training           High             Medium           Low
ANIMAL             Dog       81     Cat       17     Man        5
ICE                Cold      93     Water     13     Cream     12
LETTUCE            Tomato    38     Green     18     Leaf      11
MOSQUITO           Bite      67     Bug       26     Insect    13
PETAL              Flower    39     Rose      16     Leaf      11
RAYON              Silk      36     Nylon     25     Material  (illegible)
TABLE              Chair     45     Write      9     Office     5
TIN                Can       72     Metal     29     Roof       9
TYPHOID            Fever     74     Disease   22     Sickness   7
WAGON              Wheels    76     Train     12     Red        4

Mean                         62.1             18.7              8.4

Note.—See text for definition of high, medium, and low grouping.

For example, if the word RAYON were 
learned as a response to the nonsense 
syllable GOX, we should expect to find 
evidence of acquired connections between
GOX and the associative responses
to RAYON, as for example,
Silk, Nylon, Material, and Soft. The 
relative strengths of the members of 
the composite of verbal associative 
responses to a given word may be 
measured from their cultural fre- 
quencies of occurrence as responses to 
that word in free associational norms 
of the Minnesota type (Russell & 
Jenkins, 1954). Our second experi- 
mental hypothesis concerns the rela- 
tion between cultural habit strengths 
of the associates of a given word and 
their susceptibility to conditioning: 
the strength of the connections established
between the nonsense syllable
and the associates of the learned
response word is an increasing func- 
tion of the cultural habit strengths 
of the associates as responses to the 
learned response word. 


EXPERIMENT I
Method 


The materials for the initial learning were 
10 pairs of nonsense syllables and meaningful 


words. The following 10 nonsense syllables 
were selected from the Glaze (1928) list on the 
basis of their having association values rang- 
ing from 0 to 47%: GOX, HAJ, MUP, NID, QOL, 
RUC, SIW, VEK, YEF, and ZAB. The 10 meaningful
words, which are listed in Table 1,
were selected from a list of 150 words for 
which free-associational norms had been com- 
piled from a population of 150 Ss. Three 
different randomized pairings of these items 
were then prepared so that no nonsense 
syllable was paired with the same word more 
than once. The items for the testing phase 
of the experiment were free associational 
responses to the 10 response members of the 
initial learning pairs selected on the following 
basis. The 150 associational responses to 
each of the 10 learned words were divided 
into tertiles on the basis of their cultural 
frequencies of occurrence in the normative 
data. The associates in each of these three 
groups then comprised the pools of low-, 
medium-, and high-frequency associates. 
In choosing associates for the testing phase of 
the experiment the restriction was imposed 
that a chosen associate to a given word should 
not appear in the gradient of associational 
responses to any of the other 9 learned words. 
Within this restriction, the associate having 
the highest frequency in each frequency 
group was chosen as one of the three test 
words for a given learned word. The three 
associates thus chosen for each of the 10 
learned words are listed in Table 1 along with 
their corresponding cultural frequencies of 
occurrence as associates to the learned word. 
It may be noted that Leaf appears as a low-frequency
associate to both LETTUCE and PETAL. This violation
of the restriction imposed in the selection of associates was
necessitated by the relative lack of degrees of 
freedom imposed by the limited pool of 150 
stimulus words. 
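
As a rough consistency check on the frequencies listed in Table 1, the column means can be recomputed from the individual entries. The sketch below is illustrative only. The values are transcribed from the table; the low-frequency entry for Material is not legible in this copy, so the code merely shows what value the reported mean of 8.4 would imply, on the assumption that the other nine low-frequency entries and the mean itself are transcribed correctly.

```python
# Cultural frequencies transcribed from Table 1 (out of 150 normative Ss).
high = [81, 93, 38, 67, 39, 36, 45, 72, 74, 76]
medium = [17, 13, 18, 26, 16, 25, 9, 29, 22, 12]
# Low-frequency column with the illegible entry (Material) left out.
low_legible = [5, 12, 11, 13, 11, 5, 9, 7, 4]

high_mean = sum(high) / len(high)
medium_mean = sum(medium) / len(medium)

# Assumption: the reported low-frequency mean of 8.4 is exact to one decimal,
# so the ten low-frequency entries should sum to 84.
reported_low_mean = 8.4
implied_missing = round(reported_low_mean * 10) - sum(low_legible)

print(high_mean, medium_mean, implied_missing)
```

The high and medium columns agree with the reported means of 62.1 and 18.7. The implied low-frequency value is an inference from the mean, not a reading of the table.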

The Ss were 140 undergraduate students 
who were trained and tested in three groups 
comprising 51, 44, and 45 Ss, respectively. 
Each group received a different randomization 
of the paired-associate lists for learning. The 
items were presented one at a time to Ss by 
means of a Selectroslide projector set for 
exposures of 2.5 sec. The instructions and 
procedure employed for the paired-associate 
learning were the same as those devised by 
Cohen (1958) for group-method experiments. 
Eight learning trials were administered to all 
Ss. On alternate trials Ss were asked to 
anticipate and write in a booklet provided 
for this purpose the response member of the 
pair when shown the nonsense syllable. They 
were then verbally presented with the correct 
word. A total of 27 Ss failed to reach the 
criterion of all correct anticipations on the last 
trial and were therefore dropped. This re- 
sulted in Ns of 43, 33, and 37 Ss, respectively, 
for the three experimental groups. 

After the initial learning, E proceeded
immediately to the testing phase. A pilot 
study had demonstrated that the procedure 
of simply presenting the nonsense syllable with 
instructions for free association was effective 
in eliciting associates to the learned words in 
only 40% of the cases. This consideration led 
to the development of an alternative pro- 
cedure. The S was given 10 data sheets, in 
booklet form, one sheet for each of the non- 
sense syllables of the training phase of the 
experiment. The nonsense syllable was fol- 
lowed by five words, one of which was one of 
the three chosen associates of the learned 
word. For example, for Ss who learned the
pair GOX-RAYON, one of the five words was
Material, a low-frequency associate of RAYON.
The remaining four control words were 
selected at random from a dictionary with the 
restriction that they should not appear as an 
associate of any one of the 10 words used in 
the learning. The S was instructed to check
the one word which he felt to be “most related
to the nonsense syllable.” 4 The order of the
syllables in the booklet was randomized between
Ss as was the position of the critical
associate among the four control words.

4 In a subsequent study these instructions
were rephrased so that Ss were asked to
check the one word which the stimulus item
“most makes you think of.” There was no
evidence to indicate the choices of Ss were
altered by this change in the instructions.

Each group of Ss received booklets containing 
high-, medium-, and low-frequency associates 
distributed among the 10 nonsense syllables 
and among the three groups of Ss in a counter- 
balanced design. This design required the 
use of 120 control words. 

With the forced-choice test instructions it 
might be supposed that the probability of 
selecting any one of the five choices would 
be .2. Such an assumption, however, ap- 
peared unsafe in view of the possibility that an 
alternative might be selected on the basis of 
extraneous factors such as phonetographic 
similarity to the nonsense syllable. It ap- 
peared advisable, therefore, to obtain what 
may be called base frequency data from a 
group of control Ss. For this purpose three 
control groups of undergraduates comprising 
51, 33, and 34 Ss, respectively, were presented 
the test booklets and the same instructions as 
were given to the three groups of experimental 
Ss. Thus, each of the 30 forced-choice 
association tests, i.e., 10 for the low-, 10 for 
the medium-, and 10 for the high-frequency 
responses, was taken by 51, 33, or 34 Ss. 


Results 


The first step in the treatment of 
the data was that of tabulating the 
total frequency with which each of the 
30 associates used in the testing was 
selected as most related to its asso- 
ciated syllable. For example, the pair 
MUP-MOSQUITO appeared in the initial 
learning. In the testing situation the 
group of 43 Ss who had received this 
pair for learning was presented with 
MUP and asked to select the word most 
related to Mup from five alternatives 
comprising Insect, the low frequency 
associate of MOSQUITO, and the control 
words Knife, Field, Crazy, and Word. 
In view of the predicted facilitation 
of the associative responses to MOS 
QUITO, the checking of Insect as the 
preferred alternative was for con- 
venience labeled “correct.” Control 
word choices were designated as “‘in- 
correct.” In these terms all 43 of the 
experimental Ss who were presented 
with this set of choices gave “correct 
responses, whereas 17 of the 51 control 
Ss who received the same test gave the 


VERBAL ASSOCIATIVE RESPONSES 




TABLE 2 

PROPORTION OF EXPERIMENTAL (E) AND CONTROL (C) Ss SELECTING EACH ASSOCIATE 
AND THE DIFFERENCES (E—C) BETWEEN THESE PROPORTIONS: Exp. I 

                    High                   Medium                   Low 
Word          E      C     E—C       E      C     E—C       E      C     E—C 
ANIMAL       .953   .196   .757     .973   .238   .735     .939   .303   .636 
ICE         1.000   .157   .843     .784   .206   .578     .758   .152   .606 
LETTUCE      .891   .238   .653     .970   .273   .697     .907   .118   .789 
MOSQUITO     .865   .206   .659     .939   .121   .818    1.000   .333   .667 
PETAL        .953   .294   .659     .838   .176   .662     .818   .121   .697 
RAYON        .865   .118   .747     .970   .121   .849     .977   .235   .742 
TABLE        .838   .176   .662     .424   .121   .303     .814   .160   .654 
TIN          .730   .059   .671     .953   .260   .693     .909   .182   .727 
TYPHOID      .909   .212   .697    1.000   .314   .686     .973   .294   .679 
WAGON        .939   .303   .636     .953   .140   .813     .838   .235   .603 
Mean                       .698                   .683                   .680 

Note.—All E—C values are significant in the predicted direction at less than the .01 level. 

so-called “correct” responses and 34 
gave responses labeled “incorrect.” A 
chi square analysis of these data with 
a two-tailed test indicates that the 
difference between experimental and 
control group responses is significant 
beyond the .01 level in the direction 
predicted by the experimental hy- 
pothesis. A similar treatment of the 
experimental and control group data 
for the remaining 29 associates indi- 
cated that all differences were signifi- 
cant beyond the .01 level in the pre- 
dicted direction. 

The next step taken in the analysis 
of the data was that of determining 
the nature of the relationship between 
the cultural frequencies of the asso- 
ciates as represented in the three 
groups of high, medium, and low on 
the one hand, and the extent to which 
these associates were facilitated in 
the testing phase of the experiment 
on the other. 
The mean cultural frequencies of these 
associates, based on the normative 
population of 150 Ss, were, respect- 
ively, 62.1, 18.7, and 8.4. The predic- 
tion was that the number of responses 
labeled correct by the experimental 
Ss should be greatest for the high- 
frequency associates and least for 
those of low frequencies. The follow- 
ing steps were taken in this analysis. 
First, the proportion of Ss who gave 
the so-called correct responses was 
determined for each of the 30 associ- 
ates listed earlier in Table 1. These 
proportions appear in Table 2, and are 
listed in Column E for the experi- 
mental Ss and in Column C for the 
control Ss who supplied the base 
frequency data. Thus, the high-, 
medium-, and low-frequency associ- 
ates of ANIMAL were, respectively, Dog, 
Cat, and Man. Table 2, Column E, 
shows that the proportion of experi- 
mental Ss who checked Dog as related 
to the nonsense syllable which had 
been paired previously with ANIMAL 
was .953. The proportion of control 
Ss, Column C, who selected Dog was 
.196. The difference between these 
proportions, .757, is listed in Column 
E—C. This difference may be said 
to represent the effect of learning. 
As indicated earlier, this difference is 
significant. The means of these ad- 
justed proportions for the high, me- 
dium, and low associates are, respect- 
ively, .698, .683, and .680. Three CR 


576 W. D. KINCAID, JR., W. A. BOUSFIELD, AND G. A. WHITMARSH 


tests of the differences between the 
proportions were performed for these 
three adjusted means. The differ- 
ences between the means of the ad- 
justed proportions for the high vs. 
medium, medium vs. low, and high vs. 
low groups are .015, .003, and .018, 
respectively. These mean differences 
do not differ significantly. Thus, 
while the findings of Exp. I support 
the first hypothesis, the variation in 
strength of the associates of the re- 
sponse words used in the initial train- 
ing did not prove to be a significant 
parameter as predicted in the second 
hypothesis. 
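
The item-level comparison described above for the pair MUP-MOSQUITO can be sketched as follows. This is an illustration only: the counts (43 of 43 experimental Ss and 17 of 51 control Ss choosing Insect) come from the text, but the use of the uncorrected Pearson formula for a 2 X 2 table is an assumption, since the report does not say whether a continuity correction was applied. The function name and the constant CRITICAL_01 are hypothetical.

```python
def chi_square_2x2(a, b, c, d):
    """Pearson chi square (1 df, no continuity correction) for [[a, b], [c, d]]."""
    n = a + b + c + d
    numerator = n * (a * d - b * c) ** 2
    denominator = (a + b) * (c + d) * (a + c) * (b + d)
    return numerator / denominator

# Counts for the MUP test item (associate: Insect), taken from the text.
exp_correct, exp_incorrect = 43, 0     # experimental Ss
ctl_correct, ctl_incorrect = 17, 34    # control Ss

p_e = exp_correct / (exp_correct + exp_incorrect)   # Column E
p_c = ctl_correct / (ctl_correct + ctl_incorrect)   # Column C
e_minus_c = p_e - p_c                               # Column E-C

chi2 = chi_square_2x2(exp_correct, exp_incorrect, ctl_correct, ctl_incorrect)
CRITICAL_01 = 6.635   # tabled chi square value for 1 df at the .01 level

print(f"E = {p_e:.3f}, C = {p_c:.3f}, E-C = {e_minus_c:.3f}")
print(f"chi square = {chi2:.2f}; beyond the .01 level: {chi2 > CRITICAL_01}")
```

Under these assumptions the difference E-C reproduces the Low entry for MOSQUITO in Table 2 (.667), and the chi square value lies well beyond the tabled .01 value.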


EXPERIMENT II 


In light of the unexpectedly strong 
effects of the so-called low-frequency 
associates in Exp. I, Exp. II was 
undertaken to extend the range of 
cultural frequencies tested to associ- 
ates occurring only once in the norma- 
tive population of 150 Ss. 


The same nonsense-syllable meaningful- 
word pairs used in Exp. I were learned by the 
136 undergraduate Ss participating in Exp. II. 
The Ss were trained and tested in three groups 
of 43, 49, and 44 Ss, respectively. The 
critical associates tested in Exp. II were 
divided into three groups: one of 10 associates 
with cultural frequencies of either 3 or 4 
which is designated the Low group, and two 
groups of 10 associates, each associate having 
a cultural frequency of 1. These groups were 
designated, respectively, Low-Low A and 
Low-Low B. The associates having the fre- 
quency of 1 were selected at random from the 
normative data with the restriction that they 
did not appear among the free associates to the 
other 19 words in the two Low-Low lists. The 
same forced-choice test procedure, counter- 
balanced experimental design, and instruc- 
tions used in Exp. I were employed. The 
items used in Exp. II are presented in Table 3. 
A total of 27 Ss served as controls and pro- 
vided the base frequency normative data used 
in this experiment. An analysis of the norma- 
tive data collected in Exp. I indicated that 
adequately stable data could be obtained with 
an N of this size. 


TABLE 3 

MEANINGFUL RESPONSE WORDS USED IN TRAINING AND THEIR TESTED ASSOCIATES: Exp. II 

Words Used        Associates of the Words Used and Their 
as Responses      Cultural Frequencies of Occurrence 
in Training       Low             Low-Low A     Low-Low B 

ANIMAL            Bear      3     U lys         Huma 
ICE               Berg      3     Hard          Winter 
LETTUCE           Money     4     Potato        Chow 
MOSQUITO          Gnat”     3     Pest          Nasty 
PETAL             Push      4     Fall          Brake! 
RAYON             Soft      3     Skirt         Yarn 
TABLE             Paper     4     Book          Brown 
TIN               Copper    4     Rubber*®      Pail 
TYPHOID           Illness   3     Gear®         Neck 
WAGON             Red       4     Children      Drunks 
Mean                        3.5   1             1 

* Indicates the five associates which experimental Ss 
did not select with frequencies significantly different 
from the choices indicated in the base frequency data. 


Results 


In Table 4, Column E shows the 
proportion of experimental Ss who 
gave the so-called correct responses 
for each of the 30 associates used in 
this experiment. Column C lists the 
proportions of “correct” responses for 
the control Ss, and Column E-C 
shows the differences between the 
experimental and control proportions. 

Individual chi square tests were 
performed on the differences between 
the “correct” choices of the experi- 
mental Ss and the control Ss for each 
of the 30 associates. This analysis 
indicated that with one exception all 
associates in the low-frequency group 
were chosen by Ss at significance 
values beyond the .01 level in the 
direction predicted by the experi- 
mental hypothesis. The associate of 
PETAL, namely, Push, was significant 
at the .05 level. Thus, the results 
strongly confirm the first experimental 
hypothesis even when the cultural 
frequencies of the associates tested are 
further reduced in magnitude as com- 




TABLE 4 

PROPORTION OF EXPERIMENTAL (E) AND CONTROL (C) Ss SELECTING EACH ASSOCIATE 
AND THE DIFFERENCES (E—C) BETWEEN THESE PROPORTIONS: Exp. II 

                     Low                Low-Low A     Low-Low B 
Word           E      C      E—C 
ANIMAL        .909   .074   .835** 
ICE           .864   .296   .568** 
LETTUCE       .721   .111   .610** 
MOSQUITO      .861   .074   .787** 
PETAL         .341   .185   .156* 
RAYON         .861   .259   .602** 
TABLE         .302   .111   .191** 
TIN          1.000   .370   .630** 
TYPHOID       .837   .296   .541** 
WAGON         .673   .185   .488** 
Mean                        .541 

a Mean E—C of Low-Low A and Low-Low B combined = .392. 


pared to the low-frequency associates 
of Exp. I. Similarly, chi square tests 
were made on the 20 Low-Low asso- 
ciates having cultural frequencies of 
occurrence of 1. This analysis 
indicated that 15 of these associates 
were selected by the experimental Ss 
at or beyond the .05 level of signifi- 
cance when compared with the base 
frequency data by means of two-tailed 
chi square tests. The five associates 
which did not attain significance are 
indicated in Table 3. Thus, connec- 
tions were established between the 
nonsense syllables and 75% of the 
Low-Low associates of the learned 
response words even when the cultural 
frequencies of these associates as 
responses to the learned words were so 
low as to occur only once in a norma- 
tive group of 150 Ss. 

Several comparisons were made be- 
tween the data provided by the two 
experiments. The means of the E—C 
proportions of the two Low-Low 
groups used in Exp. II were combined 
after a CR test indicated that the 
difference between these two groups 
was not significant. Two-tailed CR 
tests for uncorrelated proportions 
were made on the differences between 
all frequency groups in Exp. I and 
those in Exp. II and between the low- 
frequency group and the combined 
Low-Low groups of Exp. II. No 
significant differences between any of 
these adjusted mean proportions were 
obtained. Although there is a trend 
of decreasing mean proportions of 
correct responses for the frequency 
groups used in both experiments, the 
statistical analyses indicated that 
none of the means involved in this 
trend showed significant differences be- 
tween each other. Even the differ- 
ence between the data for the High 
group of Exp. I and the combined 
Low-Low groups of Exp. II was not 
significant. The second hypothesis 
was not supported. 


Discussion 


The findings support the assumption 
that the learning of a meaningful verbal 
response to a nonsense syllable stimulus 
results in the establishment of measur- 
able associative relationships between 
the stimulus and the verbal associative 




responses to the meaningful word. Two 
alternative explanations of this phe- 
nomenon may be considered. The effect 
may be a consequence of mediation in 
the testing phase of the experiment 
provided by recall of the learned response 
to the nonsense syllable. The S may 
recall that he has learned, for example, 
RAYON as the response to Gox. The 
associates of RAYON, namely, Soft, Ma- 
terial, etc., are then mediated by the 
recall of RAYON, and S proceeds to check 
the associate Soft as the response most 
related to Gox. On the other hand the 
theoretical approach of Bousfield and his 
associates suggests that the phenomenon 
is attributable to the higher-order condi- 
tioning of the implicit verbal associates 
of the learned response word during the 
training phase of the experiment. A test 
of the assumption of the training phase 
locus of the effect would require the 
demonstration of parasitic reinforcement 
of the associative responses when the 
learned response had been forgotten and 
was no longer available to S. Failure 
to demonstrate the phenomenon under 
this condition, however, would not 
necessarily indicate that the locus was in 
the testing phase since it may very well 
be that the time interval necessary for 
the forgetting of the originally learned re- 
sponse word is also sufficient for the 
forgetting of the associational responses. 
While the locus of the effect has not been 
tested directly, some support for a 
training phase locus may be found in a 
study by Yavuz and Bousfield (1959) 
who showed that the connotative mean- 
ing of a foreign word could be recalled 
after the supposed English translation of 
the word had been forgotten. They sug- 
gested that the conditioning of the asso- 
ciational responses of the English word 
in the training phase mediated the mean- 
ing judgments of Ss. 

It would seem that the failure to find 
a differential effect as a function of the 
habit strengths of the associative re- 
sponses may be attributed to either of 
two factors or to a combination of these 
factors. In the first place, it is evident 
that the findings here reported derive in 
part from the use of a particular method 




for appraising the presence of the asso- 
ciative connections assumed to have been 
established in the initial learning. In 
each of a series of tests, the Ss were given 
one of the nonsense syllables encountered 
in the initial learning which was followed 
by five different words. The Ss were 
told to choose the one word of these five 
which they judged to be most related to 
the given nonsense syllable. In each 
case one of the five alternatives was an 
associate of the word learned as a re- 
sponse to the nonsense syllable. Accord- 
ing to the theory outlined by Bousfield 
et al. (1958), this associate should have 
been elicited implicitly in contiguity with 
the presentation of the nonsense syllable 
during learning. It may therefore be 
said that the testing method actually 
employs the method of recognition. This 
method typically yields relatively high 
scores in tests of retention as long as the 
learned items are embedded in dissimilar 
new items as was the case in the present 
study (Luh, 1922). The sensitivity of 
the method of recognition is most likely 
due to the opportunity it provides S for 
making use of relatively weak asso- 
ciations. 

An alternative explanation of the 
failure of the findings to discriminate 
between the strengths of the associative 
habits is possible. It is conceivable that 
associative response strengths repre- 
sented by a cultural frequency of 1 in a 
population of 150 are of sufficient potency 
in certain situations to become as effect- 
ive as the associative responses whose 
strengths are reflected in higher fre- 
quencies of occurrence. If this is so, it 
would suggest that more attention needs 
to be paid to the so-called weak associa- 
tive habits in the study of verbal be- 
havior. Perhaps these habits are not as 
weak in effect as might be supposed from 
their cultural frequencies of occurrence. 
A similar phenomenon has been found 
in several studies employing Thorndike- 
Lorge frequency of usage values in which 
differences in performance as a function 
of high- or low-frequency values, while 
significant, are small in absolute differ- 
ences (Bousfield, Cohen, & Whitmarsh, 
1958a; Hall, 1954). In discussing this 




Underwood and Schulz (1960) suggest 
that “even words with the lowest fre- 
quencies may in fact have been ex- 
perienced many times by a subject who 
serves in learning experiments” (p. 59). 
It may be that in studies of this type a 
distinction must be made between rela- 
tive and absolute differences. 


SUMMARY 


Two experiments were designed to test the 
following hypotheses: (a) The paired-associate 
learning of a meaningful response word to a 
nonsense syllable stimulus has the conse- 
quence of establishing connections between 
the nonsense syllable and the members of a 
group of verbal associative responses to the 
learned response word. (b) The strength of 
the connections established between the 
nonsense syllable and the associates of the 
learned response word is an increasing func- 
tion of the cultural habit strengths of the 
associates as responses to the learned response 
word. 

Ten nonsense-syllable meaningful-word 
pairs were presented for paired-associate 
learning to 113 Ss in Exp. I and to 136 Ss in 
Exp. II. The response words were selected 
from a set of stimulus words for which free 
associational norms had been previously ob- 
tained from a population of 150 Ss. Three 
classifications of associates, namely, high, 
medium, and low, based upon cultural fre- 
quencies of occurrence in the free associational 
norms, were used in Exp. I for testing the 
hypothesis. Experiment II employed low- 
frequency associates and associates having a 
cultural frequency of 1. A forced-choice test 
was developed which involved presenting 
each S with the nonsense syllable stimulus 
followed by five words, one of which was an 
associate of the learned word. The S was 
instructed to select the one word which he felt 
was most related to the nonsense syllable. 
The selection by the S of the given associate 
was assumed to demonstrate the prior es- 
tablishment of a connection between the 
associate and the nonsense syllable. The 
results strongly supported the first hypothesis, 
but failed to support the second hypothesis 
as no significant functional relationship was 
found to exist between the choices of asso- 
ciates in the testing situation and their 




cultural frequencies of occurrence as responses 
to the meaningful words. 


REFERENCES 


BOUSFIELD, W. A., COHEN, B. H., & WHIT- 
MARSH, G. A. Associative clustering in the 
recall of words of different taxonomic fre- 
quencies of occurrence. Psychol. Rep., 
1958, 4, 39-44. (a) 

BOUSFIELD, W. A., COHEN, B. H., & WHIT- 
MARSH, G. A. Verbal generalization: A 
theoretical rationale and an experimental 
technique. Of. Naval Res. tech. Rep., 
1958, No. 23. (b) 

BOUSFIELD, W. A., WHITMARSH, G. A., & 
DANICK, J. J. Partial response identities 
in verbal generalization. Psychol. Rep., 
1958, 4, 703-713. 

COHEN, B. H. An evaluation of three associa- 
tional rationales of verbal generalization. 
Unpublished doctoral dissertation, Uni- 
versity of Connecticut, 1958. 

GLAZE, J. A. The association value of 
nonsense syllables. J. genet. Psychol., 1928, 
35, 255-267. 

HALL, J. F. Learning as a function of word 
frequency. Amer. J. Psychol., 1954, 67, 
138-140. 

LUH, C. W. The conditions of retention. 
Psychol. Monogr., 1922, 31(3, Whole No. 
142). 

MORGAN, R. L., & UNDERWOOD, B. J. Pro- 
active inhibition as a function of response 
similarity. J. exp. Psychol., 1950, 40, 592- 
603. 

RUSSELL, W. A., & JENKINS, J. J. The com- 
plete Minnesota norms for responses to 100 
words from the Kent-Rosanoff Word 
Association Test. Of. Naval Res. tech. 
Rep., 1954, No. 11. 

UNDERWOOD, B. J., & SCHULZ, R. W. Mean- 
ingfulness and verbal learning. Chicago: 
Lippincott, 1960. 

WHITMARSH, G. A., & BOUSFIELD, W. A. 
Use of free associational norms for the 
prediction of generalization of salivary 
conditioning to verbal stimuli. Psychol. 
Rep., 1961, 8, 91-95. 

YAVUZ, H. S., & BOUSFIELD, W. A. Recall of 
connotative meaning. Psychol. Rep., 1959, 
5, 319-320. 


(Received October 30, 1961) 


Journal of Experimental Psychology 
1962, Vol. 64, No. 6, 580-585 


REVERSAL AND NONREVERSAL SHIFTS WITHIN 
AND BETWEEN DIMENSIONS IN 
CONCEPT FORMATION 


I. DAVID ISAACS AND CARL P. DUNCAN 


Northwestern University 


In a number of studies of concept 
learning in human adults (Buss, 1953, 
1956; Harrow & Friedman, 1958; 
Kendler & D’Amato, 1955; Kendler & 
Mayzner, 1956), Ss were first rein- 
forced for different responses to two 
stimuli varying on some dimension 
(e.g., circle vs. square, form dimen- 
sion) while the stimuli simultaneously 
varied on one or more momentarily 
irrelevant dimensions (e.g., color). 
After mastery of this task (hereafter, 
the training task), some Ss were 
shifted to a transfer task in which each 
of the two stimuli that had been 
reinforced in training was now paired 
with the opposite response (reversal 
shift). Thus, in transfer, reversal Ss 
had to learn two re-paired S-R 
associations. Other Ss were shifted to 
a transfer task provided by reinforcing 
the stimuli (e.g., red vs. blue) on a 
previously irrelevant dimension (non- 
reversal shift to a different dimension). 
All of the studies using human adults 
as Ss (those cited above, the only ones 
of concern here) consistently found 
that nonreversal shift to a different 
dimension provided a more difficult 
transfer task, in terms of trials to 
learn, than reversal shift. 

This finding has been used to sup- 
port a mediation theory of the way 
human adults learn and transfer in 
such concept tasks (for details, see, 
e.g., Goss, 1961; Kendler & D'Amato, 
1955). However, a mediation theory 
also predicts, according to Kendler 
and D'Amato, that the reversal 
condition should yield positive, not 
negative, transfer in comparison to a 


control group that learns only the 
transfer task. Since it is usually found 
that re-pairing of S-R associations 
produces negative transfer (e.g., 
Porter & Duncan, 1953), it is im- 
portant to determine if this prediction 
can be confirmed. Of the three studies 
that used a control group, one (Kend- 
ler & D'Amato, 1955) did find that 
the reversal group learned the transfer 
task more quickly than the control 
group; one (Buss, 1953) found the 
control learned faster than the reversal 
group; and one (Harrow & Friedman, 
1958) found no difference. This dis- 
agreement among the studies is prob- 
ably unimportant because, it is sug- 
gested here, none of the studies 
actually used an appropriate control 
group. In all cases the control group 
learned only the transfer task; no 
attempt was made to equate control 
and experimental groups on non- 
specific transfer variables (e.g., learn- 
ing to learn, warm up) which would 
be developed in the experimental 
groups by the training task. Since 
nonspecific transfer factors are likely 
to have a net positive transfer effect, 
performance of the control groups in 
the three cited studies was probably 
poorer than would have been the case 
if nonspecific transfer had been con- 
trolled. So, the present study is a 
further comparison of reversal shift 
(R) and nonreversal shift to a 
different dimension (NRD) in trans- 
fer, along with an attempt to provide 
a more appropriate transfer control 
for these groups. 

In addition to the usual NRD 
condition, it is also possible, as Har- 
row and Friedman (1958) point out, 
to provide another kind of nonreversal 
shift in transfer, viz., nonreversal 
shift on the same dimension that was 
relevant in training (NRS). Harrow 
and Friedman suggest that this NRS 
condition should also, like the R 
condition, be easier to learn, in 
transfer, than NRD. This prediction 
is also tested in the present study. 


METHOD 


Apparatus.—The S and E, seated on 
opposite sides of a table, were separated by a 
vertical plywood panel 29 in. high, 48 in. 
wide. The side of the panel viewed by S was 
painted gray and contained a plastic window 
24 in. high, 44 in. wide, centered in the panel 
11 in. above the table. Two lights, one on 
each side of the window, were used to provide 
reinforcement. Two push buttons were fixed 
to the table, one below each light. If S 
pushed either button, the light above it came 
on to signal a correct choice, provided that E 
had previously set a mercury switch on E's 
side of the table. 

On E's side of the panel a deck of stimulus 
cards was pressed against the window by 
means of a drawbar and springs. Thus, when 
E removed the card appearing in the window, 
the next card was immediately revealed. 

Stimuli.—For Ss in experimental groups the 
stimuli varied on two dimensions, form and 
number (of forms), one or the other of which 
was relevant at some time during the experi- 
ment for all Ss. In addition, the stimuli 
varied in color (all forms on any one stimulus 
card were either red or blue), a dimension 
that was always irrelevant. All stimuli were 
drawn with colored pencils on white 3 X 5 in. 
cards. Cards were inserted in plastic en- 
velopes. 

There were four values on the form dimen- 
sion (circle, square, hexagon, triangle), and 
four on the number dimension (one, two, 
three, or four forms on a card). At any one 
time during the experiment S had to respond 
to just two of the values, on one of the 
dimensions, paired against each other, e.g., 
circle vs. square. Only the following pairs 
were used: circle vs. square, hexagon vs. 
triangle, one vs. three forms, two vs. four 
forms. 

The training stimuli for the control group 
were vertical arrows, colored black, drawn on 
3 X 5 in. cards. These control stimuli also 
varied on two dimensions, each relevant for 
some Ss: direction (up-pointing or down- 


TABLE 1 
STIMULI AND EXPERIMENTAL DESIGN 


Training Transfer 
Group S 
Left Right Left Right 
‘ee [ee | #8 | Be 
R (Reversal to same 2 C2, C4 4 , S3 C3. 
Proa bma 3 H1, H3 T1; T3 T2, T4 H2, H4 
4 H2, H4 T2, T4 Ti, T3 H1, H3 
| ar 
NRD (Nonreversal to 2 2s, , 4C Same as for Group R 
different dimension) a me oe nee a p 
TEH | BE 
NRS (Nonreversal to 2 H1, H3 PAS Same as for Group R 
same dimensi 3 C2, C4 $2, S4 c 
same dimension) : GLO S1. S3 
| EE | RE 
Control 4 a UZ DX, DZ Same as for Group R 
4 XU, XD ZU, ZD 


Note.—C, S, H, T = circle, square, hexagon, triangle; 1, 2, 3, 4 = number dimension, number of forms on a 
card; U or D = up-pointing or down-pointing arrow; X = short arrow, Z = tall arrow. Symbols in bold face print 
indicate reinforced stimuli or dimension. Left and right indicate responses. 




pointing arrowhead), and height ( in. or 
2 in.). There was also a dimension that was 
always irrelevant, width: an arrow was either 
din. or din. wide. There was only one arrow 
on each card. 

Conditions.—The design is shown in Table 
1. There were three experimental groups and 
a control group, all given different training 
tasks but the same transfer task. As may 
be seen in Table 1, all four combinations 
formed by putting together a pair of stimuli 
from one dimension and a pair from the other 
dimension were used, for different Ss within 
each group, in both training and transfer 
tasks. Group R was trained on a form 
discrimination, either circle vs. square, or 
hexagon vs. triangle. Group NRD was 
trained on a number discrimination, either 
one vs. three, or two vs. four forms. Group 
NRS was, like Group R, trained on forms 
but was transferred to different forms, whereas 
Group R was transferred to the same forms 
reversed. Group C (control) was trained 
either on direction (up- or down-pointing 
arrow), or on height (tall or short arrow). 
All groups were transferred to the same form 
discrimination. 

Table 1 also shows that for Group NRD, 
partial reinforcement on the transfer task 
was controlled. When this group was shifted 
to the transfer task, the previously reinforced 
values on the number dimension were changed 
to new values so S would not receive partial 
reinforcement (by continuing to respond to 
the training stimuli) in transfer. A more 
detailed presentation of this partial reinforce- 
ment issue in concept shifts is given by 
Harrow and Friedman (1958). Since, for 
Group NRD, shifting from training to 
transfer involved changing stimulus values 
on one dimension while not changing values 
on the other dimension, it was decided to use 
this same “kind of change” for all three 
experimental groups, as may be seen in 
Table 1. 

There is one more important feature of the 
design shown in Table 1. It can be seen that 
in both Group R and Group NRS, any 
particular form discrimination (e.g., in Table 
1, C1, C3 vs. S1, S3) required of some one S 
(subject) during training, was also required 
of some other S during transfer. Therefore, 
when the training task for either Group R or 
Group NRS is taken as a whole (ignoring the 
counterbalancing of stimuli for individual Ss) 
each of these training tasks provides a measure 
of difficulty of the transfer task in the absence 
of nonspecific transfer from a prior task. In 
other words, performance of Groups R and 
NRS during training is a measure of how an 




inappropriate control group (not corrected 
for nonspecific transfer) would perform on the 
transfer task. 

Subjects.—The Ss were students in intro- 
ductory psychology courses, assigned to 
groups in turn. Each of the three experi- 
mental groups (R, NRD, NRS) was assigned 
32 Ss. Two Ss in Group NRD failed to reach 
criterion on the training task and were 
replaced. 

It soon became clear that for Ss in Group 
C, discrimination of height of arrows was 
much more difficult than discrimination of 
direction. Therefore, 48 Ss were run in 
Group C, 24 Ss on the height discrimination 
(Group Ch), 24 on direction (Group Cd). 
Ten Ss failed to learn the height discrimina- 
tion and were replaced. 

Procedure.—For half the Ss in each group, 
the stimuli reinforced by pressing the left 
or the right button are shown under the Left 
or Right columns in Table 1. For the remain- 
ing Ss this was reversed; stimuli appearing 
under Left were reinforced for pressing the 
right button, etc. 

With stimuli varying on two dimensions, 
plus an always irrelevant dimension, a deck 
of 8 cards was necessary to represent all 
possible combinations for either experimental 
or control Ss. Two such decks were prepared, 
so 16 cards in all were available. Four 
different orders of these 16 cards were used. 
This permitted presentation of each of the 
four combinations of possibly relevant 
stimuli, number and form (or height and 
direction of arrow in Group C), equally often 
as the first card shown an S. Then, for each 
S, the second card presented revealed the 
other values of number and form (or height 
and direction) that had not appeared on the 
first card. Thus, in the first two cards 
presented S saw all possibly relevant stimulus 
values and dimensions that were to appear in 
the particular task on which he was working. 
Four different random orders were used to 
determine the order of presentation of the 
remaining 14 cards, with the restriction that 
the 8 different stimulus cards be shown before 
any card was repeated. 

The instructions to S essentially told him 
to press the left or right button for each card 
appearing in the window, and that if he were 
correct, the light above the button would 
come on. 
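
The construction of the card deck described in the Procedure can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' actual randomization: the function names are hypothetical, the stimulus values shown are one of the four pairings used, and the color of the second card is here chosen at random, a detail the report does not specify.

```python
import itertools
import random

def build_deck(forms=("circle", "square"), numbers=(1, 3), colors=("red", "blue")):
    """All 2 x 2 x 2 combinations of form, number, and color: 8 distinct cards."""
    return [dict(form=f, number=n, color=c)
            for f, n, c in itertools.product(forms, numbers, colors)]

def presentation_order(deck, first_index, rng):
    """Order the pack of 16 cards (two copies of the 8-card deck) for one S.

    Card 1 is deck[first_index]. Card 2 shows the other form and the other
    number, so that S sees all possibly relevant values in two cards. The
    remaining 6 distinct cards follow in random order, so that all 8 cards
    appear before any card is repeated; the second copy of the deck is then
    shown in random order.
    """
    first = deck[first_index]
    complements = [c for c in deck
                   if c["form"] != first["form"] and c["number"] != first["number"]]
    second = rng.choice(complements)          # assumption: color picked at random
    rest = [c for c in deck if c is not first and c is not second]
    rng.shuffle(rest)
    second_pass = deck[:]
    rng.shuffle(second_pass)
    return [first, second] + rest + second_pass

deck = build_deck()
order = presentation_order(deck, first_index=0, rng=random.Random(0))
print(len(deck), len(order))
```

Varying first_index over cards that carry the four form-number combinations gives each combination equally often as the first card, as the Procedure requires.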

For both training and transfer tasks, S was 
required to reach a criterion of six successive 
correct responses. If S had not met criterion 
after three presentations of the pack of 16 
cards (six presentations of the 8 different 
stimulus cards), S was dropped as a nonsolver. 
The S was allowed to proceed at his own 
pace; on the average, about 7 sec. elapsed 
between presentation of successive cards. 
There was no interruption between training 
and transfer tasks. 
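
The scoring rule implied by the criterion can be sketched as follows. The function names are hypothetical, and two points are assumptions rather than statements from the report: that the limit of three presentations of the pack corresponds to 48 trials, and that the error ratio uses the trials preceding the criterion run as its denominator.

```python
def trials_to_criterion(responses, run_length=6, max_trials=48):
    """Trials preceding the first run of `run_length` successive correct
    responses, or None if S is a nonsolver.

    `responses` holds one boolean per trial (True = correct). The criterion
    trials themselves are not counted.
    """
    streak = 0
    for i, correct in enumerate(responses[:max_trials], start=1):
        streak = streak + 1 if correct else 0
        if streak == run_length:
            return i - run_length
    return None

def error_ratio(responses, run_length=6, max_trials=48):
    """Errors divided by trials to criterion; None when the ratio is undefined."""
    n = trials_to_criterion(responses, run_length, max_trials)
    if not n:   # nonsolver (None) or criterion met on the first six trials (0)
        return None
    errors = sum(1 for r in responses[:n] if not r)
    return errors / n

# Hypothetical protocol: wrong, right, wrong, then six correct in succession.
protocol = [False, True, False] + [True] * 6
print(trials_to_criterion(protocol), error_ratio(protocol))
```

For the hypothetical protocol shown, S is credited with 3 trials to criterion and 2 errors.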


RESULTS 


Training.—The left portion of 
Table 2 summarizes performance on 
the training task as measured by 
number of trials to the criterion of six 
successive correct responses. The 
six criterion trials are not included in 
the data. Although 32 Ss were taken 
to criterion on the training task in 
Groups R, NRD, and NRS, 1 S in 
Group NRS and 1 S in NRD failed to 
reach criterion on the transfer task. 
These 2 Ss were eliminated, and 1 S 
in Group R, with median performance 
in training, was also eliminated to 
reduce N to 31 in each of the three 
experimental groups. 

There was no significant difference 
among the three experimental groups 
(top three lines in Table 2) on the 
training task (F < 1). Hartley’s test 
indicated that the group variances 
were homogeneous (Fmax = 1.87, 
df = 3/30). 
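
As a rough check on the reported Fmax, the statistic can be recomputed from the training values in Table 2. The sketch assumes that the second column of each pair in Table 2 gives the standard error of the mean and that each experimental group has N = 31; the function name is hypothetical.

```python
def f_max(standard_errors, n):
    """Hartley's Fmax for groups of equal size n: the largest group variance
    divided by the smallest, with each variance recovered as n * SE**2."""
    variances = [n * se ** 2 for se in standard_errors]
    return max(variances) / min(variances)

# Training standard errors for Groups R, NRD, and NRS as read from Table 2.
training_se = {"R": 1.70, "NRD": 1.98, "NRS": 1.45}
fmax_training = f_max(training_se.values(), n=31)
print(round(fmax_training, 2))
```

With equal N the group size cancels, so the ratio depends only on the two extreme standard errors. The value obtained from the rounded table entries is about 1.86, which agrees with the reported 1.87 within rounding.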

It was noted earlier that it was 
necessary to run separate subgroups 
in Group C because of differential 
difficulty of the dimensions of training 
stimuli. The difference between the 
mean (see Table 2) of the subgroup 
that discriminated direction (Group 
Cd) and the subgroup that dis- 
criminated height (Group Ch) was 
highly significant (t = 4.06). 

When Group Ch was included with 
the three experimental groups in 
analysis of variance of training means, 
F was 3.22 (P < .05, df = 3/113). 
By t test, the mean for Group Ch 
differed significantly from the means 
of each of the three experimental 
groups at the 5% level or less. 
Analysis of variance of Group Cd and 
the experimental groups yielded F<1. 


TABLE 2 
MEAN TRIALS TO CRITERION 
IN TRAINING 

                Training          Transfer 
Group    N     Mean     σM      Mean     σM 
R        31     7.03   1.70     7.52   1.72 
NRD      31     8.81   1.98    13.77   2.10 
NRS      31     7.00   1.45     3.03    .93 
Cd       24     4.67   1.63     2.67    .49 
Ch       24    13.83   1.63     7.42   1.76 


Hereafter, Group Cd will be con- 
sidered the more appropriate control 
group. 

Transfer.—Mean trials to criterion 
on the transfer task are shown in 
Table 2. Again, the means do not 
include the six criterion trials. Since 
the variances of the three experi- 
mental groups were heterogeneous 
(Fmax = 5.03, P < .01), and since the 
distributions were also positively 
skewed, the scores were transformed 
to log (X +1). This eliminated the 
heterogeneity of variance and pro- 
duced approximately normal distri- 
butions. Analysis of variance of the 
transformed scores of the experimental 
groups gave F= 19.4 (P < .001, 
df = 2/90). By t test, Group R 
differed significantly from Group 
NRD (t= 3.14), and from Group 
NRS (t = 3.32). Groups NRD and 
NRS also differed significantly 
(t = 6.46). The fact that Cond. R 
was easier than Cond. NRD is in 
agreement with all previous studies 
that have made this comparison. The 
new finding is that Cond. NRS was 
easiest of all. 
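
The effect of the log (X + 1) transformation on unequal variances can be illustrated with a small sketch. The scores below are invented for illustration and are not the study's data; the base of the logarithm is also an assumption (base 10 is used here), although the choice of base does not change the variance ratio.

```python
import math
from statistics import variance

# Hypothetical trials-to-criterion scores, positively skewed (not real data).
easy_group = [0, 1, 1, 2, 3, 5]
hard_group = [2, 5, 8, 13, 21, 40]


def log_transform(scores):
    """Apply log10(X + 1); the +1 keeps zero scores defined."""
    return [math.log10(x + 1) for x in scores]


def variance_ratio(a, b):
    """Larger sample variance divided by the smaller one."""
    va, vb = variance(a), variance(b)
    return max(va, vb) / min(va, vb)


raw_ratio = variance_ratio(easy_group, hard_group)
log_ratio = variance_ratio(log_transform(easy_group), log_transform(hard_group))
print(round(raw_ratio, 1), round(log_ratio, 1))
```

With these made-up scores the variance ratio shrinks sharply after transformation, which is the kind of change the authors describe.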

Analysis of variance of transformed 
scores of experimental groups and 
Group Cd yielded F = 16.6 (P <.001, 
df = 3/113). By t test, Group Cd 
differed significantly from Group R 
(t = 2.42) and from Group NRD 
(t = 5.35) but not from Group NRS 
(t < 1). 


TABLE 3 
MEAN ERROR RATIOS 

      |    |   Training   |   Transfer
Group | N  | Mean | σM    | Mean | σM
R     | 31 | .52  | .013  | .70  | .020
NRD   | 31 | .53  | .009  | .49  | .007
NRS   | 31 | .43  | .013  | .42  | .029
Cd    | 24 | .50  | .029  | .44  | .022
Ch    | 24 | .50  | .004  | .66  | .020


Errors.—The number of trials on 
which S pressed the wrong button 
(errors), divided by the number of 
trials to criterion, was computed for 
each S. These error ratios for both 
training and transfer are summarized 
in Table 3. There were no significant 
differences among groups in training. 
Analysis of variance of the transfer 
data for experimental groups and 
Group Cd yielded F = 4.80 (P < .01, 
df = 3/113). By t test, Group R 
differed significantly from Group 
NRD (t = 2.53), from Group NRS 
(t = 3.39), and from Group Cd 
(t = 2.92). Other comparisons were 
not significant. 


Discussion 


The data show that Group R, operat- 
ing under a negative transfer paradigm, 
did in fact show significant negative 
transfer when compared to a control 
group in which nonspecific transfer was 
controlled. Group R also showed the 
highest error ratio in transfer, another 
index of intertask interference. 

The need to control for nonspecific 
transfer in studies of this kind is indicated 
by the powerful effects such transfer had 
in the present study. Recall that for a 
group of Ss as a whole, the training task 
for both Groups R and NRS was iden- 
tical to the transfer task for all groups; 
therefore, performance of these groups 
in training yields a measure of difficulty 
of the transfer task for Ss not provided 
with training for nonspecific transfer. 
This measure was essentially the same 
for both Groups R and NRS (7.03 and 
7.00 mean trials to criterion in training, 
respectively). Nonspecific transfer was 
presumably controlled in Group Cd, 
and this group required a mean of only 
2.67 trials (transfer mean) to learn the 
same task. It seems likely that in the 
studies of Buss (1953), Harrow and 
Friedman (1958), and Kendler and 
D’Amato (1955), the reversal groups 
would have shown negative transfer had 
the control groups been trained so as to 
minimize nonspecific transfer. 

Most of the data on reversal shifts in 
concept learning in human adults has 
been interpreted in terms of “mediating 
mechanisms” or “implicit cues” (Goss, 
1961; Harrow & Friedman, 1958; Kend- 
ler & D’Amato, 1955). The interpreta- 
tion of the present data, which follows, 
avoids this particular theoretical lan- 
guage. Instead, the interpretation is 
based largely on a single, and presumably 
fairly simple, assumption. 

Assume that Ss, reinforced on a par- 
ticular dimension and extinguished on all 
other dimensions during training, tend 
to continue to respond, initially, to the 
reinforced training dimension during 
transfer. If so, then Group NRS 
(trained on forms, transferred to new 
forms) would have responded primarily 
to the two new forms on the transfer 
task. Since the two new forms would 
have had roughly equal probabilities of 
association with the two responses, the 
transfer task would essentially reduce 
to a simple two-choice discrimination for 
these Ss. Group NRS should, and did, 
learn the transfer task very rapidly. 

According to the same assumption, 
Group R (trained on forms, transferred 
to the same forms re-paired with the 
responses) should also have continued 
to respond to stimuli on the form dimen- 
sion early in transfer. But because the 
forms available to these Ss had been 
differentially reinforced in training, and 
were re-paired in transfer, initial prob- 
ability of association between the forms 
and responses would not be equal. 
Thus, although Group R was also faced, 
it is assumed, with only a two-choice 
discrimination in transfer, the training 
associations had to be extinguished 
before the transfer task could be learned. 
Group R should, and did, transfer more 
slowly than Group NRS, and should 
make many errors. And as has been 
shown, Group R should and did transfer 
more slowly than an appropriate control 
group. 

Still following the basic assumption, 
Group NRD (trained on number, trans- 
ferred to forms) would continue to 
respond to the number dimension during 
transfer. Since no number stimuli, new 
or old, were consistently reinforced 
during transfer, the task for these Ss 
became quite difficult. First, responses 
to stimuli on the number dimension had 
to be extinguished. There now remained 
two dimensions, form and color, from 
which to choose; since both these dimen- 
sions had been extinguished during 
training, there was no basis for choosing 
between them. So Group NRD next 
had to discover that it was forms, not 
colors, that was being reinforced during 
transfer. Finally, these Ss had to dis- 
cover which form went with which 
response. It seems clear that the total 
number of alternatives from which to 
choose was greater for Group NRD than 
for any other group (Goss, 1961, has 
come to the same conclusion), and Group 
NRD showed the poorest performance of 
all in transfer. Viewed this way, it is not 
surprising that Group NRD should be 
inferior to Group Cd and Group NRS. 
But in this and in all previous studies 
that have made the comparison, Group 
NRD also learned the transfer task more 
slowly than even the negative transfer 
group (R). This finding simply shows 
that having to deal with several stimulus 
alternatives that have previously been 
subjected to differential reinforcement 
and extinction is more difficult than 
having to deal with a re-paired situation 
involving basically only two associations, 
a difference in task difficulty that would 
seem to have little theoretical import. 


SUMMARY 


In a study of human concept formation, 
two experimental groups were trained on a 
two-choice form discrimination, with number 
and color stimuli irrelevant. For one group 
(reversal shift) the transfer task consisted of 
re-pairing the training stimuli with the re- 
sponses; for the other group (nonreversal 
shift to the same dimension), two new forms 
were used as transfer stimuli. A third ex- 
perimental group (nonreversal to a different 
dimension) was trained on number stimuli and 
transferred to forms. A control group was 
trained on stimuli differing from any of those 
used for experimental groups, then trans- 
ferred to forms. The same two-choice form 
discrimination, with number and color 
irrelevant, was used as the transfer task for 
all groups. 

The results showed three significantly 
different levels of performance (in terms of 
trials to learn) on the transfer task. In order 
of best to poorest performance, the levels 
were: (a) nonreversal to same dimension 
and control (these groups did not differ), 
(b) reversal shift, and (c) nonreversal to 
different dimension. As compared to the 
control, the reversal group showed significant 
negative transfer. It was suggested that 
performance of all groups could largely be 
accounted for by a combination of two 
factors: nonspecific transfer, and a specific 
tendency to continue to respond in transfer 
to the dimension of stimuli reinforced in 
training. 

REFERENCES 

Buss, A. H. Rigidity as a function of reversal 
and nonreversal shifts in the learning of 
successive discriminations. J. exp. Psychol., 
1953, 45, 75-81. 

Buss, A. H. Reversal and nonreversal shifts 
in concept formation with partial rein- 
forcement eliminated. J. exp. Psychol., 
1956, 52, 162-166. 

Goss, A. E. Verbal mediating responses and 
concept formation. Psychol. Rev., 1961, 
68, 248-274. 

Harrow, M., & FRIEDMAN, G. B. Comparing 
reversal and nonreversal shifts in concept 
formation with partial reinforcement con- 
trolled. J. exp. Psychol., 1958, 55, 592-598. 

KENDLER, H. H., & D'AMATO, M. F. A 
comparison of reversal and nonreversal 
shifts in human concept formation behavior. 
J. exp. Psychol., 1955, 49, 165-174. 

KENDLER, H. H., & MAYZNER, M. S., JR. 
Reversal and nonreversal shifts in card- 
sorting tests with two or four sorting 
categories. J. exp. Psychol., 1956, 51, 244- 
248. 

PORTER, L., & DUNCAN, C. P. Negative 
transfer in verbal learning. J. exp. 
Psychol., 1953, 46, 61-64. 


(Received November 2, 1961) 


Journal of Experimental Psychology 
1962, Vol. 64, No. 6, 586-588 


EFFECTS OF SECONDARY REINFORCEMENT SCHEDULES 
IN EXTINCTION ON CHILDREN’S RESPONDING¹ 


N. A. MYERS AND J. L. MYERS 


University of Massachusetts 


Strong secondary reinforcement ef- 
fects have not been consistently 
demonstrated. Nor is there agree- 
ment regarding the appropriate ex- 
planatory concepts. In particular, 
doubt has been cast upon the ex- 
planation of Sʳ (secondary reinforce- 
ment) in terms of “discrimination” 
between conditioning and extinction 
trials (Bitterman, Fedderson, & Tyler, 
1953). 


Support for the discrimination hypothesis 
comes from a study by Melching (1954). 
He presented two groups of rats with 50% 
neutral stimulus (buzzer) in training and 
found no difference in extinction responding 
between the group given no buzz in extinction 
and the group given 100% buzz in extinction. 
A study by Myers (1960) presents negative 
evidence for the discrimination hypothesis. 
She trained children, using tokens as po- 
tential secondary reinforcers, and found that 
of the two groups trained with 50% token, 
the group receiving 100% token during ex- 
tinction made significantly more responses 
than the group receiving no tokens during 
extinction. 


Resolution of the differences in the 
results of Myers and Melching is 
difficult without further data. The 
studies differed in the species of S 
and in the type of neutral stimulus. 
The present study was designed to 
provide a further test of the “dis- 
crimination” hypothesis with children 
as Ss (as in the Myers’ study) and the 
buzzer as reinforcer (as in Melching’s 
study). Furthermore, a low rate of 
presentation of both the primary and 
neutral stimuli has been used during 
training, in accord with recent data 
on the effectiveness of such schedules 
in establishing secondary reinforcers 
(Fox & King, 1961; Zimmerman, 
1957, 1959). An even more im- 
portant reason for using such sched- 
ules is that the difference between the 
training rate and 100% buzzer in 
extinction should be clearly greater 
than the difference in training rate and 
0% in extinction, yielding a better 
test of the “discrimination” hy- 
pothesis than either the Myers or the 
Melching study. 


1 This research was supported by funds 
from National Institute of Mental Health 
Grant M-2620 (C-1). 


METHOD 


Apparatus.—The apparatus employed was 
a portable box designed to attract the interest 
of preschool children. On the front was 
painted a clown face, having red jewel-light 
eyes, a push-button nose, and a slot-tray 
mouth. M & M coated chocolate candy was 
dispensed through a tube to the mouth of the 
clown, while a ½-sec. buzz was heard from 
the interior of the box. The E had access to 
and operated two silent knife switches which 
allowed administration of the predetermined 
reinforcement. The number of responses was 
recorded on an electric magnetic counter 
mounted on the back of the box, out of S's 
sight. The E recorded the number of 
responses made during each successive minute 
of extinction. 

Subjects.—The Ss were 75 boys and girls 
between the ages of 4 yr., 7 mo. and 5 yr., 
11 mo., attending kindergartens in Northamp- 
ton, Massachusetts.² 

Procedure.—Each S was asked in the 
classroom if he would like to play a clown 
game, and if he acquiesced he was led to a 
small testing room. The clown was placed 


2 The authors wish to thank W. Barty, 
Superintendent of Schools, and Esther W…, 
Elementary School Supervisor, Northampton, 
Massachusetts, for providing facilities and 
subjects for this study, and teachers Mrs. 
Grace and Mrs. Suprenaut for their help and 
cooperation. 


TABLE 1 
DESIGN OF EXPERIMENT AND MEAN NUMBER 
OF EXTINCTION RESPONSES 
FOR EACH GROUP 

      |   Conditioning     | Extinction | Mean
Group | Candy   | Buzzer   | Buzzer     | No. of
      | (%)     | (%)      | (%)        | Responses
E1    | 20      | 20       | 100        | 102.73
E2    | 20      | 20       |  20        |  55.80
E3    | 20      | 20       |   0        |  36.40
C1    | 20      |  0       |   0        |  72.80
C2    | 20      |  0       | 100        |  66.60


on a small table and S was instructed to sit 
at the small chair in front of it. No written 
instructions were read to S, but E standard- 
ized the verbal instructions as much as 
possible. Attention was called to the clown 
face, especially the nose. The Ss were told 
that “something happens when you press his 
nose. Let's see what happens.” The E 
pressed the clown’s nose and thereby received 
a buzz and an M & M. The S was encouraged 
to try it also and was given one rewarded 
preliminary trial. He was then told he could 
stay and play the game “as long as you 
want.” The E then sat down behind the 
table, facing the open back of the box and S. 
Each S was reinforced according to a 
predetermined 20% reinforcement schedule 
which delivered a total of 15 M & M candies. 
Immediately following the fifteenth rein- 
forcement, the extinction period commenced; 
no candy reinforcement was administered, 
and each S was run until he stopped and 
indicated a desire to return to the classroom or 
until 5 min. had elapsed, at which time E 
terminated the session. One last candy was 
offered at the end of the extinction period. 
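
One way such a predetermined schedule could be laid out is sketched below. The article states only that a 20% schedule delivered 15 candies and that extinction began immediately after the fifteenth; the block-randomized construction, the function name `make_schedule`, and the seed are assumptions made for illustration, not the authors' procedure.

```python
import random


def make_schedule(n_reinforced=15, block_size=5, seed=0):
    """Build a list of booleans, one per response; True marks a reinforced response.

    Assumption: one reinforcement falls at a random position within each block
    of five responses (20%), except the last block, where it falls on the final
    response so that extinction can begin right after the fifteenth candy.
    """
    rng = random.Random(seed)
    schedule = []
    for block in range(n_reinforced):
        is_last_block = block == n_reinforced - 1
        position = block_size - 1 if is_last_block else rng.randrange(block_size)
        schedule.extend(i == position for i in range(block_size))
    return schedule


schedule = make_schedule()
print(len(schedule), sum(schedule))
```

Under these assumptions training lasts 75 responses, of which 15 are reinforced.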
Design—Eight boys and 7 girls were 
assigned randomly to each of four groups. 
Fifteen more children were assigned to a 
second control group, run after the others. 
Three E groups received a ½-sec. buzz every 
time a candy was received during training 
(20% reinforcement with M & M and buzz). 
They differed only with respect to extinction 
treatment: one group (100% buzz) received 
the buzz for every button press in extinction; 
one group (20% buzz) heard the buzz on 
approximately every fifth response, as in 
training; the third group (0% buzz) never 
heard the buzzer in extinction. A control 
group (C1) never received the buzzer either 
during training or extinction; they received 
20% reinforcement with M & M candy alone 
during training, and no reinforcement during 
extinction. A second control group (C2) also 
received 20% reinforcement with M & M 
candy alone during training, but received the 
buzz for every button press in extinction. 
The design is presented in Table 1, along 
with the mean number of extinction responses 
for each group. 


RESULTS 


The mean numbers of responses, for 
successive minutes of extinction, for 
the five groups are presented in Fig. 1. 
An analysis of variance was performed 
on these data and yielded a significant 
difference between groups (F = 5.98, 
df = 4/70, P < .001). There was a 
significant decrease in responding for 
all groups over time (F = 60.49, 
df = 4/280, P < .001), but the 
Groups X Time interaction was not 
significant. 
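
The degrees of freedom reported here follow from the design of five groups of 15 Ss, each observed over five successive minutes. A short sketch of the bookkeeping, assuming a standard groups-by-time mixed analysis (variable names are illustrative):

```python
# Design constants taken from the Method section.
n_groups = 5          # E1, E2, E3, C1, C2
n_per_group = 15
n_minutes = 5         # successive minutes of extinction

n_subjects = n_groups * n_per_group

df_groups = n_groups - 1                        # between-groups effect
df_subjects_within_groups = n_subjects - n_groups  # error term for groups
df_time = n_minutes - 1                         # within-Ss effect of minutes
df_groups_x_time = df_groups * df_time          # interaction
df_time_error = df_subjects_within_groups * df_time  # error term for time

print(df_groups, df_subjects_within_groups, df_time, df_time_error)
```

These values agree with the 4/70 and 4/280 given in the text.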

Duncan’s multiple range test was 
applied to compare the groups with 
one another. All differences were 
significant at the .01 level except those 
between E2 and C1 (where P < .05), 
between E2 and C2, and between C1 
and C2. 

FIG. 1. Mean number of responses for 
successive minutes of extinction. (Curves: 100% buzz, 
novel-stimulus control, no-buzz control, 20% buzz, and 
0% buzz. Abscissa: successive minutes of extinction, 
1 to 5. Ordinate: mean no. responses.) 

Discussion 


A simple discrimination explanation of 
Sʳ effects (Melching, 1954) would predict 
greatest number of extinction responses 
from the 20% buzz group in this study, 
since the schedule of Sʳ presentation 
during conditioning and extinction is 
identical, and therefore, the extinction 
period is less discriminable from the 
former conditioning period than for any 
other group. Zimmerman (1957, 1959) 
also would predict greatest response 
strength for the 20% buzz group, arguing 
that any Sʳ value of the buzzer accrued 
during conditioning would be dissipated 
more slowly by more occasional pres- 
entation during extinction. However, 
the results quite clearly refute these 
hypotheses: the 100% buzz group made 
almost twice as many extinction re- 
sponses as the 20% buzz group. And, 
when the 20% buzz group was compared 
with the primary control group, it was 
seen that the buzz presented 20% of the 
time did not operate to increase response 
strength above primary extinction level. 
Furthermore, the simple discrimination- 
generalization model would predict a 
higher level of extinction responding for 
the 0% buzz group than for the 100% 
buzz group, in this experiment, since the 
change from 20% buzz to 0% buzz is not 
as great as the change from 20% to 100% 
buzz, therefore not as discriminable, and 
conditioned responses should be gen- 
eralized more easily. Again, the results 
do not support this prediction: the 100% 
buzz group made almost three times as 
many extinction responses as the 0% 
buzz group. 

It appears that some notion of a 
supplementary reinforcing role of the 
buzzer stimulus, as suggested by Myers 
(1958) and Myers (1960) is needed to 
account for the significantly greater 
number of responses made with 100% 
buzz presentation in extinction. It may 
be noted that the significant difference 
between the 100% buzz group and the 
novel-stimulus control group which also 
received 100% buzz in extinction is 
evidence that the reinforcing effect is 
due to previous association with the 
candy. 

This modified discrimination model 
assumes that response strength in ex- 
tinction is a function of the difference in 
percentage buzz from training to ex- 
tinction. In contrast to the Bitterman- 
Melching approach, the sign of the differ- 
ence is retained; increments in percent- 
age buzz should yield more responses 
than no change, which in turn should 
result in more responses than decrements. 
This prediction is clearly borne out in the 
present study. The only incompatible 
finding is the significant difference be- 
tween Groups E2 (20% buzz in condi- 
tioning and extinction) and C1 (0% buzz 
in conditioning and extinction). The 
theory would predict no difference, as 
would also the Bitterman-Melching ap- 
proach. However, it should be noted 
that this difference was of considerably 
less statistical significance than those 
differences predicted by the theory. 
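
The contrast between the two accounts can be made concrete for the three experimental groups. The sketch below is an illustrative reading of the argument, not the authors' formal model: the signed account orders groups by the change in percentage buzz from training to extinction, while the simple discrimination account orders them by the size of that change regardless of sign. Percentages and means are those of Table 1; the dictionary layout and names are assumptions.

```python
# Training buzz %, extinction buzz %, and mean extinction responses (Table 1).
groups = {
    "E1": {"train": 20, "ext": 100, "mean": 102.73},
    "E2": {"train": 20, "ext": 20, "mean": 55.80},
    "E3": {"train": 20, "ext": 0, "mean": 36.40},
}


def signed_change(g):
    """Extinction minus training percentage; positive means an increment."""
    return g["ext"] - g["train"]


# Signed account: larger increments should give more responses.
predicted_signed = sorted(groups, key=lambda k: signed_change(groups[k]), reverse=True)

# Simple discrimination account: smaller absolute change should give more responses.
predicted_unsigned = sorted(groups, key=lambda k: abs(signed_change(groups[k])))

observed = sorted(groups, key=lambda k: groups[k]["mean"], reverse=True)

print(predicted_signed, predicted_unsigned, observed)
```

For these three groups the signed ordering matches the observed ordering of means, whereas the unsigned ordering places the 100% buzz group last.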


SUMMARY 


Kindergarten children were trained in a 
free operant situation with candy as a reward. 
A group receiving 20% buzzer presentations 
in training, and shifted to 100% buzzer in 
extinction responded significantly more than 
similarly trained groups shifted to 20% 
buzzer and 0% buzzer in extinction. This 
100% buzzer group also performed better 
than a group which was similarly extinguished 
but which had not experienced the buzzer in 
training. It was concluded that a secondary 
reward effect was demonstrated. 


REFERENCES 


BITTERMAN, M. E., FEDDERSON, W. E., & 
TYLER, D. W. Secondary reinforcement 
and the discrimination hypothesis. Amer. 
J. Psychol., 1953, 66, 456-464. 

FOX, R. E., & KING, R. A. The effects of 
reinforcement scheduling on the strength 
of a secondary reinforcer. J. comp. 
physiol. Psychol., 1961, 54, 266-269. 

MELCHING, W. H. The acquired reward 
value of an intermittently presented neutral 
stimulus. J. comp. physiol. Psychol., 1954, 
47, 370-374. 

MYERS, J. L. Secondary reinforcement: A 
review of recent experimentation. Psychol. 
Bull., 1958, 55, 284-301. 

MYERS, N. A. Extinction following partial 
and continuous primary and secondary 
reinforcement. J. exp. Psychol., 1960, 60, 
172-179. 

ZIMMERMAN, D. W. Durable secondary rein- 
forcement: Method and theory. Psychol. 
Rev., 1957, 64, 373-383. 

ZIMMERMAN, D. W. Sustained performance 
in rats based on secondary reinforcement. 
J. comp. physiol. Psychol., 1959, 52, 353- 
358. 


(Received November 24, 1961) 


Journal of Experimental Psychology 
1962, Vol. 64, No. 6, 589-592 


SIMULTANEOUS INDUCTION OF MULTIPLE ANCHOR 
EFFECTS IN THE JUDGMENT OF FORM¹ 


EDWARD D. TURNER AND WILLIAM BEVAN 
Kansas State University 


The traditional approach in psycho- 
physics has been to hold constant all 
properties of the stimuli to be judged 
except one and to plot responses as a 
function of this variable. Meanwhile, 
perhaps the most obvious character- 
istic of judgmental situations outside 
the laboratory is that stimuli to be 
judged vary among themselves on a 
number of dimensions. A recent 
solution to the problem of multi- 
dimensionality has been to have 
stimuli judged for similarity and to 
express these relationships as dis- 
tances in a Cartesian space (Torger- 
son, 1958). An alternative experi- 
mental strategy, when the stimulus 
dimensions can be identified, consists 
of limiting these dimensions to some 
small number greater than one, and 
allowing them to vary with reference 
to each other in certain prescribed 
ways. This not only allows for an 
assessment of the psychophysical rela- 
tionships involved but may provide 
some information on the processes of 
judgment. 

The present experiment employs 
the method of single stimuli. It differs 
from the usual application in several 
ways: the stimuli differ with respect 
to three different physical dimensions; 
variation on each dimension is in- 
dependent of variation on the other 
two; and three judgments, one for 
each dimension, are made following 
the presentation of each stimulus. 

1 This experiment was performed under 
Contract Nonr-3290 (01), Project NR142-155, 
between Kansas State University and the 
Office of Naval Research. The authors are 


indebted to Paula Oppy and Joan Wyche for 
their help in the collection of data. 


The purpose of the experiment was 
to determine whether or not anchoring 
effects typically obtainable for stimuli 
varying on a single dimension (Wood- 
worth & Schlosberg, 1954) could also 
be obtained for multidimensionally 
varying stimuli.² 


METHOD 


Subjects.—The Ss were 30 female under- 
graduates. They were divided randomly into 
three groups of 10 each. Group A received 
anchor stimuli which deviated from series 
stimuli in size and shape but were of a medium 
lightness. Group B received anchors which 
deviated in color and shape but which were of 
a medium size. Group C received anchors 
which were deviant in color and size but of an 
intermediate shape. All Ss judged all stimuli, 
including anchors, on all three dimensions. 

Stimuli.—The series stimuli consisted of 
gray rectangular shapes, each mounted on 
heavy white (Crescent No. 100) illustration 
board, 31.5 X 22.5 cm., for presentation in a 
Gerbrands tachistoscope. The series members 
differed from each other such that there were 
4 each of 4 different shapes, 4 sizes and 4 
degrees of lightness. In order to keep the 
stimulus series to a manageable length, 16 
combinations of shape, size, and color (light- 
ness) were selected from the 64 possible. This 
was done by arranging the 16 possible size- 
shape combinations in a 4X4 matrix and 
then superimposing on this, in Latin square 
fashion, the 4 degrees of lightness such that 
each color appeared in each column and each 
row only once. The colors were Color-Vu 
grays No. 7, 9, 10, and 11. Their Munsell 
equivalents as well as the other physical 
properties of the stimuli are presented in 
Table 1. 
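
The selection of 16 of the 64 possible combinations can be sketched as a cyclic Latin square laid over the 4 X 4 size-shape matrix. The cyclic construction is an assumption (the text says only "Latin square fashion"), but it does reproduce the row-by-row lightness pattern of Table 1. Sizes, ratios, and Munsell values are taken from that table; the variable names are illustrative.

```python
sizes = [14, 25, 38, 56]                     # approximate areas, sq. cm.
shapes = ["1:1", "1:1.08", "1:1.21", "1:1.27"]  # width-to-length ratios as printed
lightness = [6.5, 5.5, 5.0, 4.5]             # Munsell values

# Cyclic Latin square: each row is the previous row shifted one step right,
# so every lightness value occurs once in each row and once in each column.
square = [[lightness[(col - row) % 4] for col in range(4)] for row in range(4)]

# The 16 series stimuli: one lightness per size-shape cell.
stimuli = [
    (sizes[row], shapes[col], square[row][col])
    for row in range(4)
    for col in range(4)
]

for row in square:
    print(row)
```

Each row of the printed square corresponds to one size, and each column to one shape.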


2 What we here refer to as anchors, Helson 
(personal communication) prefers to call 
predominant stimuli, reserving the term, 
anchor, for deviant stimuli identified by E 
through instructions or otherwise as referent 
rather than series stimuli. Anchors generally 
are more potent than predominant stimuli in 
their influence upon series judgments. 


TABLE 1 

PHYSICAL PROPERTIES OF THE 16 
SERIES STIMULI 

Size             | Shape                    | Lightness
(Approx.         | (Width X Length in Cm.   | (Munsell
Area in Cm.²)    | and Ratio)               | Value)

14               | 3.75 X 3.75 (1:1)        | 6.5
                 | 3.60 X 3.90 (1:1.08)     | 5.5
                 | 3.45 X 4.05 (1:1.21)     | 5.0
                 | 3.30 X 4.20 (1:1.27)     | 4.5
25               | 5.00 X 5.00 (1:1)        | 4.5
                 | 4.80 X 5.20 (1:1.08)     | 6.5
                 | 4.60 X 5.40 (1:1.21)     | 5.5
                 | 4.40 X 5.60 (1:1.27)     | 5.0
38               | 6.25 X 6.25 (1:1)        | 5.0
                 | 6.00 X 6.50 (1:1.08)     | 4.5
                 | 5.75 X 6.75 (1:1.21)     | 6.5
                 | 5.50 X 7.00 (1:1.27)     | 5.5
56               | 7.50 X 7.50 (1:1)        | 5.5
                 | 7.20 X 7.80 (1:1.08)     | 5.0
                 | 6.90 X 8.10 (1:1.21)     | 4.5
                 | 6.60 X 8.40 (1:1.27)     | 6.5


Two similar stimuli were used as anchors 
for each group. These represented extreme 
values on two dimensions and an intermediate 
value on the third (control) dimension. 
Group A received the size-shape anchors. 
These were two relatively large rectangles, 
96 cm.² in area, with a length by width ratio of 
1:1.50. (Their dimensions were 8 X 12 cm.) 
They had Munsell values of 5 and 5.5 and 
thus were intermediate grays. Group A, 
therefore, provided anchor data on size and 
shape and control data for color. Group B 
received the color-shape anchors. These were 
two black 1:1.50 rectangles of intermediate 
(4 X 6 cm. and 5.0 X 7.5 cm.) size. Group B 
thus provided anchor data for the color and 
shape dimensions and control data for size. 
Group C received the color-size anchors, two 
large black rectangles, 9.6 X 10.4 cm. and 
9.2 X 10.8 cm. They were intermediate in 
shape (length to width ratios of 1:1.08 and 
1:1.21, respectively). Group C provided 
anchor data on color and size and control data 
on shape. 

Assuming the stimuli designated as anchors 
to be effective, it was expected that the size 
judgments for the anchored groups would be 
reliably smaller than those of the control, the 
shape judgments would shift toward greater 
squareness, and the color judgments toward 
greater lightness. 

Each S made a total of 72 judgments for 
each dimension: each of the 16 series members 
was presented 3 times and each of the two 
anchors 12 times. The order of presentation 
on the 72 trials was random. 
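
The trial count follows from 16 series stimuli shown 3 times each plus 2 anchors shown 12 times each. A minimal sketch of building one such randomized order is given below; the stimulus labels, the function name `build_trials`, and the seed are hypothetical.

```python
import random
from collections import Counter


def build_trials(seed=0):
    """Return a randomly ordered list of 72 stimulus labels for one S."""
    series = [f"series_{i + 1}" for i in range(16)]   # 16 series members
    anchors = ["anchor_1", "anchor_2"]                # 2 anchors per group
    trials = series * 3 + anchors * 12                # 48 + 24 presentations
    random.Random(seed).shuffle(trials)
    return trials


trials = build_trials()
counts = Counter(trials)
print(len(trials), counts["series_1"], counts["anchor_1"])
```

Anchors therefore make up one third of all presentations.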

Procedure—The Ss were tested individ- 
ually. Presentations of stimuli were at 
intervals of 10 sec. for durations of .5 sec. 
The psychophysical method was the rating 
scale version of the absolute method. Ratings 
were required on all three dimensions for each 
stimulus presentation. Thirteen categories 
were available for each judgment: The shape 
categories varied from 0 (perfectly square) 
to 12 (extremely nonsquare). The size 
categories were —6 (extremely small) through 
0 (neither large nor small) to +6 (extremely 
large). The color categories varied from —6 
(very dark gray) through 0 (neutral gray) 
to +6 (very light gray). The Ss were also 
encouraged to use additional categories at 
either or both ends of any scale when they 
regarded this to be necessary to the expression 
of their judgments. The anchors were in no 
way identified by E as special stimuli. Each S 
recorded her judgments upon a mimeographed 
data sheet provided for her. Median judg- 
ments were computed for each S’s judgments 
of each individual stimulus. Means of these 
medians were then used as cell entries in 
analyses performed upon the data. 
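
The two-step reduction described here (a median for each S on each stimulus, then a mean of those medians across Ss) can be sketched as follows. The ratings are invented for illustration, and the nested-dictionary layout is an assumption about how the data sheets might be coded.

```python
from statistics import mean, median

# Hypothetical size ratings: S -> stimulus -> the three judgments of that stimulus.
ratings = {
    "S1": {"stim_1": [2, 3, 5], "stim_2": [0, 1, 1]},
    "S2": {"stim_1": [4, 4, 6], "stim_2": [-1, 0, 2]},
}

# Step 1: median judgment for each S on each stimulus.
medians = {
    s: {stim: median(values) for stim, values in by_stim.items()}
    for s, by_stim in ratings.items()
}

# Step 2: mean of the medians across Ss, giving one cell entry per stimulus.
stimuli_names = sorted(next(iter(ratings.values())))
cell_entries = {
    stim: mean(medians[s][stim] for s in ratings) for stim in stimuli_names
}

print(cell_entries)
```

Using medians within S limits the influence of a single extreme rating before the group averages are formed.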


RESULTS AND DISCUSSION 


Figure 1 summarizes the data for 
each dimension separately. The an- 
chor data consist of the average 
judgments of the several series stimuli 
made by two groups; the control data 
derive from the judgments of the 
third group. Table 2 presents sum- 
maries of analyses of variance used to 
evaluate these data. Three separate 
analyses were performed, one on the 
data of each dimension. In the 
interest of simplicity of presentation 
the within-Ss sources (Between Stim- 
uli, Stimuli X Ss, etc.) have been col- 
lapsed, so that the summaries indicate 
differences between groups of Ss, each 
of whom is represented by one average 
judgment per dimension. Similarly, 
the Stimuli X Groups interaction is 
not identified. Meanwhile, the be- 
tween-groups source has been parti- 
tioned into predicted differences (an- 
chor vs. no anchor) and nonpredicted 
differences. Since the direction of 
each possible anchor effect can be 
predicted (H1), a one-tailed criterion 
of significance is applied. At the same 
time there is no basis for expecting the 
anchor effect to be greater in one 
anchor group than the other (H0). 
Therefore, a two-tailed criterion is 
used in these cases. 

FIG. 1. Average size, shape, and color anchor effects for the several multidimensional groups. 
(The solid line represents the anchor data, the dotted the control data. The anchor curve for 
size derives from the data of Groups A and C, its control from Group B. The shape anchor 
curve represents data of Groups A and B; its control is Group C. The color data are obtained 
from Groups B and C; its control is Group A. Ordinate: judged magnitude. Abscissa: stimuli.) 


TABLE 2 

SUMMARIES OF ANALYSES OF VARIANCE PERFORMED UPON THE JUDGMENTS FOR EACH 
OF THE THREE DIMENSIONS ON WHICH THE SERIES STIMULI VARIED 

Dimension | Source                                              | df | MS     | F        | Hypothesis
Size      | Between groups                                      |    |        |          |
          |   Anchor (Groups A, C) vs. No Anchor (Group B)      |  1 |  150.4 |  3.20**  | H1
          |   Group A vs. Group C                               |  1 |   31.2 |   .66*   | H0
          | Pooled between Ss                                   | 27 |  47.02 |          |
Shape     | Between groups                                      |    |        |          |
          |   Anchor (Groups A, B) vs. No Anchor (Group C)      |  1 |  283.8 |  4.08**  | H1
          |   Group A vs. Group B                               |  1 |  277.5 |  3.99*   | H0
          | Pooled between Ss                                   | 27 |   69.6 |          |
Color     | Between groups                                      |    |        |          |
          |   Anchor (Groups B, C) vs. No Anchor (Group A)      |  1 | 2597.0 | 31.90*** | H1
          |   Group B vs. Group C                               |  1 |  171.2 |  2.10*   | H0
          | Pooled between Ss                                   | 27 |  81.41 |          |

* P > .05; two-tailed. 
** P < .05; one-tailed. 
*** P < .001; one-tailed. 
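
Each F in Table 2 is the mean square for a single-df contrast divided by the pooled between-Ss mean square for that dimension (df = 1/27). The sketch below recomputes the ratios from the tabled mean squares as a consistency check; the dictionary layout and key names are illustrative, and the values are as read from the table.

```python
# Mean squares from Table 2: (anchor vs. no anchor, anchor group vs. anchor group, error).
mean_squares = {
    "size": {"anchor_vs_control": 150.4, "between_anchor_groups": 31.2, "error": 47.02},
    "shape": {"anchor_vs_control": 283.8, "between_anchor_groups": 277.5, "error": 69.6},
    "color": {"anchor_vs_control": 2597.0, "between_anchor_groups": 171.2, "error": 81.41},
}


def f_ratios(ms):
    """Return the two F ratios for one dimension, each on 1 and 27 df."""
    return (
        ms["anchor_vs_control"] / ms["error"],
        ms["between_anchor_groups"] / ms["error"],
    )


computed = {dim: f_ratios(ms) for dim, ms in mean_squares.items()}
for dim, (f_predicted, f_nonpredicted) in computed.items():
    print(dim, round(f_predicted, 2), round(f_nonpredicted, 2))
```

The recomputed ratios agree with the tabled F values to two decimals.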

Inspection of Fig. 1 indicates the 
simultaneous induction of anchor 
effects for all three dimensions. In 
every case the solid line lies in the 
predicted relationship to the dotted. 
This is supported by the data of 
Table 2. The judgments of the con- 
trol group are significantly different 
from the judgments of the combined 
anchor groups for all dimensions. In 
no case, however, were there reliable 
differences between the two anchor 
groups. Further evidence that the 
differences between groups are anchor 
effects is indicated by the difference 
in slope between each pair of curves, 
When the anchor is above the Series, 
as in the case of size and shape, the 
curves should be most widely sepa- 
rated at their upper end; when it is 
below, the separation should be great- 
est at the lower end. The data of 
Fig. 1 are in line with this expectation. 
Finally, it is interesting to note that 
simultaneous anchor effects may be 
either in the same or opposite direc- 
tions. In the case of Group A, which 
displayed size and shape anchor 
effects, both anchors were above the 
series and the judgmental shifts were 
downward. However, in Group B, 
which displayed color and shape 
anchor effects, and Group C, which 
showed size and color effects, one 
anchor was above and the other below 
the series, and the anchor differences 
were in opposite directions. 

An incidental finding is the dip in 
the shape curves. It will be remem- 
bered that Shape 1 is a perfect square 
and Shapes 2, 3, and 4 are rectangles 
of increasingly greater width. Since 
squares tend to appear taller than 
they are wide (the horizontal-vertical 
illusion), it is not unreasonable that 
Shape 2 is judged more square than 
the square itself. 


SUMMARY 


The purpose of the present experiment was 
to determine if an anchor stimulus which 
differed from its psychophysical series on more 
than a single dimension could effect shifts in 
judgment on each of the dimensions on which 
it differed from the series. Accordingly, Ss
were asked to judge a series of rectangular 
figures which varied in shape, size, and light- 
ness. Anchor stimuli which represented 
marked deviations from the series values on 
two but not on the third dimension were 
included in the order of presentation. Three 
groups of Ss were used so that all combina- 
tions of two dimensions were anchored with 
the third available for control data. Analyses 
of variance performed on the data for each 
of the three dimensions indicated that 
multiple anchoring had occurred. 


REFERENCES 


Torgerson, W. S. Theory and methods of
scaling. New York: Wiley, 1958.

Woodworth, R. S., & Schlosberg, H.
Experimental psychology. (Rev. ed.) New
York: Holt, 1954.


(Received November 30, 1961) 


Journal of Experimental Psychology 
1962, Vol. 64, No. 6, 593-599


DISCRIMINATION AND MEDIATED GENERALIZATION
IN PROBABILITY LEARNING 1


JULIET POPPER SHAFFER


University of Kansas


When responses learned to a given 
stimulus occur in the presence of other 
stimuli which are not physically 
similar to the original, we have an 
example of secondary or mediated 
stimulus generalization. The gen- 
eralization in such cases appears to be 
based on previous experiences of the 
organism being studied. Some of the 
best evidence for mediated generaliza- 
tion comes from studies of what has 
been called semantic generalization, in 
which a response conditioned to a 
word generalizes to other words 
similar in meaning to the original. 
Reviews of the experimental literature 
on semantic generalization may be 
found in Cofer and Foley (1942) and 
Osgood (1953). 

Behavior theorists have attempted 
to account for these phenomena by 
assuming that Ss make implicit re- 
sponses preceding the overt response 
and that these implicit responses 
produce stimuli which partly deter- 
mine the overt response to the pre- 
sented stimulus. For example, in the 
case of generalization from one word 
to another word similar in meaning, 
it has been assumed that there are 
learned mediating responses to words 
which represent their meanings. The 
more nearly synonymous two words 
are, the greater is the similarity of 
these responses, i.e., the greater is the 
physical similarity between the pat- 
terns of stimulation produced by the 

1 This research was conducted at Indiana 
University while the author was a National 
Science Foundation Postdoctoral Fellow. 
The helpful advice of W. K. Estes has been 
much appreciated. 




mediating responses. A response 
learned to a word will also be learned 
to the stimuli produced by the mediat- 
ing response representing the meaning 
of that word. Therefore, by physical 
similarity, the response will generalize 
to the stimuli produced by mediating 
responses to other words similar in 
meaning to the original, generating a 
semantic gradient of generalization 
(Osgood, 1953). 

This paper reports the results of an 
experiment designed to test a model 
for mediated generalization, developed 
within the framework of statistical 
learning theory. The model specifies 
the assumed mediation process more 
precisely than has usually been the 
case, and yields quantitative predic- 
tions of the effects of mediation in a 
specific experimental situation. A 
brief theoretical review will be given 
here; for a full account see Popper 
(1959). 

The mediation model is based on a 
model for discrimination learning 
developed by Burke and Estes (1957). 
Their model applies to discrimination 
problems which consist of a series of 
trials. Each trial is initiated by a 
stimulus to which S responds, and is 
terminated by a reinforcing event. 
A stimulus is conceptualized as a set 
of elements available for sampling by 
S, with each element conditioned to 
one and only one of the response 
alternatives in the situation. Each 
available element has a probability θ
of being sampled on a particular trial. 
The probability of each response is 
equal to the proportion of sampled 
elements conditioned to that response. 


« 


594 


When a reinforcing event terminates 
a trial, all elements in the sample 
become conditioned to the response 
corresponding to that event. 
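
The sampling and conditioning rules just described can be sketched in a few lines of Python. This is a minimal illustration for a single stimulus set, not the Burke and Estes model in full: the function name, the number of elements, the value of θ, the 50/50 starting state, and the rule for an empty sample are all assumptions made here for the example.

```python
import random


def simulate_stimulus_sampling(n_elements, theta, pi_a1, n_trials, seed=0):
    """Simulate one stimulus set under the sampling rules described above.

    n_elements, theta, pi_a1, and the 50/50 starting state are illustrative
    assumptions, not values taken from the paper.
    """
    rng = random.Random(seed)
    # True: element is conditioned to A1; False: conditioned to A2.
    conditioned_to_a1 = [rng.random() < 0.5 for _ in range(n_elements)]
    response_probs = []
    for _ in range(n_trials):
        # Each available element is sampled independently with probability theta.
        sample = [i for i in range(n_elements) if rng.random() < theta]
        if sample:
            # Response probability = proportion of sampled elements tied to A1.
            p_a1 = sum(conditioned_to_a1[i] for i in sample) / len(sample)
        else:
            p_a1 = 0.5  # assumption: S guesses when the sample is empty
        response_probs.append(p_a1)
        # The reinforcing event conditions every sampled element to its response.
        reinforced_a1 = rng.random() < pi_a1
        for i in sample:
            conditioned_to_a1[i] = reinforced_a1
    return response_probs, conditioned_to_a1
```

With pi_a1 set to 1.0, every element that is ever sampled ends up conditioned to A1, so the response probability climbs toward 1; with intermediate values it fluctuates around the reinforcement probability.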

The mediation model is an exten- 
sion of the Burke and Estes approach 
to problems in which mediating 
responses are assumed to be occurring 
and influencing the final overt re- 
sponse. Specifically, it is assumed 
that mediating responses in the pres- 
ence of a stimulus produce cues which 
can be represented as additional 
elements in the set corresponding to 
that stimulus. These elements are 
therefore available for sampling when 
that stimulus is present, and if their 
conditioning status is known, their 
effect on the overt response can be 
predicted. 

In this experiment, Ss were given 
two successive probabilistic discrimi- 
nation problems. Their performance 
on a third problem was predicted on 
the assumption that it would be 
affected in a specified way by mediat- 
ing responses, resulting from the 
training on the two initial problems. 


METHOD 


The first two probabilistic discrimination 
problems will be designated Discrimination a 
and Discrimination b, respectively. On each 
trial of Discrimination a, one of two stimuli, 
a green light or a white light, appeared, 
followed by one of two reinforcing events, the 
letter X or the letter O. Immediately after the
light appeared, S was to respond by saying
either X or O, to indicate which outcome he
expected on that trial. The reinforcing events
were probabilistically related to the stimuli,
i.e., the probability of X or O on each trial
depended only on the stimulus initiating the
trial.

The Ss were then trained on Discrimination b,
which was another two-stimulus, two-response
problem. The stimuli were X and O, and the
reinforcing events were two nonsense
syllables, MAF and KUV. Discriminations a
and b were related in that the reinforcing
events (and responses) of Discrimination a
were the same as the stimuli of Discrimination
b. Interspersed among the trials on
Discrimination b were a few trials on which the
stimulus was a green light or a white light,
as in Discrimination a, but the S was required
to respond with MAF or KUV, as in
Discrimination b. No reinforcing event occurred
on those trials, which will be referred to as
test trials.

Finally, a third problem, Discrimination c,
was given in which the stimuli were the green
and white lights, the reinforcing events were
MAF and KUV, and MAF and KUV each had
probability .50 of occurring, regardless of
which stimulus initiated the trial.

The stimuli, responses, and reinforcing
events in Discrimination a will be denoted by
T1 and T2, A1 and A2, and E1 and E2,
respectively, where reinforcing event Ei means
reinforcement of response Ai. Similarly, the
stimuli, responses, and reinforcing events in
Discrimination b will be denoted by T3 and
T4, A3 and A4, and E3 and E4.

Subjects.—The Ss were 96 Indiana University
students taking the first semester of
introductory psychology. They were assigned
randomly to experimental groups, and
tested individually.

Apparatus.—A vertical black wooden
board, 30 in. high and 36 in. wide, was
supported on a table 30 in. high. A diffusing
screen made of a double layer of sanded
Plexiglas, 4 in. high and 21 in. wide, was
mounted on the board. Three inches below
the center of the screen was a window of
one-way mirrored glass, 2 in. in diameter, which
became transparent only when lighted from
behind. Another window of the same kind
was below the first, with 3 in. between the
centers of the two windows. A door on the
back of the apparatus permitted the insertion
of cards immediately behind the windows.

Two 6.3-v. pilot light assemblies were
mounted 12 in. apart behind the Plexiglas
screen. Colored jewel caps covered the lights
so that from a frontal view the left light
was green and the right light was white. Two
6.3-v., .15-amp. incandescent bulbs were
mounted behind each window, one on each
side. A cam-operated timer controlled the
time intervals during which the appropriate
lights came on.

A 5 × 8 in. index card was behind the
windows on each trial. Some of the cards
used had either X or O typed in pica capitals
so that it would appear in the center of the
upper window when illuminated, and either
MAF or KUV, typed in pica capitals, so that it
would appear in the center of the lower
window when illuminated. Other cards were
blank in either the upper or lower position.




Procedure and experimental design.—Each 
S sat 3 ft. in front of the table which sup- 
ported the stimulus panel. The room was 
dark, except for a 100-w. bulb shining on the 
front of the windows. 

Instructions were given for Discrimination 
a, presenting it as a prediction experiment 
and emphasizing the importance of trying to 
make as many correct choices as possible. 
After 4 practice trials, Ss were given 140
trials on Discrimination a, 70 T1 and 70 T2
trials.

Immediately following this phase, Ss were
given instructions for the remainder of the
experiment: Discrimination b, with test trials
interspersed, and Discrimination c. They
were told that any one of four events could
begin a trial: the green light or the white light,
as before, or X or O. On every trial, they
were to guess whether MAF or KUV would
follow. They were told also that on some of
the trials a blank card would follow their
guesses, indicating that they were not being
informed of the correct answer for those trials.
An additional 98 trials were given, distributed
in the following way. Trials 6 and 17
were unreinforced Discrimination b trials, i.e.,
the stimuli were T3 and T4, in random order,
and no reinforcing event occurred. They
were included so that Ss would have some
experience with unreinforced trials prior to
the test trials, and they have been omitted in
all analyses. Trials 33, 45, 56, and 66 were
test trials, with stimuli T1 and T2 and no
reinforcing events. For half the Ss in each
experimental subgroup, the stimuli appeared
in the order T1-T2-T2-T1, and for the rest in
the order T2-T1-T1-T2. The remainder of the
trials up to Trial 66 were reinforced
Discrimination b trials, 30 T3 and 30 T4 trials.
Trials 67-98 were Discrimination c trials,
16 T1 and 16 T2 trials.
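
As a check on the bookkeeping, the 98-trial layout described above can be written out as a small Python sketch. The trial numbers come from the text; the function name and the string labels are hypothetical and serve only this illustration.

```python
def build_second_phase_schedule():
    """Label Trials 1-98 of the second phase by trial type, as described above."""
    warmup = {6, 17}          # unreinforced Discrimination b trials
    test = {33, 45, 56, 66}   # light stimuli, MAF/KUV responses, no reinforcement
    schedule = {}
    for trial in range(1, 99):
        if trial in warmup:
            schedule[trial] = "b_unreinforced"
        elif trial in test:
            schedule[trial] = "test"
        elif trial <= 66:
            schedule[trial] = "b_reinforced"
        else:
            schedule[trial] = "c"
    return schedule


# Tally how many trials of each type the layout contains.
counts = {}
for kind in build_second_phase_schedule().values():
    counts[kind] = counts.get(kind, 0) + 1
```

The tally gives 2 unreinforced Discrimination b trials, 4 test trials, 60 reinforced Discrimination b trials, and 32 Discrimination c trials, which agrees with the counts stated in the text.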

On each trial of the experiment, the stim- 
ulus appeared for 2 sec., followed immediately 
by the reinforcing event (or a blank white 
background) for 2 sec., and there was a 6-sec. 
intertrial interval. The complete experi- 
mental session lasted about 45 min. 

The probability of reinforcing event Ej
following stimulus Ti will be designated πij.
The Ss were divided into two main
experimental groups. On Discrimination a, for
Group I, π11 was equal to .90 and π21 was
equal to .10, and for Group II, π11 was equal
to 1.00 and π21 was equal to .50. Except for
the different values in Discrimination a,
both groups were treated identically; for
both, the Discrimination b values were π33
equal to 1.00 and π43 equal to .00.

The sequences of trials on Discrimination a
were randomized with the restriction that
within each successive block of 20 trials, each 
combination of stimulus and reinforcing event 
was presented a number of times exactly equal 
to its expected number, considering the rein- 
forcement probabilities. The sequences of 
trials on Discrimination b were randomized 
with the same restriction within each succes- 
sive block of 10 trials. On Discrimination c, 
the randomization was restricted in the same 
manner over the total set of 32 trials. On all 
problems, a different randomization was used 
for each S. The design was counterbalanced 
by having eight subgroups, with different 
identifications of the stimuli and reinforcing 
events, within each main experimental group, 
making a total of 16 subgroups, 6 Ss in each. 
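
The restricted randomization for Discrimination a can be sketched as follows. The sketch assumes that each 20-trial block holds 10 trials of each stimulus (which follows from 70 T1 and 70 T2 trials over seven blocks if the stimuli are balanced within blocks) and that the expected counts are whole numbers; the function names and the tuple labels are hypothetical.

```python
import random


def restricted_block(cell_counts, rng):
    """Return one shuffled block containing each (stimulus, event) pair
    exactly cell_counts[(stimulus, event)] times."""
    block = [pair for pair, n in cell_counts.items() for _ in range(n)]
    rng.shuffle(block)
    return block


def discrimination_a_sequence(pi_11, pi_21, n_blocks=7, per_stimulus=10, seed=0):
    """Build a Discrimination a sequence from successive restricted blocks.

    pi_11 and pi_21 are the probabilities of E1 after T1 and after T2.
    """
    rng = random.Random(seed)
    cells = {}
    for stim, p_e1 in (("T1", pi_11), ("T2", pi_21)):
        n_e1 = round(per_stimulus * p_e1)  # expected count within one block
        cells[(stim, "E1")] = n_e1
        cells[(stim, "E2")] = per_stimulus - n_e1
    sequence = []
    for _ in range(n_blocks):
        sequence.extend(restricted_block(cells, rng))
    return sequence
```

For Group I (.90 and .10) each block then contains nine T1-E1 trials, one T1-E2 trial, one T2-E1 trial, and nine T2-E2 trials in random order; passing a different seed for each S would give each S a different randomization.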


RESULTS 


Discriminations a and b.—The proportion
of Ai responses on Tj trials,
within a given block of trials, will be
designated P(Ai|Tj). The changes in
P(A1|T1) and P(A1|T2) for both
groups, over 20-trial blocks on
Discrimination a and 10-trial blocks on
Discrimination b, are illustrated in
Fig. 1 and 2. According to the Burke
and Estes discrimination model, the
final mean probabilities of response A1
given stimulus T1 and response A1
given stimulus T2 should both be
between .10 and .90 for Group I, and
above .50 for Group II. However,
the final P(A1|T2) for Group II, .44,
is significantly below .50 (t = 2.08,
P < .05). In Group I, both final
proportions are outside the theoretical
limits: P(A1|T1) = .91,
P(A1|T2) = .06. Since P(A1|T1) and
P(A2|T2), i.e., 1 − P(A1|T2), are
measures obtained under identical
experimental conditions in Group I,
they were combined in order to get an
overall test of the deviation from
theoretical bounds in that group. As
the distribution of scores is highly
skewed, there is no really adequate
test for the statistical significance of
the deviation from .90. The obtained
deviation, .03, is 1.82 times its
standard error. Furthermore, a t test
of the difference between the
proportions on Blocks 6 and 7 indicates a
significant increase (t = 3.21, P < .01),
suggesting that continued trials might
have led to a larger discrepancy.

FIG. 1. Mean P(A1|T1) and P(A1|T2) over
20-trial blocks for Groups I and II. (Ordinate:
proportion of A1 responses; abscissa: blocks
of 20 trials.)

FIG. 2. Mean P(A3|T3) and P(A3|T4) over
10-trial blocks for Groups I and II. (Ordinate:
proportion of A3 responses; abscissa: blocks
of 10 trials.)


TABLE 1

PROPORTIONS OF A3 RESPONSES OVER FOUR-TRIAL
BLOCKS ON DISCRIMINATION c


Test trials.—Two T1 test trials and
two T2 test trials were given in order
to investigate the dependence among
successive unreinforced responses. A
preliminary study had shown that a
series of unreinforced test trials did
not give independent estimates of
response probability, since most Ss
adopted a consistent pattern, always
making one response to one stimulus
and the other response to the other
stimulus. Chi square tests were used
to investigate response dependence on
these trials.2 The responses on the
first T1 test trial and the first T2 test
trial did not deviate significantly from
independence, while responses on the
second test trial with each stimulus
were significantly dependent in the
direction suggested by the preliminary
study. Therefore, only the first T1
test trial and the first T2 test trial for
each S were used in testing the
predictions derived from the model.
The observed proportions on
these test trials were: For Group I,
P(A3|T1) = .73 and P(A3|T2) = .27;
for Group II, P(A3|T1) = .58 and
P(A3|T2) = .44. The difference
between the two proportions was
significant for Group I (χ² = 12.97,
P < .001), but not for Group II
(χ² = 1.16, P > .10).
Discrimination c.—The 16 T1 trials
and the 16 T2 trials on Discrimination
c were each divided into four 4-trial
blocks, and the proportion of A3
responses in each block was computed
for Groups I and II. The results are
given in Table 1. It had been expected
that, as training progressed on
Discrimination c, P(A3|T1) and
P(A3|T2) would change towards .50.
To investigate changes, the difference
between the proportion of A3
responses on the first 8 trials and the
last 8 trials was computed for each
type of trial and each group. The
significance of the differences was
evaluated with t tests, and only the
difference for Group I on T1 trials was
significant (t = 2.94, P < .01). Since
so little change occurred over the 32
trials on Discrimination c, the
proportions over all trials were used in
further analyses.

2 The model does not imply strict
independence of responses, but the expected
degree of dependence cannot easily be
determined, and in any case would be so
small that the hypotheses of strict
independence provide a very close approximation
to the predictions which could be derived.

An analysis of variance was carried
out, using the proportions of A3
responses over the trials of
Discrimination c, to determine the effects
of group, subgroup, type of trial
(T1 or T2), and the interactions among
these. No effects even approached
significance except type of trial
(F = 14.76, P < .001). Individual t
tests indicated that the difference
between P(A3|T1) and P(A3|T2) was
significant beyond the .02 level for
each of the two experimental groups.


DISCUSSION


In specifically applying the mediation
model to the experiment reported here,
the assumed situation on the test trials
will be discussed first. On a test trial, a
stimulus from Discrimination a was
presented, but S was required to make one
of the two responses learned in
Discrimination b. It is assumed that, on the
presentation of T1 or T2, S responded
implicitly with A1 or A2, the responses
conditioned to these stimuli in
Discrimination a. The probability of each was
assumed to be equal to the probability of
the same overt response, given that
stimulus, at the end of training on
Discrimination a. It is assumed that the
implicit response A1 produced stimulus
elements which were a subset of the
elements associated with the presence of
T3 in Discrimination b, and that the
same relationship held for A2 and T4.
The probability with which these
elements were conditioned to A3 or A4 was
determined, therefore, by the training on
Discrimination b. The predicted
response probabilities on the test trials are
therefore a function of the training on
both Discriminations a and b.

In Discrimination c, it is assumed that
the initial probabilities of responses A3
and A4 were equal to their probabilities
on the test trials, and that the
probabilities would have gradually approached
.50 as training progressed. No
predictions about the rate of change can be
derived from the model in its present
form.

For this experiment, the model implies
that P(A3|T1) should be greater than
P(A3|T2) for both groups. This is a
result of the reinforcement probabilities
on the discrimination problems. In the
presence of T1, an implicit A1 response
should be more probable than an implicit
A2 response in both groups, because of
the Discrimination a training. Therefore,
with high probability, a subset of
the elements associated with stimulus T3
should become available for sampling on
those trials, and those elements have a
very high probability of being
conditioned to response A3 due to the
Discrimination b training. As a consequence,
A3 should be the more frequent
response on the T1 test trial, and
Discrimination c trials. On the other hand,
using the same reasoning, A3 should be
the less frequent response on the T2 test
trial, and Discrimination c trials.
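
The ordering argument can be illustrated with a deliberately simplified calculation. The sketch below assumes, purely for illustration, that the overt response is governed only by the mediated elements, and that elements of T3 and T4 are conditioned to A3 with probabilities 1.00 and .00 (the Discrimination b reinforcement values). The function name is hypothetical, and this mixture is not the quantitative derivation given in Popper (1959); it is meant only to show why the predicted direction of the difference follows from the training.

```python
def mediated_prediction(p_a1_given_t, p_a3_given_t3=1.0, p_a3_given_t4=0.0):
    """P(A3 | light) if the response were driven only by mediated cues.

    An implicit A1 (probability p_a1_given_t) brings in T3 elements; an
    implicit A2 brings in T4 elements. This pure-mediation mixture is an
    illustrative simplification, not the derivation in Popper (1959).
    """
    p_a2_given_t = 1.0 - p_a1_given_t
    return p_a1_given_t * p_a3_given_t3 + p_a2_given_t * p_a3_given_t4


# Final Discrimination a proportions of A1 for Group I, as reported in RESULTS.
group_i = {"T1": 0.91, "T2": 0.06}
pred_i = {stim: mediated_prediction(p) for stim, p in group_i.items()}
```

Under these assumptions the sketch gives a larger value for T1 than for T2, the same direction as the observed Group I test-trial proportions (.73 and .27). The magnitudes are not expected to match, since the simplification ignores any elements contributed by the lights themselves.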

This prediction was confirmed on both 
the test trials and Discrimination c trials, 
with only the difference on the test trials 
for Group II failing to reach statistical 
significance. This result indicates that a 
mediation process was occurring during 
the trials, since the experiment was 
designed to insure against any possibility 
that physical similarity could account 
for the generalization of responses from 
the stimuli of Discrimination b to those 
on the test trials and Discrimination c. 

The model implies further that there
should have been a preponderance of A3
responses on the test trials and
Discrimination c trials for Group II. This is
because the average probability of an
implicit A1 response across both types of
test trials and Discrimination c trials
should have been greater than the
probability of an implicit A2 response,
due to the asymmetrical reinforcement
probabilities in Discrimination a.
Therefore, the implicit response should have
produced a subset of the elements of
stimulus T3 more than half the time,
making the response A3 more likely than
A4. Specifically, the average of P(A3|T1)
and P(A3|T2) should be greater than .50
for Group II. The obtained averages,
.51 on the test trials and .52 on the
Discrimination c trials, are very close to
.50, and their deviations from it do not
approach statistical significance.

This failure of the model is reflected in
deviations from the quantitative
predictions. On the test trials, both
proportions for Group I, and P(A3|T2) for
Group II, were consistent with the
specific predictions, while P(A3|T1) for
Group II was substantially below the
predicted value. For derivations and
tests relating to the quantitative
predictions, see Popper (1959).

The final proportion of A1 responses on
T2 trials for Group II was significantly
below the minimum value predictable
from the Burke and Estes discrimination
model. That model implies furthermore
that the asymptotic mean proportion of
A1 responses for Group II over both T1
and T2 trials should have been .75. The
obtained proportion on the last block,
.70, was significantly below the
predicted proportion (t = 3.56, P < .001).
In an experiment performed by Estes
and Burke (1955) testing the model,
they used the same values as those for
Group II of the present experiment.
Their observed mean proportion of A1
responses over the last block of trials was
approximately .71 (as estimated from
the published curves). Thus, although
P(A1|T1) and P(A1|T2) were considerably
different in the Estes and Burke
experiment as compared with the present
experiment, their mean value was almost
the same in the two experiments, and was
below the predicted value.
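
The arithmetic behind this comparison is small enough to write out. The sketch assumes that the predicted .75 corresponds to the mean of the two Group II reinforcement probabilities (1.00 and .50), which agrees numerically with the value stated in the text, and that the last-block mean of .70 and the final P(A1|T2) of .44 refer to the same block; the variable and function names are hypothetical.

```python
def mean_reinforcement_probability(pi_values):
    """Average probability of E1 over equally frequent stimuli."""
    return sum(pi_values) / len(pi_values)


# Group II reinforcement probabilities for E1 after T1 and after T2.
predicted_mean = mean_reinforcement_probability([1.00, 0.50])

# Reported for the last block: mean P(A1) = .70 and P(A1|T2) = .44,
# which together imply P(A1|T1) because T1 and T2 were equally frequent.
observed_mean = 0.70
p_a1_given_t2 = 0.44
implied_p_a1_given_t1 = 2 * observed_mean - p_a1_given_t2

# How far the observed mean falls short of the predicted mean.
shortfall = predicted_mean - observed_mean
```

On these figures the implied P(A1|T1) is about .96 and the shortfall about .05, both only approximate because the reported proportions are rounded to two places.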

The observed results on Discrimination
a trials deviated from the predictions
based on the Burke and Estes model, 
then, in a specific way: the theoretically 
more frequent response did not occur as 
frequently as predicted. The same 
description would apply to the deviation 
of the observed results on the test trials 
and Discrimination c from the predic- 
tions derived from the mediation model 
proposed here. While no explanation 
for the discrepancy is suggested by the 
results, the fact that both models err in 
the same way suggests that some as- 
sumptions common to them are inade- 
quate in this experimental context. Since 
all of the assumptions of the Burke and 
Estes discrimination learning model are 
incorporated into the mediation model, 
modification of the more general assump- 
tions of the discrimination model seems 
to hold the greatest promise for achieving 
a more adequate quantitative formula- 
tion of the process of mediated gen- 
eralization. 


SUMMARY 


This experiment was designed to test a
quantitative model, based on statistical
learning theory, for mediated generalization. The
Ss were given training on two discrimination
problems (a and b). These problems
consisted of a series of trials, each trial beginning
with the appearance of one of two stimuli,
with Ss required to guess on each trial which
one of two possible outcomes would follow
the presented stimulus. Each outcome had a
prearranged probability of following each of
the stimuli. Discriminations a and b were
related in that the possible outcomes on
Discrimination a were the stimuli with which
trials began on Discrimination b. The Ss
were then given Discrimination c, and their
performance on it was predicted on the
assumption that it would be affected in a
specified manner by mediating responses
resulting from the training on Discriminations
a and b. Specifically, the trials of
Discrimination c began with presentation of one
of the stimuli from Discrimination a, with Ss
required to guess which of the two outcomes
used in Discrimination b would follow. The
probabilities of their initial guesses in this
case were predicted on the assumption that
they would first respond covertly on the basis
of the outcomes of Discrimination a, and that
their covert response would produce internal
stimulation similar to the corresponding
stimulus on Discrimination b. Stimuli from the
covert responses would therefore mediate
generalization of the responses learned in
Discrimination b to Discrimination c.

The results indicated a significant effect of 
the pretraining on the final problem, along the 
lines predicted from the model. The precise 
quantitative predictions were only partially 
confirmed. The discrepancies between ob- 
served and predicted results were compared 
with discrepancies of a similar nature between 
observed data on discrimination problems 
and predictions based on a statistical model 
for discrimination learning. 


REFERENCES 


Burke, C. J., & Estes, W. K. A component
model for stimulus variables in
discrimination learning. Psychometrika, 1957, 22,
133-145.

Cofer, C. N., & Foley, J. P., Jr. Mediated
generalization and the interpretation of
verbal behavior: I. Prolegomena. Psychol.
Rev., 1942, 49, 513-540.

Estes, W. K., & Burke, C. J. Application
of a statistical model to simple
discrimination learning in human subjects. J. exp.
Psychol., 1955, 50, 81-88.

Osgood, C. E. Method and theory in
experimental psychology. New York: Oxford
Univer. Press, 1953.

Popper, J. Mediated generalization. In
R. R. Bush and W. K. Estes (Eds.),
Studies in mathematical learning theory.
Stanford Univer. Press, 1959. Pp. 94-108.


(Received December 6, 1961) 


Journal of Experimental Psychology 
1962, Vol. 64, No. 6, 600-607 


SEMANTIC SATIATION AND PAIRED-ASSOCIATE 
LEARNING 1


R. N. KANUNGO, W. E. LAMBERT, AND S. M. MAUER
McGill University 


The phenomenon of satiation has 
been described by Smith and Raygor 
(1956) as “the reduction in the 
effectiveness of a stimulus with continued
exposure.” Two different
methods have been used to produce
the satiation effect on verbal stimuli.
One involves the overt verbal repeti- 
tion of the stimulus while the other 
relies on prolonged visual exposure to 
the stimulus. The verbal satiation 
effect has also been observed in 
various ways. For instance, Bassett
and Warne (1919) reported lapses of 
the meaning of words following their 
verbal repetition, and, more recently, 
Lambert and Jakobovits (1960) re- 
ported measurable decrements in the 
intensity of semantic ratings of con- 
tinuously repeated words. Using the 
prolonged visual exposure method, 
Smith and Raygor (1956) demon- 
strated that a word loses its familiarity 
in the sense that associational re- 
sponses to a stimulus word become 
uncommon. 

The present studies explored the 
role of the satiation process in paired- 
associate learning. The main ques- 
tion considered was whether the 
reduction of the meaning of words has 
a detrimental effect on subsequent 
acquisition tasks involving those very 
words (Exp. 1). In view of the role 
that meaning plays in the response 


1 This research was supported in part by 
the Canadian Defense Research Board, 
Grant 9401-10, and in part by a subvention 
to W. E. Lambert from the Carnegie Cor- 
poration of New York. 

2 Now at Ravenshaw College, Cuttack,
India.


position of the paired-associate tasks 
(Cieutat, Stockwell, & Noble, 1958), 
it was decided to administer the 
satiation treatment to response ele- 
ments of S-R pairs. A second experi- 
ment (Exp. II) was performed to 
study the role of interpolated semantic 
satiation on the recall of responses of 
the paired associates. 


EXPERIMENT I 
Method 


Subjects. —The Ss were 30 undergraduate 
students. None had previously participated 
in a similar experiment. 

Material and apparatus.—Using nonsense 
syllables and words as stimulus and response 
members, respectively, two lists of paired 
associates, each containing eight pairs, were 
prepared. Nonsense syllables were chosen 
from Hull's list of less than 20% association 
value (Hilgard, 1958), and the response 
words were chosen on the basis of their high 
frequency of usage (Thorndike & Lorge, 
1944) and their high connotative meaning 
(Jenkins, Russell, & Suci, 1958). Each list 
was printed on a strip of paper in five different 
random orders in a manner suited to the 
standard anticipation procedure with a 
memory drum. The stimulus term alone was 
presented for 3 sec, and immediately following 
it the stimulus-response pair was presented 
for 3 sec. Then followed the next stimulus 
exposed for 3 sec, and so on. The intertrial 
interval was 6 sec. 

Another eight words were chosen as controls
on the same basis as described above for
response words, except that each of them was
made equal in length to a response word of the
second list. These words were used as controls
in the sense that they were not to enter
into the learning task after they had been
given satiation treatment. Care was taken
that the control words were neither
structurally nor semantically related to the
response words of the paired associates which
were to be learned.

Three semantic differential scales (Good-Bad,
Active-Passive, Strong-Weak) representing
the three major factors of connotative
meaning (Osgood, Suci, & Tannenbaum, 
1957) were used for measuring the intensity 
of semantic ratings of words. Each paired- 
associate response word and control word was 
printed on a separate 3 X 5 in. index card. 
Each semantic scale was also printed on a 
separate card. All cards were placed in a 
Kardex folder so that E could expose them in 
a predetermined random order, one at a time, 
first a word, and then a semantic scale along 
which S gave his ratings of the immediately 
preceding word. 

Procedure.—All 30 Ss were tested in- 
dividually. Initially, S was presented the 
first paired-associate list (List I) with 
standard instructions for the anticipation 
procedure involving the use of a memory 
drum. Before the actual presentation of the 
list, S was made familiar with the anticipation 
procedure by a single presentation of two 
practice pairs. 

Three consecutive successful anticipations 
were considered as the learning criterion. On 
the basis of their learning scores, Groups C 
(control) and E (experimental), equated for 
both trials and errors, were formed for the 
main stage of the experiment. There were 
15 Ss in each group. 

The main part occurred approximately 1 
wk. after each S's initial testing. For each S
of Group E, the normal semantic profile was 
obtained for each of the eight response words 
of the second paired-associate list (List II). 
The procedure was the same as that used by 
Lambert and Jakobovits (1960). Briefly, 
each word was exposed for 1 sec. and then S 
was asked to indicate the appropriate 
semantic placement by pointing to one of the 
seven positions on the semantic scale. Then, 
for the satiation treatment, each of the re- 
sponse words was again exposed for 1 sec., 
and S was asked to repeat the word aloud 
continuously for 15 sec., at a rate of 3-4 
repetitions per sec. Immediately after the 
repetition, E exposed a semantic scale and
S made his rating for the word. This procedure
was repeated three times for each of the eight 
words, one time for each semantic scale. The 
words and the scales were presented in an 
order which maximized the separation of re- 
occurrence of a word and a scale. For Group 
C, however, the eight control words were 
used instead of the List II response words.
From each S of Group C, first, the normal 
semantic profile was obtained for each of the 
control words, and then satiation treatment 
was administered to these words. Thus the 
Ss of Group C were given exactly the same
type of treatment as given to Group E, except
that the eight words which were rated and 
satiated were not those to appear as response 
words in the paired-associate list. 

Immediately after the satiation treatment, 
each S of Groups E and C was presented the 
second paired-associate list on the memory 
drum with exactly the same instructions as 
given for learning List I. The same procedure 
and learning criterion as described for the 
initial stage were used again. 


Results 


Both the trial and the error meas- 
ures for learning of List I make it 
clear that Groups C and E were in fact 
equated for the main stage of the 
experiment. The mean number of 
trials to reach criterion for Group C
was 10.20 (SD = 3.17), and for Group 
E was 10.07 (SD = 2.46). Likewise, 
the mean error scores for Groups C 
and E were 20.00 (SD = 12.58) and 
20.07 (SD = 9.86), respectively. 

An examination of Table 1 indicates 
that for Group C, the satiation treat- 
ment of the control words led to a 
significant decrement in their rated 
meaning. For Group E, however, the
meaning decrement does not quite
reach significance (.05 < P < .10).
A t test applied to the mean satiation 
scores of both groups revealed no 
reliable differential effect of the satia- 
tion treatment on the two groups 
(t = .55). Since Groups C and E do 
not differ significantly with respect to 
their satiation scores, the data from 
both the groups were combined to see 
if the overall effect of satiation treat- 
ment is to reduce the meaning in- 
tensity of the words. The combined 
mean semantic rating scores presented 
in Table 1 show that the meaning 
decrement is significant (P < .01). 

TABLE 1
EFFECT OF SATIATION TREATMENT ON THE SEMANTIC PLACEMENT OF WORDS

              Before Satiation   After Satiation          Change
Group    N     Mean      SD       Mean      SD      Mean   SD diff.    t
C       15     4.20     1.68      3.88     1.94     0.32     0.52     2.29
E       15     4.68     1.77      4.47     1.70     0.21     0.41     1.94*
C + E   30     4.44     1.90      4.18     1.84     0.26     0.47     3.30

Note.—Entries are average polarity scores per word over the sum of three semantic scales.
* .05 < P < .10.


The effect of satiation of response
words on the acquisition of the second
paired-associate list is shown in Table
2. Group C, given satiation treat-
ment for control words immediately
before learning, was significantly su-
perior to Group E with respect to
acquisition of the list. In terms of
error scores the difference between the
groups is significant beyond the
.01 level, but in terms of trials to
criterion, the difference is not reliable
(.05 < P < .10).
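
The error-score comparison can be checked against the summary statistics reported for List II (Group C: mean 8.67, SD 5.11; Group E: mean 14.47, SD 5.35; 15 Ss per group). The sketch below is an illustration only, not the authors' computation. It assumes a pooled-variance Student t, and it assumes that the printed SDs were computed with N rather than N - 1 in the denominator; the function name is hypothetical.

```python
import math


def pooled_t_from_summary(mean_a, sd_a, n_a, mean_b, sd_b, n_b):
    """Pooled-variance t for two independent groups, from summary statistics.

    Assumption (not stated in the text): each SD was computed with N in the
    denominator, so a group's sum of squares is n * sd**2.
    """
    sum_sq = n_a * sd_a ** 2 + n_b * sd_b ** 2
    df = n_a + n_b - 2
    pooled_var = sum_sq / df
    std_err = math.sqrt(pooled_var * (1.0 / n_a + 1.0 / n_b))
    return (mean_b - mean_a) / std_err, df


# Error scores on List II: Group C vs. Group E.
t_errors, df_errors = pooled_t_from_summary(8.67, 5.11, 15, 14.47, 5.35, 15)
print(f"t({df_errors}) = {t_errors:.2f}")
```

Under these assumptions the result is about 2.93 on 28 df, close to the value printed for error scores.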


TABLE 2

EFFECT OF SATIATION TREATMENT OF
RESPONSE WORDS ON THE LEARNING
OF PAIRED ASSOCIATES

                  Trials            Errors
Group    N     Mean     SD       Mean     SD
C       15     6.47    2.12      8.67    5.11
E       15     8.20    2.45     14.47    5.35
t              1.20*             2.94**

 * .05 < P < .10.
** P < .01.


Discussion


Two general conclusions can be drawn
from the results of the study. First, in
support of the earlier findings of Lambert
and Jakobovits (1960), the study shows
that the overall effect of the satiation
treatment of words is to reduce the
intensity of their meaning. The reason
for not obtaining a significant satiation
effect in Group E, while Group C showed
such an effect, is unclear. However,
there is one possibility. It will be
observed that in Group E, the initial
ratings of the response words are higher
than the initial ratings of the control
words in Group C (see Table 1). Such
higher semantic ratings imply greater 
polarization of judgments on the part 
of Ss in Group E. According to Osgood,
Suci, and Tannenbaum (1957, pp. 155 ff.)
polarization of judgments is an index of
habit strength. Thus it would be ex-
pected that in Group E the “word-
meaning” habit is stronger than the
similar habit in Group C. Consequently, 
Group E would show stronger resistance 
than Group C to any semantic change as 
a result of satiation treatment. 

The second and most interesting find- 
ing is that satiation treatment applied to 
response words has a negative transfer 
effect on the later learning of a paired- 
associate list. Lambert and Jakobovits 
(1960) conceptualized the phenomenon 
of semantic satiation as “a cognitive 
form of reactive inhibition” and related 
it to Osgood’s theory of representational 
mediation processes. Their explanation 
could account for the superiority of 
Group C over Group E in paired- 
associate learning by assuming that 
reduction in the meaning of response 
members makes them more difficult to 
associate. However, the results can also 
be accounted for in terms of principles of 
associative learning. When a response 
member (R) is continuously repeated, 
the different associations elicited by the 
word (m components) may gradually 
extinguish whereas the R-R connection 
gets strengthened. This could be an
instance where experimentally developed
frequency of stimulation (n) may lead to a
decrease in m. Decrease in meaning as a
function of satiation treatment, there-
fore, can be interpreted in terms of
increasing S's tendency to connect the
word with itself rather than to any of its
common associates.

Thus the effect of satiation of response 
words on subsequent acquisition can be 
interpreted in terms of transfer from one 
learning situation to another. For the 
experimental group, the meaning of the 
response words decreased possibly be- 
cause of the formation of an association 
of the response word with itself which 
would produce an impairment in the 
subsequent learning of the paired associ- 
ates. The situation is analogous to 
developing R-R connections for the ex- 
perimental group where all the m com- 
ponents (“hooks” or associations) of R 
extinguished, and similarly X-X con- 
nections for the control group where all 
the m components of R remain un- 
affected before S-R learning. Extinction 
of m components of R before learning for 
the experimental group would explain 
the superiority of the control group. The 
importance of m components in verbal 
learning is well recognized (Noble, 1952). 

More recently Cieutat (1960) in trying 
to clarify some of the conflicting data 
concerning the locus of familiarization 
and its effect on paired-associate learn- 
ing, noted that, “familiarity only with 
the response member inhibits learning” 
(p. 274). It should be observed that his 
method of familiarization involved con- 
tinued visual presentation for 60 sec. 
similar to the prolonged visual exposure 
method of satiation. To explain his 
results he argues “that the monotony of 
continued visual presentation evokes an 
inhibiting influence” (p. 274). 

Another possible interpretation of the 
present results makes use of response 
similarity. The prelearning satiation 
treatment given to the response words 
reduced their meaning, possibly making 
them more alike semantically. If so, one 
would expect to find more intralist 
response competition for Group E than 
for Group C. An examination of errors 
revealed that 61% of all errors for
Group E are intralist intrusions in com-
parison with 67% for Group C, a com-
parison which rules out this inter-
pretation.


EXPERIMENT II


We were interested in extending 
this line of reasoning to another 
aspect of verbal learning. The pres- 
ent study compared the effects of the 
satiation treatment on stimulus and 
response members of paired associates 
when the treatment was presented 
after the associates had been learned. 
In this case both stimulus and re- 
sponse members were meaningful 
words. Use was made of a simple 
retroactive inhibition design. During 
the original learning phase, the S-R 
connections were established, while 
during the interpolated phase either 
stimulus (for one group) or response 
elements (for a second group) were 
given the satiation treatment, and 
finally recall of response elements was 
tested when stimuli were presented. 


Method 


Subjects.—The Ss were 52 university
students. None had previously participated 
in an experiment of this type. 

Materials and apparatus.—Several quite 
different methodological procedures were em- 
ployed in Exp. II. Using meaningful words 
as stimulus and response members, a list of 12 
paired associates was prepared. The words 
were chosen on the basis of their high fre- 
quency of usage in print (Thorndike & Lorge, 
1944) and their high connotative meaning 
(Jenkins et al., 1958). Each of the 12 pairs 
was judged (by 12 students acting as judges) 
to have little or no immediate association be- 
tween its stimulus and response members. 

Each paired associate was printed on a 
separate 3 X 5 in. card. Further, each stim- 
ulus and response member was printed on a 
separate card. These cards were placed in a 
Kardex folder so that E could expose them in 
a predetermined random order. Each stim- 
ulus word was placed immediately before the 
paired associate to which it corresponded so 
that E could expose the stimulus-response 
pair after the exposure of the stimulus word in 
a reliably constant manner with a minimum 
of delay. 




Three semantic scales were used for 
semantic ratings. These were: Good-Bad, 
Active-Passive, Strong-Weak. 

Procedure.—The study used two test con-
ditions, a “Stimulus condition” and a “Re-
sponse condition.” Each test condition was
in the form of a retroactive inhibition para- 
digm and was divided into three phases. 

Learning phase.—This phase was identical 
for both test conditions. Each S was given 
four trials, a complete trial consisting of the 
exposure, in a predetermined, random order, 
of each stimulus member of the paired associ- 
ates followed by the stimulus-response pair. 
Each stimulus member and each pair was 
exposed for 3 sec. and a 10-sec. delay was 
given between trials. 

After four learning trials Ss were assigned 
to either the Stimulus or Response condition 
depending on their learning efficiency, equat-
ing the two groups on paired-associate
learning ability.

Stimulus condition.—First, S's normal 
semantic profiles for all 12 stimulus words 
were obtained. Each word was presented 
three times (for 1 sec. each time) for measure-
ment on the three semantic scales. The words
and scales were also presented in a pre- 
determined randomized order. 

Each of the 12 stimulus words was placed
in one of two categories, Satiation Category
(SC) or Nonsatiation Category (NSC). An
attempt was made to group one half of the
stimulus members of paired associates which
had been learned by the fourth learning trial
in SC, and the other half in NSC. Cases
where odd numbers of associations had been
learned were balanced through the total
group. Further, one half of the stimulus
members of paired associates which had not
been learned by the fourth learning trial were
grouped in SC, the other half in NSC.
Each word in SC was exposed for 1 sec.
and Ss were asked to repeat the word aloud
for 15 sec. at a rate of 2-3 repetitions per sec.
Immediately after the continual repetition,
Ss rated the word on one of the three semantic
scales. Each word in NSC was exposed for 1
sec. and Ss rated it immediately after ex-
posure. After the list had been subjected to
this treatment once (each word in SC re-
ceiving satiation treatment and being meas-
ured on one scale, and each word in NSC
merely measured on one scale) all words were
then rated in the usual way on the remaining
scales. That is, each stimulus word was ex-
posed for 1 sec. and then rated immediately
on one of the two remaining scales. Note that
the satiation treatment was only given once,
before one of the semantic ratings, not before
each rating as was the case in Exp. I. Initial
and final semantic ratings were subsequently
compared.

Response condition.—The procedure for
this condition was identical to that for the
Stimulus condition except that the response
rather than the stimulus members were
grouped into SC or NSC categories and then
given the satiation treatment.

It can be seen from this procedure that
words in SC and words in NSC were exposed
an equal number of times to Ss. Furthermore,
due to the equal division of the words belong-
ing to correctly learned paired associates into
SC and NSC in each test condition, a basis
was established for comparing the effects of
satiation and nonsatiation treatments on the
recall of learned paired associates. Likewise,
due to the division of the study into two test
conditions, a basis was created for comparing
the effect of satiation treatment given to
stimulus and response words on their recall.


TABLE 3

AVERAGE CHANGE IN POLARITY OF PAIRED-ASSOCIATE MEMBERS
OVER THE SUM OF THREE SCALES: EXP. II

                  First Rating    Second Rating         Change
                  Mean     SD     Mean     SD      Mean     SD       t
Stimulus
  Satiated        4.18    1.25    ...      ...     .59      .99     2.95**
  Nonsatiated     5.09    1.16    ...      ...     .20      .62     1.66
Response
  Satiated        4.46    ...     ...      ...     ...      ...     3.40**
  Nonsatiated     4.66    1.38    ...      ...     ...      ...      .50

Note.—Twenty-six Ss took part in each of the test conditions (Stimulus and Response).
** P < .01.


TABLE 4

EFFECT OF SATIATION TREATMENT OF PAIRED-ASSOCIATE MEMBERS
ON THE RECALL OF PAIRED ASSOCIATES

                      Mean Scores            Drop in Recall       SC vs. NSC Words
                  On         After
Condition       Trial 4   Interpolation      Mean      SD     Mean diff.  SD diff.    t
Stimulus
  SC Words       3.00        1.73            1.27     .86
  NSC Words      2.85        2.27            0.58     .84        ...        ...     ...***
Response
  SC Words       2.65        1.96            0.69     .82
  NSC Words      2.65        1.84            0.81     .68        ...        ...     ...

Note.—Twenty-six Ss took part in each of the conditions.
*** P < .001.


Recall stage.—This stage of the study was
identical for both test conditions. The Ss 
were shown each stimulus word for 3 sec. and 
asked to recall the response word paired 
with it. 


Results 


Table 3 presents the mean change 
in polarity scores for stimulus and 
response words, respectively. It can 
be seen that in both cases the re- 
duction in intensity of meaning as 
measured by the semantic differential 
is significant for words given satiation 
treatment (P < .01 for both stimulus 
words and response words). On the 
other hand, words not given satiation 
treatment showed no significant se- 
mantic change. 

Table 4 presents the mean number 
of paired associates learned by the 
fourth trial. In the Stimulus condi- 
tion, an attempt was made to ad- 
minister interpolated satiation treat- 
ment to half of the stimulus members 
of these learned paired associates 
and not to the other half. A similar 
attempt was also made in the Re- 
sponse condition except that instead 
of stimulus members, the response 
members of the learned paired asso- 
ciates received the interpolated treat-
ments. An examination of Table 4
reveals that such an attempt was 
successful. The mean number of 
correct responses on the recall trial 
after the interpolation treatments, 
also presented in Table 4, reveals how 
much interference resulted from the 
satiation or no satiation treatments 
of the words learned by the fourth 
trial. In the Stimulus condition, a 
mean drop of 1.27 in recall of responses 
of the learned paired associates is 
noticed when the stimulus members of 
those paired associates are given 
satiation treatment. But the mean 
drop in the response recall of the 
learned paired associates of which the 
stimulus members were in NSC is .58. 
The difference between these means is 
highly significant (P < .001). 

It can be seen that the mean drop 
in recall scores for learned paired 
associates of which the response 
members were given satiation treat- 
ment was .69 and the mean drop in 
recall for learned paired associates 
whose response members were in NSC 
is .81. The difference between these 
two means, of course, was not sig- 
nificant. 

Some of the paired associates which 
were originally unavailable to Ss after 
four learning trials were available at
recall. It is difficult to speculate as to
whether these paired associates were 
at an “oscillation period” of avail- 
ability (Osgood, 1953, pp. 503-504) or 
whether they were learned during the 
fourth trial when the correct response 
to the stimulus was exposed, or 
whether they were somehow made 
available during the interpolated 
period. Whatever the source of 
learning, its pattern is consistent with 
the other results. Of the 30 paired 
associates unavailable after four trials 
in the Stimulus condition which were
subsequently available at recall (a 
total of 30 paired associates for the 
group) 19 were ones whose stimulus 
members were in NSC while only 
11 were ones whose stimulus members 
were in SC. Further, of the 25 paired 
associates unavailable after four trials 
in the Response condition which were 
available at recall, 16 were ones whose 
responses were in the NSC while 9 
were ones whose responses were in the 
SC. These observations clearly follow 
the trends established by the results 
presented in Table 4. 


Discussion 


The findings of Exp. II demonstrate 
that paired-associate connections can be 
retroactively disrupted if the connotative 
meanings of their stimulus members are 
satiated. However, associational bonds 
are not affected by satiating response 
members of already learned paired 
associates. These results could be ex-
plained in terms of the associational
interpretation of semantic satiation pre- 
sented earlier in connection with Exp. I. 
Here it is argued that continual repetition 
of a word (TABLE, TABLE, TABLE, etc.) 
would strengthen the tendency for the 
word TABLE to be made as a response to 
the stimulus word TABLE. Thus, if the 
interpolated satiation treatment involves 
formation of a positive reaction tendency 
or a word-word habit, then in the present 
experiment, the stimulus satiation condi-
tion can be considered analogous to the
response variation retroaction paradigm,
and retroactive interference would be
expected (Osgood, 1953, pp. 525 ff.).
On the other hand, the response satiation 
condition is analogous to the stimulus 
variation retroaction paradigm where 
retroactive facilitation is expected. The 
reason why retroactive facilitation could 
not be obtained in the response satiation 
condition must depend upon factors 
other than the formation of the word- 
word habit per se during the interpolated 
period. In view of the importance of 
meaning in the response positions of the 
paired-associate tasks, it seems logical 
to presume that the reduction in meaning 
of the response items during the inter-
polated period might have counteracted
the facilitating effect of the word-word
habit. This explanation, however, leads
to the theoretical expectation that the
retroactive facilitation effect can be ob-
tained after interpolated satiation treat-
ment to the response items if one uses
nonsense verbal units as responses. The
findings of Exp. II when considered with
the findings of Exp. I, make it clear that
the effects of reduction of meaning of
response items on the formation (as in
Exp. I) or on the maintenance (as in
Exp. II) of associational bonds are always
detrimental.


SUMMARY


The role of verbal satiation in paired-
associate learning was investigated. Two
groups of 15 Ss each were matched on the
basis of their learning measures in an initial
test using a paired-associate list. In the main
test both the groups learned a second paired-
associate list. But immediately before learn-
ing, Group E (experimental) was given
satiation treatment of the response members
while Group C (control) was given similar
treatment to words which were not response
members. Results indicated that (a) satiation
treatment of words caused a decrease
in their connotative meaning as measured by
semantic scales and (b) Group E was slower in
learning than Group C.

In Exp. II, using a retroactive inhibition
paradigm, the effect of satiation treatment
of stimulus words on the recall of already
learned paired associates was studied. Satia-
tion treatment resulted in significantly more
retroactive interference than did the non-
satiation control treatment. The interpolated
satiation of response words produced no
significant effect on later recall. Satiation
treatment given to both the stimulus and the
response words resulted in a significant
reduction in the intensity of their meanings
as measured by semantic differential scales.

The results were discussed in terms of
an associational interpretation of semantic
satiation.


REFERENCES 


BASSETT, M. F., & WARNE, C. J. On the
lapse of verbal meaning with repetition.
Amer. J. Psychol., 1919, 30, 415-418.

CIEUTAT, V. J. Differential familiarity with
stimulus and response in paired-associate
learning. Percept. mot. Skills, 1960, 11,
269-275.

CIEUTAT, V. J., STOCKWELL, F. E., & NOBLE,
C. E. The interaction of ability and
amount of practice with stimulus and re-
sponse meaningfulness (m, m') in paired-
associate learning. J. exp. Psychol., 1958,
56, 193-202.

HILGARD, E. R. Methods and procedures in
the study of learning. In S. S. Stevens
(Ed.), Handbook of experimental psychology.
New York: Wiley, 1958. Pp. 517-567.

JENKINS, J. J., RUSSELL, W. A., & SUCI, G. J.
An atlas of semantic profiles for 360 words.
Amer. J. Psychol., 1958, 71, 688-699.

LAMBERT, W. E., & JAKOBOVITS, L. A.
Verbal satiation and changes in the in-
tensity of meaning. J. exp. Psychol., 1960,
60, 376-383.

NOBLE, C. E. An analysis of meaning.
Psychol. Rev., 1952, 59, 421-430.

OSGOOD, C. E. Method and theory in experi-
mental psychology. New York: Oxford
Univer. Press, 1953.

OSGOOD, C. E., SUCI, G. J., & TANNENBAUM,
P. H. The measurement of meaning.
Urbana: Univer. Illinois Press, 1957.

SMITH, D. E. P., & RAYGOR, A. L. Verbal
satiation and personality. J. abnorm. soc.
Psychol., 1956, 52, 323-326.

THORNDIKE, E. L., & LORGE, I. The teacher's
wordbook of 30,000 words. New York:
Teachers College, Columbia University,
1944.


(Received December 11, 1961) 


Journal of Experimental Psychology 
1962, Vol. 64, No. 6, 608-614 


EFFECTS OF VISUAL AND VERBAL CUES
ON LEARNING A MOTOR SKILL ¹


LAWRENCE KARLIN AND RUDOLF G. MORTIMER ²


New York University


In the training of motor skills 
additional cues may be supplied that 
will not be present in the operational 
situation. Improvement during train- 
ing produced by such cues has fre- 
quently been found not to persist in 
subsequent tests in which these cues 
were not present. On the basis of such 
results, Miller (1953) has distin-
guished between cues that tell S what
to do next, which he labels “action
feedback,” and cues that tell S what
he should have done, which he labels 
“learning feedback.” The same cue 
may function in varying degree both 
as action and as learning feedback. 
According to Miller, cues which 
function primarily as action feedback 
do not produce improvement in per- 
formance in tests from which they 
have been removed, and they may
even produce a decrement relative to 
control conditions. 

A study by Lincoln (1954) on the
effects of different cues on learning to
turn a crank at a specified rate is
relevant to the above distinction.
One group was given verbal informa-
tion on the amount and direction of
the average rate error after each
training trial. A second group was
given this information plus a con-
tinuous visual cue during each train-
ing trial which indicated instantane-
ous rate error. Both groups yielded
similar learning curves but in criterion
(retention) tests in which only the
intrinsic kinesthetic cues remained,
the verbal group did significantly
better than the verbal-visual group.
These results may mean that the
visual cue functioning as action feed-
back was a useful guide to perform-
ance but did not promote learning to
use the intrinsic kinesthetic cues. The
verbal cue when used alone may have
functioned as learning feedback but
in combination, the visual cue func-
tioning as action feedback was so
much more available that it mini-
mized the use of the verbal cue.

¹ This research was part of a program
carried out under contract with the United
States Naval Training Device Center, Port
Washington, New York and described in
Technical Report: NAVTRADEVCEN
558-2.

² Now at Purdue University.

Continuing along these lines a study
by Karlin (1960) investigated the
effects of visual, auditory, kinesthetic,
and verbal error cues, both singly and
in a number of combinations, on per-
formance of a task similar to that used
by Lincoln (1954). It was found that
a combined visual and verbal cue was
consistently but not significantly su-
perior to a verbal cue in learning, and
equally good in retention.

These results did not agree with
those obtained by Lincoln, and sug-
gested that certain differences between
the experimental conditions might be
important. Thus while the verbal cue
used in both studies was the same, the
visual cue was continuous in Lincoln's
study and discrete in Karlin's study.
It is possible that the failure to find
similar results in retention was due to
the fact that the discrete visual cue
was less informative than Lincoln's
continuous visual cue. One of the
objectives of the present investigation
(Karlin & Mortimer, 1961) was to
check this possibility by using a con-
tinuous visual cue both alone and in
combination with a verbal cue.

In order to gain further knowledge 
concerning the mode of action and 
differential effectiveness of the visual 
and verbal cues, a scoring system was 
used by which performance could be 
evaluated in terms of both constant 
and variable errors. It was felt that 
this technique would prove valuable 
in determining the underlying effects 
of feedback. Lincoln also had scored 
performance for both constant and 
variable errors but he did not obtain 
significant results for the variable 
errors. Variable errors were measured 
by the number of times S passed in 
and out of the tolerance range. This 
method of measuring variable error is 
contaminated with constant error 
factors which Lincoln may have dis- 
regarded because he felt that the 
variable error would be practically 
important when measured by its effect 
on a total accuracy score only when 
the constant error was relatively 
small. In the present study the ap- 
paratus was specifically designed to 
yield constant and variable error 
scores which were independent of each 
other. 


METHOD 


Subjects.—The Ss were 45 paid, volunteer,
right-handed male college students. 

Apparatus.—Except for the use of a con- 
tinuous visual display the apparatus was 
basically the same as that used by Karlin 
(1960), where a more detailed description 
may be found. Essentially, the apparatus 
consisted of a crank handle 1 in. in diameter 
and 5 in. long, masked from S's view, which 
turned on a mainshaft at a diameter of 7 in. 
Connected to the mainshaft was a Weston 
tachometer generator whose output was fed 
into the electronic scoring system and into 
the display meter. 

The scoring system consisted of 15 channels 
in which high speed counters cumulated the 
time that S was turning at a rate within the 
range that defined each channel. 




The display consisted of a Triplett volt- 
meter carrying a translucent 4 X 2 in. scale, 
illuminated from the rear, and graduated into 
50 units with a center zero marking. The 
meter was mounted in a vertical panel 22 in. 
in front of S. The meter responded to the 
output of the tachometer generator (which 
had a linear response function) such that at 
99 rpm the meter needle would be at the 
center of the scale as indicated by the zero 
mark.

Procedure.—The Ss were seated in front 
of the crank assembly and grasped the crank 
with the right hand. The task was to learn 
to turn the crank at 99 rpm. 

The Ss were randomly assigned to the 
visual, verbal-visual, and verbal cue condi- 
tions, 15 Ss per condition. Those receiving 
the visual cue were instructed in the use of the 
display meter. Those receiving the verbal 
cue were informed, at the end of a trial, of the 
amount and direction of the mean rate error, 
in rpm. The verbal-visual group was given 
both types of cue. During retention trials the 
feedback cues were removed. 

The Ss were tested on 2 consecutive days. 
On Day 1 Ss received 3 practice trials without 
feedback, 25 learning trials with feedback, 15 
immediate retention trials, and 10 relearning 
trials with feedback. On Day 2 they received 
15 delayed retention trials and 15 relearning 
trials with feedback. The first session lasted 
approximately 50 min. and the second session, 
which took place about 24 hr. later, lasted 
30 min. 

A buzzer was used to indicate the beginning 
of a trial and 3 sec. after S began to turn the 
crank scoring was begun. At the end of a 
further 15 sec. a Hunter timer broke the 
scoring circuit and the buzzer was sounded to 
inform S of the end of a trial. The intertrial 
interval was 30 sec. The interval between 
blocks of trials was about 2 min. All Ss wore 
headphones to muffle outside noise. Masking 
noise was provided by a fan. 
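
The timing figures above can be tallied as a rough check on the stated session lengths. The sketch below is illustrative only. It assumes that the three practice trials formed a block of their own, that one intertrial interval fell between consecutive trials within a block, and that no time is counted for instructions or for giving the verbal cue; the constant and function names are hypothetical.

```python
LEAD_IN_SEC = 3        # turning time before scoring begins
SCORED_SEC = 15        # scored portion of each trial
INTERTRIAL_SEC = 30    # interval between trials within a block
BLOCK_GAP_SEC = 120    # "about 2 min." between blocks


def scheduled_minutes(block_sizes):
    """Scheduled time for one session, given the number of trials per block.

    Assumptions: one intertrial interval between consecutive trials inside a
    block, one block gap between consecutive blocks, and no time counted for
    instructions or for delivering the verbal cue.
    """
    trial_sec = sum(block_sizes) * (LEAD_IN_SEC + SCORED_SEC)
    intertrial_sec = sum(n - 1 for n in block_sizes) * INTERTRIAL_SEC
    gap_sec = (len(block_sizes) - 1) * BLOCK_GAP_SEC
    return (trial_sec + intertrial_sec + gap_sec) / 60.0


day1 = scheduled_minutes([3, 25, 15, 10])   # practice, learning, retention, relearning
day2 = scheduled_minutes([15, 15])          # delayed retention, relearning
print(f"Day 1: {day1:.1f} min, Day 2: {day2:.1f} min")
```

On these assumptions the scheduled time is about 46 min. for Day 1 and 25 min. for Day 2, which leaves a few minutes in each session for instructions and feedback and is in line with the approximate 50 and 30 min. reported.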


RESULTS 


Fig. 1. Mean time within tolerance range by trial and cue (N = 15 each cue condition). [Ordinate: time within scoring tolerance (sec.); abscissa: trial, in blocks Learning, Imm. Ret'n, Rel'g I, Del. Ret'n, Rel'g II; curves for the visual, verbal, and verbal-visual groups.]


TABLE 1

DUNCAN RANGE TESTS OF MEAN DIFFERENCES BETWEEN CUES WITHIN BLOCKS
OF TRIALS FOR DIFFERENT SCORING TECHNIQUES

                                 Immediate                      Delayed
                   Learning      Retention    Relearning I     Retention     Relearning II
Comparison          D     P       D     P       D     P         D     P        D     P

Time Within Scoring Tolerance in Seconds
Verb: Verb-Vis    -2.61  .01     1.87   ns    -2.83  .01       3.17  .05     -2.51  .01
Verb: Vis         -1.73  .01     1.95   ns    -1.97  .01       5.77  .01     -1.94  .01
Verb-Vis: Vis       .88   ns      .08   ns      .86   ns       2.60   ns       .57   ns

CE in rpm
Verb: Verb-Vis     2.87  .01    -5.68  .01     2.07  .01      -7.80  .05      1.47   ns
Verb: Vis          1.40  .05    -5.61  .01     1.07  .05     -11.47  .01      ...   ...
Verb-Vis: Vis     -1.47  .05      .07   ns    -1.00  .05      -3.67   ns       .87  ...

SD in rpm
Verb: Verb-Vis     3.33  .01     3.86  ...     4.59  .01       3.00  .05      3.93  ...
Verb: Vis          2.27  .01     1.80   ns     3.26  .01       2.60  .05      3.00  ...
Verb-Vis: Vis     -1.06  .01    -2.06   ns    -1.33   ns       -.40   ns      -.93  ...


Three measures of performance
were obtained for each S on each trial
as follows: (a) Total time (sec.) that
S turned at a rate within the tolerance
range of ±13.5 rpm. (b) Constant
error (rpm); i.e., the arithmetic mean
of the rate errors, which gave the
average amount by which S was
turning too slow or too fast on each
trial and was the figure used for the
verbal cue given at the end of each
trial. (c) The SD of the rate (rpm);
i.e., the deviation of each of the mid-
points of the 15 rate intervals from S's
mean turning rate during that trial
was weighted by the time recorded
for that interval, and the SD was then
computed as the root mean square
of these time-weighted deviations.
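
The three scores defined in (a) to (c) can be sketched from the per-channel times accumulated by the counters. This is a minimal illustration under stated assumptions, not a description of the actual scoring circuit: the channel midpoints (15 channels, each 3 rpm wide, centered on 99 rpm) are hypothetical, the function name is invented, and time in tolerance is approximated by summing the channels whose midpoints fall inside the ±13.5 rpm range.

```python
import math


def score_trial(midpoints_rpm, seconds_in_channel,
                target_rpm=99.0, tolerance_rpm=13.5):
    """Return (time in tolerance, constant error, SD) for one trial.

    midpoints_rpm: midpoint rate of each scoring channel (hypothetical values).
    seconds_in_channel: time the counters accumulated in each channel.
    """
    total_sec = sum(seconds_in_channel)
    if total_sec <= 0:
        raise ValueError("no scored time recorded")

    # Time-weighted mean turning rate for the trial.
    mean_rate = sum(m * t for m, t in zip(midpoints_rpm, seconds_in_channel)) / total_sec

    # (b) Constant error: signed mean deviation from the target rate.
    constant_error = mean_rate - target_rpm

    # (c) SD: root mean square of midpoint deviations from the trial mean,
    # each weighted by the time spent in that channel.
    variance = sum(t * (m - mean_rate) ** 2
                   for m, t in zip(midpoints_rpm, seconds_in_channel)) / total_sec
    sd = math.sqrt(variance)

    # (a) Time in tolerance: channels whose midpoint lies within the range.
    in_tolerance_sec = sum(t for m, t in zip(midpoints_rpm, seconds_in_channel)
                           if abs(m - target_rpm) <= tolerance_rpm)
    return in_tolerance_sec, constant_error, sd


# Hypothetical layout: 15 channels, 3 rpm wide, centered on 99 rpm.
midpoints = [78.0 + 3.0 * i for i in range(15)]
seconds = [0.0] * 15
seconds[6], seconds[7], seconds[8] = 5.0, 5.0, 5.0   # 96, 99, and 102 rpm
print(score_trial(midpoints, seconds))
```

Because the SD is taken about the trial's own mean rate, adding a constant to every rate changes the constant error but leaves the SD unchanged, which is the sense in which the two scores are independent of each other.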
Figure 1 shows the results obtained 
for the three groups when total time 
within the tolerance range was aver- 
aged over all Ss within a group for 
each trial. In the learning and re- 
learning trials the scores of the verbal 
group are consistently poorest. On 
the other hand, the verbal group did 
best in immediate and delayed retention. A simple variance analysis of
the differences between conditions 
based on the last five trials in each 
block using a within-groups error 
term with 42 df, yielded Fs significant 
at the .01 level for all blocks except 
immediate retention, which yielded 
insignificant results (F = 1.99, 
P > .05). More detailed evaluation
of these data using Duncan range
tests (Edwards, 1960) is given in the
first section of Table 1. This table
shows that while the verbal-visual 
group was consistently superior to the 
visual group in all blocks of trials, 
none of these differences was significant. It is worth noting, however,
that the largest difference was obtained for delayed retention in which
the trends show a tendency to diverge.

Fig. 2. Absolute mean of constant errors (rpm) by trial and cue (N = 15 each cue condition).

Fig. 3. Mean rate variability (SD, in rpm) by trial and cue (N = 15 each cue condition).

When total performance was anal- 
yzed into constant and variable error 
components and averaged over all Ss 
within each condition, the correspond- 
ing trends shown in Fig. 2 and 3, 
respectively, were obtained. The
constant error trends show consider- 
able similarity to the trends of Fig. 1. 
In addition, the Duncan range tests 
shown in the second section of Table 1 
yielded significant differences in im- 
mediate retention. The variance 
analyses for constant error are not 
shown, but they all yield F ratios 
which are significant at better than
the .01 level. On the other hand, the 
trends shown in Fig. 3 for variable 
error are strikingly different from 
those of Fig. 1 and 2. Now the verbal- 
visual and visual groups consistently 
yield lower variable errors than the 


verbal group. While the differences 
between the visual and verbal groups 
are not as large they all favor the 
visual group and, with the exception 
of immediate retention, the Duncan 
range tests are all significant as shown
in the third section of Table 1. With
the exception of delayed retention,
which was nearly significant at the .05
level (F = 3.15), a variance analysis
of each block yielded F ratios signifi-
cant at the .01 level or better.


Discussion 


When given in terms of time within
scoring tolerance, the learning score
differences disagree with those obtained
by Lincoln (1954) and agree with those
obtained by Karlin (1960). Possibly the
disagreement is due to differences in
characteristics of the display, although
the present display differed appreciably
from those used in both of the above
studies.

In immediate retention these results 
agree with Lincoln's findings that the 
verbal cue was superior to the verbal- 
visual cue, although in the present study 
this difference was not significant. 

When performance is further analyzed 
into constant and variable error com- 
ponents, differences among the condi- 
tions are more pronounced. Considering 
the constant error first, the verbal cue is 
significantly superior to the verbal-visual 
cue in retention and significantly inferior 
to this cue in learning. A similar picture 
is obtained when the verbal cue is com- 
pared to the visual cue although the 
differences in the learning and relearning 
trials are not so great. Significant differ- 
ences are also obtained (with one excep- 
tion) when performance is analyzed in 
terms of variable error but this time the 
verbal cue is inferior to both verbal- 
visual and visual cues in immediate and 
delayed retention and in learning. 

From this analysis it is clear that when 
performance is measured by total score, 
the superiority of the verbal cue in 
retention is a result of its effect on 
constant rather than on variable errors. 
This conclusion is reasonable since the 
verbal cue provided information directly 
determined by the constant error for a 
given trial. On the other hand, the 
visual cue provided an immediate index 
of performance which did not distinguish 
between constant and variable errors 
since it did not average over time. How- 
ever, the results suggest something that 
was not apparent in those of earlier 
experiments, namely, that with “action” 
cues (see Miller, 1953) like the visual 
or verbal-visual cues something is learned 
that reduces rate variability which 
persists even after the cues are with- 
drawn. Apparently, the visual cue leads 
to a relatively stable improvement in 
smoothness and steadiness of performance. On this point note that the
verbal-visual cue is superior both in learning and
retention to the visual cue. These 
results suggest that the two cues may 
interact to produce greater steadiness 
during retention than the visual cue alone 




but since the results are not statistically 
significant further work on this question 
is needed. 

It is worth noting that the verbal cue 
in Lincoln's as well as in the present 
study did not give variable error in- 
formation and one may speculate on how 
a verbal cue which gave an average of the 
variable errors at the end of a trial would 
affect performance. 

On the whole the results of the present 
study show that cues which might 
ordinarily be considered to function as 
action feedback, or as a “crutch” to 
guide performance, can make a contribu- 
tion to retention of a motor skill by way 
of reducing variable error, although this 
contribution can be obscured if only the 
total score is considered. On the other 
hand the results support Lincoln’s finding 
that the visual cue can be a “hind- 
rance” when combined with the verbal 
cue as far as the constant error compo- 
nent of the total score is concerned. 

Finally, it is important to note that 
the present results are based on a type 
of task which involves producing a single 
steady state. In this sense they may 
have a bearing on other types of task 
which involve a single production such 
as the line-drawing tasks of Thorndike 
(1932). Further evidence for this con- 
clusion is to be found in a recent study 
by Baker and Lavery (1960) who used a 
series of tasks requiring a single end 
product. 


SUMMARY 


The effects of visual, verbal, and combined 
verbal-visual cues on the learning and reten- 
tion of a crank-turning task were investigated. 
The task was to turn the crank at 99 rpm. 
The Ss were 45 right-handed males, 15 Ss in 
each condition. 

It was found that: (a) Overall superiority 
in retention tests of task performance meas- 
ured by time within tolerance was due mainly 
to reduction of constant errors. (b) The 
verbal cue was inferior in learning but superior 
in retention tests when performance was 
measured by time within tolerance and 
magnitude of constant error. (c) The verbal 
cue was inferior to the visual and combined 
verbal-visual cues during both learning and
retention trials when variable error was
measured.


REFERENCES 


Baker, C. H., & Lavery, J. J. Performance 
during training as a criterion of retention 
of motor skills. Report, 1960, Defense 
Research Board of Canada. 

EDWARDS, A. L. Experimental design in
psychological research. New York: Rinehart,
1960.

KARLIN, L. Psychological study of motor
skills: Phase I. USN Train. Dev. Cent. 
tech. Rep., 1960, No. 558-1. 




KARLIN, L., & Mortimer, R. G. Psycho- 
logical study of motor skills: Phase II. 
USN Train. Dev. Cent. tech. Rep., 1961, 
No. 558-2. 

LINCOLN, R. S. Learning a rate of movement.
J. exp. Psychol., 1954, 47, 465-470. 

MILLER, R. B. Handbook on training and 
training equipment design. USAF WADC 
tech. Rep., 1953, No. 53-136. 

THORNDIKE, E. L. Fundamentals of learning. 
New York: Teachers College, Columbia 
University, Bureau of Publications, 1932. 


(Received December 11, 1961) 


Journal of Experimental Psychology 
1962, Vol. 64, No. 6, 615-622


PREDICTION OF SOME STOCHASTIC EVENTS: 
A REGRET EQUALIZATION MODEL¹
MAX S. SCHOEFFLER 
Bell Telephone Laboratories, Incorporated, Murray Hill, New Jersey 


The present paper describes a series 
of experiments which were performed 
to develop some data on human pre- 
diction capabilities, a model for how 
such prediction capabilities are ex- 
pressed as behavior, and a series of 
tests of this model. 

The data were collected in a rather 
simple experimental situation. People 
were asked to make “bids” on 100 
successive numbers that appeared. 
Either they were instructed that these 
numbers referred to “make-believe 
dollars” (MBDs) which they might 
win, or they were told only to guess as 
close as possible to the next number. 
For groups that bid for MBDs, the 
number of MBDs S received depended 
on the relationship between his bid 
and the number that actually occurred 
(and thus on the adequacy with which 
he was able to predict the upcoming 
number). 

All of the experiments to be re- 
ported here involved this general 
situation. Two of these experiments 
provided the intuitive basis for con- 
structing a model of behavior under 
these circumstances. The remaining 
experiments were then used to evalu- 
ate the adequacy of the model when 
the assumptions made in constructing 
the model were specifically tested for 
generality under conditions to which 
the model seemed applicable. 


One variable that was investigated 
concerned the specific relationship be- 
tween the bid that was made and the 


1 The research reported in this paper was 
done at the Willow Run Laboratories, Uni- 
versity of Michigan, under a contract with 
the Department of the Army. 


number that came up. It was termed 
the payoff variable and was varied for 
different groups in order to provide one 
test of the adequacy of the model.

The condition henceforth labeled 
Guess is the one characterized above as 
not receiving MBDs. For the Non- 
punish condition, the payoff to S was 


$$P = \begin{cases} B & \text{if } B \le N \\ 0 & \text{if } B > N \end{cases}$$

where B is the bid made by S, N is the
number that appeared (input), and P
is the payoff to S in MBDs. That is, S
won as many MBDs as he had bid so 
long as his bid was less than or equal to 
the input number. If his bid exceeded 
the input number, he won nothing. 
Similarly, for the Punish condition, 


$$P = \begin{cases} B & \text{if } B \le N \\ -B & \text{if } B > N \end{cases}$$

Thus the punish condition differed from 
the Nonpunish condition only in that a 
person lost the amount that he bid in 
case of an overbid, rather than simply 
receiving nothing. 
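The two payoff rules can be written out as a short function. This is a minimal sketch of the rules as stated in the text; the function name and the condition labels passed as strings are illustrative choices, not part of the original procedure.

```python
def payoff(bid, number, condition):
    """Payoff in make-believe dollars (MBDs) for one trial.

    bid       -- amount requested by S (B)
    number    -- input number written by E (N)
    condition -- "nonpunish" or "punish" (labels chosen for this sketch)
    """
    if condition not in ("nonpunish", "punish"):
        raise ValueError("payoff is defined only for the Nonpunish and Punish conditions")
    if bid <= number:
        return bid            # S wins what he bid
    if condition == "nonpunish":
        return 0              # overbid: S wins nothing
    return -bid               # overbid under Punish: S loses his bid
```

For example, with an input of 12, a bid of 13 returns 0 under Nonpunish and -13 under Punish, while a bid of 12 returns 12 under both.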

Make-believe dollars were used be- 
cause some pilot work indicated that 
they would function adequately as 
incentives. In an attempt to retain a 
linear value scale for the MBDs, only a 
relatively small range of values was 
used. To the extent that the value 
remained linear with number of MBDs, 
the paradigm permitted a reasonable 
specification (in this arbitrary unit of 
measurement) of the payoff matrix 
relating each response ($B_i$, i = 1, ..., m)
to each experimental outcome ($N_j$, j = 1,
..., k) in terms of the payoff, $P_{ij}$
(positive or negative) to S. For the
present study involving predicted num-
bers as responses, $B_i = i$ and $N_j = j$.

TABLE 1

INPUT DISTRIBUTIONS FOR THE VARIOUS EXPERIMENTAL GROUPS

                                                     Trials
Group       N              ID*   Payoff Function   1-30     31-60    61-90    91-100
G1a, G1b    14, [illeg.]   C     Guess             8-15     8-15     8-15     [illeg.]
G2a, G2b    [illeg.]       S     Guess             8-15     13-20    8-15     12
G3          30             C     Guess             13-20    13-20    13-20    13-20
N1a, N1b    16, 24         C     Nonpunish         8-15     8-15     8-15     [illeg.]
N2a, N2b    18, 20         S     Nonpunish         8-15     13-20    8-15     12
N3          19             C     Nonpunish         13-20    13-20    13-20    13-20
P1a, P1b    17, 15         S     Punish            8-15     13-20    8-15     12
P2a, P2b    15, 13         S     Punish            13-20    18-25    13-20    [illeg.]

* Input distribution Constant (C) or Shifting (S).

The model that was devised to describe behavior in this situation treats
“regret” as a central concept. Regret is 
defined as the difference between the 
payoff on a trial and the maximum 
possible. That is, if $P_{ij}$ represents the
payoff to S given response $B_i$ and event
$N_j$, then the regret experienced by S is

$$R_{ij} = \left| P_{ij} - \max_i P_{ij} \right|.$$

Although for the Guess condition $P_{ij}$
is not explicitly defined, a concept of
regret still appears intuitively mean-
ingful. This regret is considered to be
the difference between the predicted
number and the actual number. Form-
ally, $R_{ij} = |B_i - N_j|$.

Under this definition, the regret out- 
comes of a trial must take on positive 
values. However, since the experiment 
under consideration deals with ordered 
outcomes and the responses can likewise 
be ordered, it is reasonable to consider 
separately a regret due to overbidding 
and one due to underbidding. That is to 
say, since the values of $B_i$ and $N_j$ have
been, respectively, identified with the
numbers represented by i and j, then
$R_{ij}$ is due to overbidding if i > j, is due
to underbidding if j > i, and is zero
if i = j.
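The regret definition can be made concrete with a small sketch. It applies the definition literally for the two payoff conditions (the payoff received against the best payoff any bid from 0 to 30 could have earned for the same input) and uses the absolute difference for Guess. The function names, string labels, and the 0 to 30 bid range used for the maximum are choices made for this illustration.

```python
def payoff(bid, number, condition):
    """MBD payoff for one trial under the Nonpunish or Punish rule."""
    if bid <= number:
        return bid
    return 0 if condition == "nonpunish" else -bid

def regret(bid, number, condition, max_bid=30):
    """Regret R for one trial: |payoff received - best payoff available|."""
    if condition == "guess":
        return abs(bid - number)          # no payoff defined; use the miss distance
    if condition not in ("nonpunish", "punish"):
        raise ValueError("unknown condition: %r" % (condition,))
    best = max(payoff(b, number, condition) for b in range(0, max_bid + 1))
    return abs(payoff(bid, number, condition) - best)

def regret_source(bid, number):
    """Classify the regret as due to overbidding, underbidding, or neither."""
    if bid > number:
        return "overbid"
    if bid < number:
        return "underbid"
    return "none"
```

With an input of 12, a bid of 10 gives a regret of 2 in both payoff conditions, while a bid of 13 gives a regret of 12 under Nonpunish and 25 under Punish.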

The regret equalization hypothesis to be
proposed as a model involves the notion
that regret due to overbidding and regret
due to underbidding result, respectively,
in tendencies to lower or raise the re-
sponse. That is, if for example, $R_{ij}$
is the regret on Trial n and i > j, then
the response on Trial n + 1 tends to be
less than i; if i < j, then the response on
n + 1 tends to be greater than i. If
another assumption is made, viz., that
the amount of this effect is linear with
$R_{ij}$, it seems reasonable to expect be-
havior to stabilize (if indeed it does
stabilize) at a point where the expected
value of the regret due to overbidding is
equal to the expected value of the regret
due to underbidding.
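One way to see why a linear effect of regret would settle near the equalization point is to simulate it. The sketch below is purely illustrative: the update rule (next bid = current bid plus a constant times the signed regret) and the step size `rate` are assumptions introduced here, not quantities specified by the model, and bids are allowed to take fractional values.

```python
import random

def signed_regret(bid, number, condition):
    """Regret signed by direction: positive after an underbid, negative after an overbid."""
    if condition == "guess" or bid <= number:
        return number - bid
    if condition == "nonpunish":
        return -number                # S won nothing instead of `number`
    if condition == "punish":
        return -(number + bid)        # S lost `bid` instead of winning `number`
    raise ValueError("unknown condition: %r" % (condition,))

def simulated_mean_bid(low, high, condition, rate=0.02,
                       trials=20000, burn_in=2000, seed=0):
    """Average bid after burn-in under the assumed linear update rule."""
    rng = random.Random(seed)
    bid = (low + high) / 2.0
    total = 0.0
    for t in range(trials):
        number = rng.randint(low, high)                      # rectangular input
        bid += rate * signed_regret(bid, number, condition)  # hypothetical update
        if t >= burn_in:
            total += bid
    return total / (trials - burn_in)
```

Under these assumptions the long-run average bid stays close to the input mean for Guess and falls well below it for Nonpunish, in the neighborhood of the point where expected over- and underbidding regret balance.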


More formally stated, the regret
equalization hypothesis predicts an
asymptotic bid level b such that b
satisfies

$$E_{i>j}(R_{ij}) = \sum_{j=0}^{[b-1]} p_j R_{bj} = E_{i<j}(R_{ij}) = \sum_{j=[b]}^{\infty} p_j R_{bj} \tag{1}$$

where [b] is the smallest integer ≥ b and
$p_j$ is the probability that $N_j$ occurs.
It is clear that b is not necessarily an
integer, whereas the $N_j$ are integers.
Thus, b cannot represent a constant
terminal response level for an individual. 
Rather it must be an average taken at 
least over several responses of an in- 
dividual. Further, the same asymptote 
is predicted for all individuals facing the 
same sequence of events. No doubt 
individual differences do exist, but for 
the sake of a model with maximal 
simplicity and intuitive appeal, it was
deemed proper to avoid introducing a
parameter to deal with individual differ-
ences. Rather an attempt will be made
to evaluate the predictions using no
fitted parameters. (This is perhaps an
overstatement since the hypothesis was
constructed using the data from two of
the groups. Thus, it may be argued that 
the transformation from MBDs to regret 
in itself constitutes fitting a parameter.) 


METHOD 


All experiments used groups of between 13 
and 24 college students which constituted 
classes in freshman, sophomore, or junior 
psychology courses. The experiments were 
all conducted in a similar fashion and required 
that S predict on 100 successive trials the 
numbers that E wrote on the blackboard on 
those trials. The instructions to Ss differ- 
entiated the three payoff functions that were 
used.

In the Punish and Nonpunish groups, Ss
were instructed that they would get 100
successive opportunities to request between
0 and 30 make-believe dollars (MBDs). On
each trial they were to write down the amount
of money they were requesting on that trial.
After they wrote this number, E wrote a
number on the blackboard. For the Punish
groups the instructions stated that if the
amount requested was less than or equal to the
amount that E wrote on the board S would
receive the amount requested. If, however,
S requested more than E subsequently wrote,
S would lose the amount that he bid. For
the Nonpunish groups the penalty for over-
bidding was that S simply did not win any
MBDs on that trial and otherwise the rules
were the same as for the Punish condition.
For the Guess groups, Ss were simply told
to guess “as close as possible to the number
that E would write on the blackboard, with
overestimations and underestimations being
equally bad.”




These differences in instructions provided 
to the different groups constituted the payoff 
variable. 

For each bid, each S entered on his data 
sheet the amount he bid, whether he won or 
lost on that trial, and a running total of his 
winnings. (However, Ss in the Guess groups
only wrote down the amount bid.) Bids were 
requested and amounts were written on the 
board by E at the rate of about one bid 
every 15 sec. In all, the experiment involved 
100 such trials. 

The numbers that E wrote on the blackboard
are called “input.” The numbers were
randomly selected from particular distribu- 
tions. The distributions from which they 
were drawn differed for the different blocks 
of trials. For all experiments the distribu- 
tions were rectangular, the integers included 
all appearing with equal probability. In 
Table 1 the distributions are indicated by the 
highest and lowest integer used. For ex- 
ample, 8-15 represents the integers 8, 9, 10, 
11, 12, 13, 14, and 15. 


RESULTS AND DISCUSSION 


Payoff variable: Guess.—Curves de- 
scribing performance under the Guess 
condition are presented in Fig. 1 and 2. 
Fig. 1. Means and inter-S SDs for the Guess condition with a constant input distribution, averaged over five-trial blocks. (Also shown are the averages of the input distributions. These coincide with the predicted asymptotic response levels.)

Fig. 2. Means and inter-S SDs for the Guess condition with a shifting input distribution, averaged over five-trial blocks. (Also shown are the averages of the input distributions. These coincide with the predicted asymptotic response levels.)

Figure 1 presents the means and
inter-S SDs for Groups G1a, G1b, and
G3. As indicated in Table 1,
Groups G1a and G1b differed procedurally
from Group G3 only in
having their input distributions dis- 
placed by five units. Thus, these data 
serve to indicate the relationship of 
the asymptotic response mean and 
variability to the input mean. Visual 
inspection of these data permits one to 
conclude that the mean asymptotic 
response level is equal to the mean of 
the input distribution and that the 
inter-S asymptotic variability is in- 
dependent of the input mean. 

Figure 2 presents comparable curves 
for Groups G2a and G2b. The input 
numbers for these groups on Trials 
1-30 and 61-90 were drawn from the 
integers 8-15 (the distribution used 
for Groups G1a and G1b). On Trials
31-60 the input numbers were drawn 
from the integers 13-20 (the dis- 
tribution used for Group G3). The 
input numbers of Trials 91-100 were 
always 12, but the corresponding data 
will not be discussed for any of the 
groups. 
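The shifting input schedule just described can be sketched as a generator of 100 input numbers. The function name and the use of a seeded pseudorandom generator are choices made for this illustration; the original numbers were selected by E and are not reproduced here.

```python
import random

def shifting_input_sequence(seed=None):
    """Return 100 input numbers following the shifting schedule for Groups G2a and G2b."""
    rng = random.Random(seed)
    sequence = []
    for trial in range(1, 101):
        if trial <= 30 or 61 <= trial <= 90:
            sequence.append(rng.randint(8, 15))    # rectangular over 8-15
        elif trial <= 60:
            sequence.append(rng.randint(13, 20))   # rectangular over 13-20
        else:
            sequence.append(12)                    # Trials 91-100: always 12
    return sequence
```

Calling `shifting_input_sequence(seed=1)` twice returns the same list, which makes it easy to present one fixed sequence to every simulated S.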


The data in Fig. 2 can thus be com-
pared to the data from Fig. 1. They
indicate that the groups exposed to
shifts in the input distribution ap-
proach asymptotes comparable to
those attained by groups maintained
on the same input distribution
throughout. The inter-S variability 
is a negatively accelerated decreasing 
function of trials, except that if a shift 
is introduced, the variability ap- 
parently increases temporarily. 

Under this (Guess) condition, the
regret equalization hypothesis requires
that the mean asymptote response, b, satisfy

$$\sum_{j=0}^{[b-1]} (b - N_j)\,p_j = \sum_{j=[b]}^{\infty} (N_j - b)\,p_j$$

where [b] is the smallest integer ≥ b,
$N_j$ is an input number, and $p_j$ is the
probability that $N_j$ is chosen on a
given trial. Solving this equation
yields $b = \sum_{j=0}^{\infty} N_j p_j = \bar{N}$, the input
mean.
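The step from the Guess equation to the input mean can be spelled out as follows. This is a worked sketch of the algebra, using only the fact that the probabilities sum to one; the numerical values are for the two input distributions used with the Guess groups.

```latex
\begin{align*}
\sum_{j=0}^{[b-1]} (b - N_j)\,p_j - \sum_{j=[b]}^{\infty} (N_j - b)\,p_j &= 0 \\
\sum_{j=0}^{\infty} (b - N_j)\,p_j &= 0 \\
b \sum_{j=0}^{\infty} p_j &= \sum_{j=0}^{\infty} N_j p_j \\
b &= \bar{N} \\[4pt]
\text{Input 8--15:}\quad b &= \tfrac{1}{8}(8 + 9 + \cdots + 15) = \tfrac{92}{8} = 11.5 \\
\text{Input 13--20:}\quad b &= \tfrac{1}{8}(13 + 14 + \cdots + 20) = \tfrac{132}{8} = 16.5
\end{align*}
```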

The mean response curves thus are
in line with the hypothesis. However,
the symmetry of the situation is such
that almost any conceptual model
would make a similar prediction.
Consequently, the predictions of the
model are next examined under
conditions similar to those used here
but with an asymmetric value struc-
ture superimposed.

Payoff variable: Nonpunish.—Figures
3 and 4 describe the performance
of the groups that were given the
Nonpunish instructions. The data of
Fig. 3 are analogous to those of Fig. 1
and the data of Fig. 4 to those of
Fig. 2. Groups N1a and N1b differed
from Group N3 in that N3 received
an input distribution that was dis-
placed by five units. Groups N2a and
N2b received the same input dis-
tribution as Groups N1a and N1b on
Trials 1-30 and 61-90, and the same
distribution as Group N3 on Trials
31-60.

The mean response data are again 
characterized by apparently very 
stable asymptotes when the same in- 
put distribution is used throughout 
(Fig. 3). Again the differences be- 
tween N1 and N3 in the mean 
asymptotic response demonstrate the 
dependence of the response mean on 
the mean of the input distribution; 
while the similarity in the inter-S 
variability between these groups in- 
dicates that this measure is independ- 
ent of the input mean. However, the 
response asymptotes are no longer at 
the mean of the respective inputs, but 
rather are at a new value appreciably 
below the mean of the input. 

Fig. 3. [Average bid by trial number for Groups N1a, N1b, and N3, with the input means and the predicted asymptote for each group.]

Fig. 4. Means and inter-S SDs for the Nonpunish condition with a shifting input distribution averaged over five-trial blocks. (Also shown are the averages of the input distributions and the predicted asymptote for each group. The input distribution is the same as that of Fig. 2 for the Guess condition.)

In the case of the shifting distribu-
tion (Fig. 4), the response asymptotes
are again predictable from the data
of the groups receiving the same input
distribution throughout. Again there
is apparently a slight increment in
response variability associated with a
shift in the input distribution and a
negatively accelerated monotonic de-
crease in variability when the input
distribution is not shifted.

According to the regret equalization 
hypothesis, the asymptotic response 
mean should again be predictable. If 
the response is less than the corre- 
sponding input number, there is 
assumed to be a regret equal to the 
difference between the two. In case 
of an overbid, since S receives nothing, 
his regret is equal to the input. Thus, 
his asymptotic bid level b should 
satisfy 


$$\sum_{j=0}^{[b-1]} N_j\,p_j = \sum_{j=[b]}^{\infty} (N_j - b)\,p_j$$


where the right side of the equation 
constitutes the expected regret due to 
underbidding and the left side con- 
stitutes the expected regret due to 
overbidding. 

Fig. 5. Means and inter-S SDs for the Punish condition with a shifting input distribution averaged over five-trial blocks. (Also shown are the averages of the input distributions and the predicted asymptote for each group. The input distribution is the same as that of Fig. 2 for the Guess condition and that of Fig. 4 for the Nonpunish condition.)

Under the Nonpunish condition for
the input 8-15, the equation is
satisfied by b = 9.7 and for the input
13-20, by b = 14.1. These predic- 
tions are indicated on Fig. 3 and 4 and 
are approximately at the observed 
asymptotes (within one standard er- 
ror). In contrast, it may be noted 
that asymptotes of 8 and 13, respect- 
ively, would be required if Ss were 
either to maximize expected payoff or 
to minimize expected regret. 
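The value of 9.7 for the input 8-15 can be checked by hand. The sketch below assumes that b lies between 9 and 10, so that only the inputs 8 and 9 produce overbids, and uses the rectangular probabilities of 1/8; the assumption is confirmed by the solution.

```latex
\begin{align*}
\text{Overbidding regret:}\quad & \tfrac{1}{8}(8 + 9) = \tfrac{17}{8} \\
\text{Underbidding regret:}\quad & \tfrac{1}{8}\sum_{N=10}^{15} (N - b) = \tfrac{75 - 6b}{8} \\
\text{Equalizing:}\quad & 17 = 75 - 6b \;\Longrightarrow\; b = \tfrac{58}{6} \approx 9.67, \qquad 9 < b \le 10
\end{align*}
```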

Payoff variable: Punish—The data 
for the Punish groups are shown in 
Fig. 5 and 6. The results are con- 
sistent with those of the other condi- 
tions. That is, the mean response 
levels depend on the mean of the input 
distribution, and the inter-S SDs are 
negatively accelerated monotonic de- 
creasing except for slight increases 
when the input distribution is shifted. 

The underbidding regret is here 
identical to that of the Nonpunish 
cases. However, in case of an overbid, 
S loses the amount bid, so that the 
regret is defined to be the sum of the 


MAX S. SCHOEFFLER 


input and the response. The asymp- 
totic bid level b should, therefore, 
satisfy 


[b—1] a 
X(N; + 5)p; = X(N; — dp, 
j=0 i=) 


Values of b that satisfy this equation 
are 8.9 for the input distribution 8-15, 
13.2 for the input 13-20, and 18.1 for 
the input 18-25. The predicted 
asymptotes are indicated in Fig. 5 
and 6 and appear to be reasonably 
descriptive of the data. Comparable 
asymptotic predictions would be 8, 13, 
and 18, respectively, if Ss maximized 
expected payoff or minimized expected 
regret.
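The equalization points for all three payoff conditions can be located numerically. The sketch below is one possible approach, not the author's procedure: it bisects on the bid level until expected overbidding regret first equals or exceeds expected underbidding regret. Function names and the 0 to 30 search range are choices made here. Where the balance flips in a single jump at an integer (for example at 14 for Nonpunish with input 13-20), the sketch returns that integer, whereas the text reports slightly different values (14.1, 8.9, 18.1); the source does not say how such cases were resolved.

```python
def expected_regrets(b, inputs, condition):
    """Return (expected overbid regret, expected underbid regret) at bid level b."""
    if condition not in ("guess", "nonpunish", "punish"):
        raise ValueError("unknown condition: %r" % (condition,))
    p = 1.0 / len(inputs)                 # rectangular input distribution
    over = under = 0.0
    for n in inputs:
        if n < b:                         # bid exceeds the input: overbid
            if condition == "guess":
                over += (b - n) * p
            elif condition == "nonpunish":
                over += n * p
            else:                         # punish
                over += (n + b) * p
        else:                             # underbid (or exact hit)
            under += (n - b) * p
    return over, under

def equalization_point(inputs, condition, lo=0.0, hi=30.0, iterations=60):
    """Bisect for the bid level at which overbid regret catches up with underbid regret."""
    for _ in range(iterations):
        mid = (lo + hi) / 2.0
        over, under = expected_regrets(mid, inputs, condition)
        if over < under:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2.0
```

For the input 8-15 this gives about 11.5 for Guess and about 9.67 for Nonpunish, and for the input 13-20 under Punish it gives 13.25, in line with the reported 9.7 and 13.2.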

It was noted above that the regret 
equalization hypothesis evolved from 
a consideration of the effect of over-
or underbidding on a subsequent bid. 
The conceptualized process produces 
a decrease in the response level in 
proportion to the amount of regret 
experienced as a result of overbidding 
and an increase in the response level 
in proportion to the amount of regret 
due to underbidding. A detailed look
at the trial by trial changes in the bid
level supports this conceptualization.

Fig. 6. Means and inter-S SDs for the Punish condition with a shifting input distribution having a larger maximum than elsewhere.

TABLE 2

CORRELATION BETWEEN RESPONSE CHANGE AND REGRET COMPUTED OVER
30-TRIAL BLOCKS

                 Trial Block
Group      1-30     31-60      61-90
G1a        .683     .672       .593
G1b        .695     [illeg.]   .577
G2a        .652     .641       .643
G2b        .583     .740       .738
G3         .674     .620       .644
N1a        .442     .412       .266
N1b        .426     .437       .359
N2a        .432     .447       .462
N2b        .553     .552       .505
N3         .495     .466       .411
P1a        .538     .490       .401
P1b        .481     .494       .568
P2a        .485     .393       .424
P2b        .542     .480       .518

Let regret due to underbidding be 
arbitrarily defined to be positive and 
regret due to overbidding be defined 
to be negative. One can use the 
product-moment correlation, r, be-
tween amount of regret and amount
of change in the bid level as an index 
of the extent to which this model of 
the effect of regret actually describes 
the data. Such correlations have been 
computed for each S in each experi- 
ment for the three successive blocks 
of 30 trials. The averages of these r's
taken over the group are shown in 
Table 2. 
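The index described here can be sketched for a single S. The code below pairs the signed regret on each trial with the change in bid from that trial to the next and computes the product-moment correlation over a block. The function names are illustrative, and the pairing of Trial n regret with the n to n + 1 change is an assumption based on the description of the conceptualized process.

```python
import math

def signed_regret(bid, number, condition):
    """Positive for underbidding regret, negative for overbidding regret."""
    if condition == "guess" or bid <= number:
        return number - bid
    if condition == "nonpunish":
        return -number
    if condition == "punish":
        return -(number + bid)
    raise ValueError("unknown condition: %r" % (condition,))

def pearson_r(xs, ys):
    """Product-moment correlation between two equal-length sequences."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation undefined when one variable is constant")
    return sxy / math.sqrt(sxx * syy)

def regret_change_correlation(bids, numbers, condition):
    """r between regret on Trial n and the bid change from Trial n to n + 1."""
    regrets = [signed_regret(b, n, condition)
               for b, n in zip(bids[:-1], numbers[:-1])]
    changes = [bids[t + 1] - bids[t] for t in range(len(bids) - 1)]
    return pearson_r(regrets, changes)
```

Applying `regret_change_correlation` to each 30-trial block of one S's data and averaging the resulting values over the Ss in a group would give entries of the kind shown in Table 2.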

For all groups, the r is reasonably 
large on the initial trial block. How- 
ever, those groups that received the 
same input distribution throughout 
the three trial blocks (G1, G3, N1, 
and N3) seem to show a sharp de- 
crease in the degree of correlation 
between regret and response change 
over the three trial blocks. Thus, 
there is evidence that regret is indeed 
intimately related to response change, 




but that with a constant population 
distribution for the input, this relation 
becomes less important. In addition, 
although no interpretation will be 
attempted here, it should be noted 
that all correlations for Guess groups 
are larger than any for the other 
groups. 

An attempt was also made to 
provide some additional insight into 
the values of the correlation coeffi- 
cients. It was noted that the SDs 
of the bids in a group also decreased 
to a low value over a series of trials, 
showing increases only when a change 
occurred in the input distribution. 
If the decrease in these SDs also 
implied a decrease in the SD of a 
series of responses of an individual 
then that might be sufficient to 
account for the decrease in correlation 
of the regret with the change in bid 
level. Accordingly, such intra-S SDs 
were computed for each S for each 
block of 30 trials. Averages of these 
intra-S SDs are shown in Table 3. 
Also given in this table is the SD of
the input distribution. This value of
2.29 is appreciably larger than the
asymptotic SDs for at least some of
the groups (notably N1a, N1b, and
N3, the Nonpunish groups with con-
stant input distributions).

TABLE 3

INTRA-S SDs COMPUTED OVER 30-TRIAL BLOCKS

                 Trial Block
Group      1-30     31-60    61-90
G1a        4.54     2.73     2.33
G1b        3.88     2.52     2.21
G2a        3.88     3.41     2.94
G2b        3.99     3.54     3.21
G3         4.63     2.97     2.99
N1a        3.46     1.47     1.09
N1b        2.77     1.57     1.56
N2a        2.65     2.51     2.45
N2b        3.65     2.95     2.44
N3         3.00     1.84     1.47
P1a        3.34     2.33     1.69
P1b        2.84     2.55     2.17
P2a        2.86     2.55     1.99
P2b        2.84     2.39     2.22
Input      2.29     2.29     2.29
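The input SD of 2.29 follows directly from the rectangular distributions of eight consecutive integers. The short check below computes the population SD and compares it with the closed form for k equally likely consecutive integers, sqrt((k² − 1)/12); the function name is a label chosen for this sketch.

```python
import math
import statistics

def input_sd(lowest, highest):
    """Population SD of a rectangular distribution over the integers lowest..highest."""
    return statistics.pstdev(range(lowest, highest + 1))

k = 8                                        # eight integers in each input distribution
closed_form = math.sqrt((k ** 2 - 1) / 12)   # sqrt(63 / 12) = sqrt(5.25)
```

Since the SD depends only on the number of integers in the range, the 8-15, 13-20, and 18-25 distributions all share this value, which is why a single Input row serves every trial block.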

These data show that the Guess 
groups also tend to exhibit relatively 
larger intra-S SDs, and that appreci-
able decreases in the intra-S SDs
occur over the successive trial blocks.
This is in line with the decreasing r's
of Table 2, since it is clear that the r
must perforce be low if the variability 
in bid level is low. However, the 
patterns of the decreases of the two 
sets of data are different. Most of the 
decrease in correlation occurs only for 
the groups not getting shifts in the 
input distribution and it occurs on the 
last trial block. In contrast, all of the 
groups show a decrease in intra-S 
variability, and the decrease takes 


place primarily on the second trial 
block. 


No explanation is attempted here for 
these effects. This does not detract from 
the importance of these effects for learn- 
ing theory. The data demand that 
whatever theory or model is used to 




account for them must also produce 
these decrements in variability and cor- 
relation. In particular, an adequate 
theory must make the response vari-
ability, both intra- and inter-S, depend-
ent on the stationarity of the input
distribution. 


SUMMARY 


Subjects were instructed to ask for some
number of “make-believe dollars” (MBDs)
or simply to guess a number which E would
subsequently present. The payoff to S de-
pended on the relation of S's bid to E's
number. Three conditions were used to
determine the payoff. In two of these, Ss
were encouraged to bid high, but excessively
high bids were punished. In the other condi-
tion, over- and underbids were treated sym-
metrically. A model was constructed which
predicts the asymptotic bid level under these
conditions to be at a point where the expected
regret due to overbidding is equal to the
expected regret due to underbidding.

The results indicated: (a) The asymptotes
of the bids depend on the payoff conditions
and the distribution of input numbers as
predicted by the model. (b) Both the inter-
and the intra-S variability decrease over
trials except when the distribution of input
numbers is changed. (c) The increase or
decrease in bid level on a trial is highly
correlated with the regret associated with the
preceding trial.


(Received December 11, 1961) 


Journal of Experimental Psychology 
1962, Vol. 64, No. 6, 623-627 


SOME EFFECTS OF THE PERCENTAGE OF RELEVANT 
CUES AND PRESENTATION METHODS 
ON CONCEPT IDENTIFICATION * 


MARGARET JEAN PETERSON 


Indiana University 


In order to perform the complex 
discriminations necessary for success- 
ful solution of a concept formation 
problem, Ss must distinguish between 
dimensions which are relevant to the 
solution of the problem and those 
which are not. Predicting that adap- 
tation of the irrelevant cues and 
conditioning of the relevant cues 
would be facilitated by temporally 
proximate presentation of instances 
of the relevant dimension, the studies 
reported herein varied the proximity 
of relevant instances by presenting all 
instances relevant to one concept 
before showing instances representing 
another concept (homogeneous condi- 
tion), and by presenting the instances 
representative of three separate con- 
cepts in a mixed sequence (hetero- 
geneous condition). 

The percentage of relevant cues per 
problem was manipulated by using 
three relevant dimensions and one 
irrelevant dimension for one problem 
(75%R); two relevant and two irrelevant
for another (50%R); and
one relevant and three irrelevant for 
a third (25%R). 

Underwood (1952) emphasized that 
temporally contiguous presentation of 
stimuli which are instances related to 
the same concept should facilitate 
learning by minimizing the interfer- 
ence effects that might be produced by 


1 This research was supported by Grant
3707A and by Grant M-5209 from the 
National Institutes of Health, Public Health 
Service. The assistance of Miss Keith 
Blattner in this investigation is gratefully 
acknowledged. 


interpolated instances of other con- 
cepts. Although massed practice has 
not always been associated with 
increased efficiency in solving prob- 
lems (Underwood, 1961), recent de- 
monstration by Cahill and Hovland 
(1960) of the importance of memory 
in the acquisition of concepts sug- 
gested the prediction that homo- 
geneous presentation would favor 
faster learning than would hetero- 
geneous presentation of the relevant 
instances. Further, an interaction 
between the two variables was pre- 
dicted; namely, that the advantages 
of homogeneous presentation would 
be greater for the 25%R problems 
than for the 75%R problems. 


EXPERIMENT I
Method 


Stimulus materials.—Six three-valued di- 
mensions were used: size (small, medium, 
large); number of figures (one, two, three) ; 
form of the figures (circle, triangle, square); 
number of lines on the edges of the cards (one, 
two, three); color (red, blue, green); and 
position of the figures on the cards (right, 
middle, left). From this population, dimen- 
sions were randomly drawn for the problem 
subject to the restriction that each dimension 
be represented equally often as relevant or 
irrelevant over the entire set of problems. 
The stimuli were painted with poster paint 
on white 4 × 6 in. cards. All possible combinations
of the irrelevant and the relevant
dimensions appeared in a given problem deck. 
Instances representing the relevant dimen- 
sions for the 50% R and 75% R problems 
were paired using a table of random numbers 
so that, for example, if color and number of 
figures were relevant, the single figures were 
always painted blue; two figures were always 
red; and three figures, green. Dimensions not 
used were held constant on all cards within a 




TABLE 1 


TRIALS TO CRITERION AND CORRECTLY IDENTIFIED DIMENSIONS AS A FUNCTION
OF CONDITIONS AND PERCENTAGES OF RELEVANT DIMENSIONS: EXP. I AND II


Trials to Criterion Sate ea 
Condition | 25% R | 50% R | 75% R | 25% R | 50% R | 75% R
Mean | Mdn. | SD | Mean | Mdn. | SD | Mean | Mdn. | SD | Mean | % | Mean | % | Mean | %
Se RSS ee eee mee 
aes | ileal te] ales lage eke] 2 lee gee 
ae 3.83 | 3.00 | 2.67 | 3.00 | 2.00 47 | 1.75 | 1.50 [1.30 | .92 al 


problem deck, e.g., when color was not used 
as either a relevant or an irrelevant dimension, 
all stimuli were painted gray. The 3 × 2
factorial design consisted of the three per- 
centages of relevant dimensions and the two 
methods of presentation. Each problem was 
presented under both methods of presenta- 
tion. Each S solved all six problems whose 
sequential order of appearance had been 
determined by a Latin square design such 
that each problem appeared equally often in 
every ordinal position. 
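
The counterbalancing requirement above, that each problem appear equally often in every ordinal position, can be sketched with a cyclic Latin square. The article does not say which square was used, so the cyclic construction, the problem labels, and the assignment of six Ss to each row are all assumptions made for illustration.

```python
from collections import Counter


def cyclic_latin_square(items):
    """Build an n x n Latin square by rotating the item list one step per row."""
    n = len(items)
    return [[items[(row + pos) % n] for pos in range(n)] for row in range(n)]


# Hypothetical labels for the 3 x 2 design (percentage relevant x presentation method).
problems = ["25%R-hom", "25%R-het", "50%R-hom", "50%R-het", "75%R-hom", "75%R-het"]
square = cyclic_latin_square(problems)

# Assumed assignment: 36 Ss spread evenly over the six row orders.
orders = [square[s % len(square)] for s in range(36)]

# Tally how often each problem falls in each ordinal position.
position_counts = Counter(
    (position, problem) for order in orders for position, problem in enumerate(order)
)
```

With 36 Ss and six rows, every problem occupies every ordinal position six times under this assignment.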

Apparatus.—Cards were viewed through a
one-way vision mirror mounted in a large
black screen. The S indicated his response
by pressing one of three telegraph keys. A
reinforcing light was placed immediately
above each key.

Subjects and procedure.—The Ss were 
36 students from introductory psychology 
courses at Indiana University. Experimental 
participation was a course requirement. After 
instructions defining S's task, including an 
enumeration of the possible bases for dis- 
crimination, a set of practice cards which had 
the letters A, B, or C on them was shown to S
until he had responded correctly three times. 
Homogeneous presentation consisted of show- 
ing cards representative of one concept until 
S had correctly identified the cards three 
consecutive times. Then representations of 
the second concept were shown to the same 
criterion, followed by the presentation of the 
third concept. Concept refers to the relevant 
stimulus characteristics of cards associated 
with one of the response keys so that Ss were 
said to learn three concepts to define a 
problem. If color were the relevant dimen- 
sion, learning Response A to red represented 
one concept; Response B to blue was the 


second; and Response C to green, the third. 
In heterogeneous presentation instances of the 
three concepts were assigned randomly with 
the restriction that one instance of the three 
concepts appear in each block of three cards. 
The Ss were run to a criterion of 9 consecutive
correct responses or until 54 trials had been
completed. Each S was then queried about
the basis for solution of the problem before 
going on to the next. 
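
The heterogeneous ordering rule and the stopping rule described above can be sketched as follows. The function names are hypothetical, and whether the trial count includes the criterion run itself is not stated in the text; this sketch assumes it does.

```python
import random


def heterogeneous_sequence(n_trials, concepts=("A", "B", "C"), rng=None):
    """Random order with one instance of each concept in every block of three cards."""
    rng = rng or random.Random()
    sequence = []
    while len(sequence) < n_trials:
        block = list(concepts)
        rng.shuffle(block)          # randomize order within the block
        sequence.extend(block)
    return sequence[:n_trials]


def trials_to_criterion(correct, run_length=9, max_trials=54):
    """Trial on which the run of consecutive correct responses reaches run_length.

    Returns max_trials if the criterion is never met (assumed scoring convention).
    """
    run = 0
    for trial, ok in enumerate(correct[:max_trials], start=1):
        run = run + 1 if ok else 0
        if run == run_length:
            return trial
    return max_trials


sequence = heterogeneous_sequence(54, rng=random.Random(0))
example_score = trials_to_criterion([False] * 3 + [True] * 9)
```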


Results and Discussion 


Generally fewer responses were 
required to reach the criterion follow- 
ing homogeneous presentation than 
following heterogeneous presentation. 
Higher percentage relevant problems 
were learned more rapidly than low 
percentage ones (Table 1).

A Friedman two-way analysis of
variance (Siegel, 1956, p. 166), employed
because of the heterogeneity of
variance, yielded a significant χ²r of
18.35 (df = 2, P < .001). The chi
square for the three levels of percentage
of relevant dimensions with heterogeneous
presentation was also significant
(χ²r = 27.56, df = 2, P < .001).
The z conversions for the sign test
(Siegel, 1956, p. 72) used to assess the
differences between presentation conditions
within the 25%R, 50%R, and
75%R conditions were 3.04 for 25%R
(P = .0012), 2.70 for 50%R
(P = .0035), and 1.33 for 75%R
(P = .0934). The predicted interaction
was found: homogeneous presentation
had a greater facilitating
effect for the low percentage of relevant
dimensions than for higher
ones.
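
The two nonparametric statistics used above can be sketched in a few lines. The Friedman statistic follows the standard formula χ²r = 12 / [Nk(k + 1)] ΣRj² − 3N(k + 1), where N is the number of Ss, k the number of conditions, and Rj the rank sum for condition j; no correction for ties is applied here. The one-tailed probability for a sign-test z is taken from the normal curve. The function names are hypothetical and the raw scores are not reproduced.

```python
import math


def average_ranks(row):
    """Rank the values in one S's row from 1 (lowest), giving tied values the mean rank."""
    order = sorted(range(len(row)), key=lambda i: row[i])
    ranks = [0.0] * len(row)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and row[order[j + 1]] == row[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def friedman_chi_square(table):
    """Friedman statistic for a table with one row per S and one column per condition."""
    n, k = len(table), len(table[0])
    rank_sums = [0.0] * k
    for row in table:
        for col, rank in enumerate(average_ranks(row)):
            rank_sums[col] += rank
    return 12.0 / (n * k * (k + 1)) * sum(r * r for r in rank_sums) - 3 * n * (k + 1)


def one_tailed_p(z):
    """Upper-tail area of the standard normal curve beyond z."""
    return 0.5 * math.erfc(z / math.sqrt(2))


p_25 = one_tailed_p(3.04)
p_50 = one_tailed_p(2.70)
p_75 = one_tailed_p(1.33)
```

The first two probabilities agree with the reported values to four decimals; the third comes out near .09, which like the reported .0934 falls short of the .05 level.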

Hull (1920) in his classical study of 
concept identification using Chinese 
characters noted that Ss were not 
necessarily able to define verbally 
the property common to a specific 
concept even though they could assign 
stimuli to concepts correctly. In 
contrast, Bourne and Haygood (1959) 
reported that their Ss were almost 
always able to label the correct dimen- 
sions, even when more than one 
dimension was relevant for a par- 
ticular problem. Table 1 contains an 
analysis of verbal identification in the 
present experiment. The mean num- 
ber of correctly identified dimensions 
increased significantly from the heterogeneous
to the homogeneous presentation
(z transformation of sign
test = 2.94, P = .0016) and increased
as the percentage of relevant dimensions
increased (Friedman χ²r = 14.59,
df = 2, P < .001). The proportions
of the number of correct identifica- 
tions relative to the total number of 
correct dimensions that could have 
been named demonstrated clearly that 
the majority of Ss were reporting only 
one dimension, even when additional 
dimensions could have been used: 
2 of the 36 Ss identified both correct 
dimensions for the 50%R problems; 
16 of the 36 Ss identified two of the 
three correct dimensions for the 
75%R problems; but no S identified 
all three. The numbers of correct 
assignments of concept instances to 
the response keys were not reported, 
since the ρ's with the number of
correct identifications of the dimensions
ranged from .82 for the 25%R
problems to .96 for the 75%R
problems.


EXPERIMENT II


The superiority of homogeneous 
presentation in Exp. I may have 
resulted from the closer proximity of 
instances of a given concept in that 
condition. Another possibility is that 
the absence of interference from 
presentation of instances of other 
concepts permitted faster learning. 
In Exp. II the problems were pre- 
sented using a homogeneous sequence 
while preserving the exact temporal 
ordering of the instances in the related 
heterogeneous condition of Exp. I. 
The intervals were filled with a digit 
cancellation task for one group and 
left unfilled for another. The control 
Ss learned the problems using the 
homogeneous condition of Exp. I.


Method 


Both of the two experimental conditions, 
temporally proximate (P) and spaced (S), 
used the homogeneous sequence of presenta- 
tion of Exp. I. Variations in temporal 
separation of the concept instances dis- 
tinguished the two conditions. Condition P 
was identical with the homogeneous condition 
of Exp. I. In Cond. S, instances of the first 
concept were shown using the temporal 
intervals which existed in the heterogeneous 
problem of the same percentage of relevant 
dimensions, but the intervals were either 
filled with digit cancellation (Sp) or left 
unfilled (S5). During the unfilled intervals 
Ss sat silently in front of the darkened 
aperture. Then instances of the second 
concept arranged to simulate its heterogene- 
ous problem presentation sequence were 
shown followed by the simulation of the 
third. For all groups, instances of one concept 
were presented until S had emitted three 
consecutive correct responses before instances 
of the second were presented. 

The Ss, 36 students from introductory 
psychology courses at Indiana University who 
had not participated in similar experiments, 
were assigned randomly with the restriction 
that an equal number of Ss experience each 
condition. The remainder of the experimental 
procedure was identical to that of Exp. I.




Results and Discussion 


Application of a Kruskal-Wallis 
one-way analysis of variance of trials 
to criterion (Siegel, 1956, p. 184) 
yielded an H of 6.93 comparing the 
three conditions of presentation 
(Table 1) for the 25%R problems 
which, with 2 df, was significant be- 
tween the .02 and .05 levels; the 
comparable Hs for the 50%R problems
(4.66) and the 75%R problems
(2.66) were associated with prob- 
abilities greater than .05, both with 
2 df. Differences between the tem- 
porally proximate presentation of the 
25%R problem and either of the 
spaced methods of presentation were 
not statistically reliable; however, the
use of digit cancellation did signifi- 
cantly increase the number of trials 
required for solution relative to the 
spaced condition with an unfilled 
interval (P = .018 using the median 
test, Siegel, 1956, p. 111). Ap- 
parently, lengthening the interval 
between instances of a concept did not 
in itself significantly slow learning. 
Rather, interference from instances 
displaying another concept or the 
introduction of another task such as 
digit cancellation appeared to retard 
learning, particularly with problems 
characterized by a low percentage of 
relevant cues.
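
The Kruskal-Wallis comparison above can be sketched with the standard formula H = 12 / [N(N + 1)] Σ(Rj² / nj) − 3(N + 1), where Rj is the rank sum and nj the size of group j. With 2 df the chi-square upper-tail probability has the closed form exp(−H / 2), which allows a check on the reported significance levels. The function names are hypothetical, no tie correction is applied, and the sample groups in the tests are made up.

```python
import math


def kruskal_wallis_h(groups):
    """Kruskal-Wallis H for a list of groups of scores (tied scores get the mean rank)."""
    pooled = sorted(x for group in groups for x in group)
    rank_of = {}
    i = 0
    while i < len(pooled):
        j = i
        while j + 1 < len(pooled) and pooled[j + 1] == pooled[i]:
            j += 1
        rank_of[pooled[i]] = (i + j) / 2 + 1
        i = j + 1
    n = len(pooled)
    total = sum(sum(rank_of[x] for x in group) ** 2 / len(group) for group in groups)
    return 12.0 / (n * (n + 1)) * total - 3 * (n + 1)


def chi_square_p_df2(x):
    """Upper-tail probability of chi square with exactly 2 df."""
    return math.exp(-x / 2.0)


p_25 = chi_square_p_df2(6.93)   # H reported for the 25%R problems
p_50 = chi_square_p_df2(4.66)   # H reported for the 50%R problems
p_75 = chi_square_p_df2(2.66)   # H reported for the 75%R problems
```

The probability for H = 6.93 falls between .02 and .05, and the other two exceed .05, in agreement with the text.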

Results of Ss’ identification of the 
correct dimensions reflected the trends 
shown in Exp. I. The higher the 
percentage of relevant cues the more 
frequently Ss were able to label at 
least one correct dimension, although 
the different manipulations of the 
homogeneous condition in Exp. II
were not portrayed in these data. 
Again, few Ss identified more than one 
correct dimension. No Ss reported 
two for the 50%R problems, nine 
reports of two correct dimensions were 
given for the 75%R problems, and two 




reports of three dimensions were 
made. Differences in the mean num- 
ber of trials to criterion and in the 
mean number of correctly identified 
dimensions between comparable con- 
ditions of Exp. I and II were not 
statistically significant, both chi 
squares being less than 1. 


DISCUSSION 


Because lengthening the interval be- 
tween presentations of instances of the 
same concept did not have a significant 
effect upon the learning of concepts, the 
efficacy of homogeneous presentation did 
not appear to reflect massing of practice, 
per se. Introduction of conditions which 
would be expected to increase the likelihood
of some kind of interference, such
as the heterogeneous method of presentation
or the digit cancellation task, was
associated with slower learning of the 
concepts, particularly when the concepts 
to be learned contained a low percentage 
of relevant dimensions. It is possible 
that when a high percentage of the 
dimensions were relevant the problems 
were learned so rapidly that these factors 
became relatively unimportant or exerted 
an influence too transitory to be reflected
in the response measures employed.
Furthermore, an unpublished replication
of Exp. I, using different dimensions to
constitute the problems, yielded almost
identical results.

Another observation was the infrequent
identification of more than one
correct dimension even when additional
dimensions were available and each S
had been told what dimensions might be
used. The assumption might be made
that as relevant dimensions were added,
the stimulus pool from which S sampled
would have increased so that Ss were
actively selecting from a larger population
of relevant cues for the problems of
higher percentage of relevant cues than
for problems of lower percentage. More
in accord with the data would be the
interpretation that over a group of Ss the
probabilities increased that each S would
identify at least one correct dimension
without necessarily having been able to
report the presence of other relevant
dimensions.


SUMMARY 


Two experiments examined the effects of 
the variation of the percentage of relevant 
dimensions and the method of presentation 
of concept instances on rate of concept 
identification. Problems consisting of 25%,
50%, and 75% relevant cues were combined 
factorially with four different dimensions. 
Instances of one concept were presented until 
the criterion of learning had been achieved, 
then instances of the second concept were 
presented followed by the third for the homo- 
geneous condition. In the heterogeneous 
condition, instances of the three concepts 
were presented in a random sequence. The 
predictions that the number of responses prior 
to criterion would be inversely related both 
to the percentage of relevant cues and to the 
temporal proximity of the instances associated 
with a given response were supported. 
Homogeneous presentation was more ad- 
vantageous with 25% R than with 50% R and 
75% R. Experiment II demonstrated that 
the lesser efficiency of heterogeneous presenta- 
tion was not a function of the greater tem- 
poral intervals occurring between instances 
of the same concept, but rather of interference
effects from other concepts, at least
with 25% R problems. 

Analyses of correctly identified dimensions 
suggested an interaction effect between the 
percentage of relevant cues and the method 
of presentation. Few Ss reported the presence 
of more than one relevant dimension for the 
problems with two or three completely re- 
dundant relevant dimensions. 


REFERENCES 


BOURNE, L. E., JR., & HAYGOOD, R. C. The
role of stimulus redundancy in concept
identification. J. exp. Psychol., 1959, 58,
232-238.

CAHILL, H. E., & HOVLAND, C. I. The role of
memory in the acquisition of concepts.
J. exp. Psychol., 1960, 59, 137-144.

HULL, C. L. Quantitative aspects of the
evolution of concepts. Psychol. Monogr.,
1920, 28(1, Whole No. 123).

SIEGEL, S. Nonparametric statistics for the
behavioral sciences. New York: McGraw-
Hill, 1956.

UNDERWOOD, B. J. An orientation for research
on thinking. Psychol. Rev., 1952,
59, 209-220.

UNDERWOOD, B. J. Ten years of massed
practice on distributed practice. Psychol.
Rev., 1961, 68, 229-247.


(Received December 15, 1961) 


Journal of Experimental Psychology
1962, Vol. 64, No. 6, 628-630 


EASE OF CONCEPT ATTAINMENT AS A FUNCTION 
OF ASSOCIATIVE RANK 1


SARNOFF A. MEDNICK 2
University of Michigan 


Underwood (1952) has suggested a 
method for the study of concept 
formation which assumes that the 
attainment of a concept calls for the 
perception of a relationship between 
concept instances. The perception of 
this relationship, in part, depends on 
the probability of the occurrence of 
the relevant associative response to 
the concept instances. This prob- 
ability is termed response dominance. 
The mean response dominance of all 
instances representing the concept is 
termed dominance level. Underwood 
and Richardson (1956b) have shown 
that the ease of attainment of a 
concept is directly related to its 
dominance level. 
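
Dominance level, as defined above, is the mean of the response dominance values of a concept's instances. A short sketch using the percentages listed in Table 1 of this article shows the computation; the dictionary layout and key names are hypothetical.

```python
from statistics import mean

# Response dominance (%) of each instance, as listed in Table 1.
response_dominance = {
    ("List 1", "AR 1", "ROUND"): {"POT": 29, "EYE": 32, "DIME": 30, "GRAPE": 18},
    ("List 1", "AR 2", "WHITE"): {"FROST": 34, "GARDENIA": 28, "LARD": 27, "BONE": 34},
    ("List 2", "AR 1", "WHITE"): {"HOSPITAL": 32, "ENAMEL": 28, "GOAT": 29, "BREAD": 35},
    ("List 2", "AR 2", "ROUND"): {"BADGE": 21, "PILL": 28, "WAIST": 24, "CAPSULE": 22},
}

# Dominance level = mean response dominance over the four instances of a concept.
dominance_level = {
    concept: mean(instances.values())
    for concept, instances in response_dominance.items()
}
```

Rounded to whole percentages, these means reproduce the dominance levels printed in the table.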

This study explores a methodo- 
logical variable which determines ease 
of concept attainment. The variable 
under investigation is the rank posi- 
tion of the concept response in the 
associative hierarchy of the concept 
instance. To the concept instance 
BELLY the sensory associate ROUND is 
of Rank 1 with a dominance level of 
43% (Underwood & Richardson, 
1956a). The sensory associate SOFT
is of Rank 2 with a dominance level 
of 24%. To the concept instance, 


1 This study was completed while the 
senior author was a visiting research psy- 
chologist at the Institute of Personality 
Assessment and Research, University of 
California, Berkeley. The support of the 
National Science Foundation (Grant No. 
G3855) and the Cooperative Research 
Program, Office of Education, United States 
Public Health Service (Contract 1073) is 
acknowledged. 

2 Now at the Psychological Institute, Kom- 
munehospitalet, Copenhagen, Denmark. 


AND 


SHARON HALPERN 
University of California, Berkeley 


PAIL the sensory associate METALLIC
is of Rank 1 and has dominance level
of 24%. While METALLIC to PAIL and
SOFT to BELLY are equal in response
dominance they vary in their positions
in their respective associative
hierarchies: METALLIC is of Rank 1;
while SOFT is of Rank 2.
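
The distinction between response dominance and rank can be made concrete with the two nouns discussed above. The sketch orders each noun's associates by dominance and reads off the rank; only the percentages quoted in the text are included, so the hierarchies shown are truncated, and the function name is hypothetical.

```python
def associative_ranks(hierarchy):
    """Map each response to its rank, with Rank 1 the most dominant response."""
    ordered = sorted(hierarchy, key=hierarchy.get, reverse=True)
    return {response: rank for rank, response in enumerate(ordered, start=1)}


# Response dominance (%) quoted in the text; other associates are omitted.
belly = {"ROUND": 43, "SOFT": 24}
pail = {"METALLIC": 24}

belly_ranks = associative_ranks(belly)
pail_ranks = associative_ranks(pail)

# Equal dominance (24%) but different positions in the two hierarchies.
same_dominance = belly["SOFT"] == pail["METALLIC"]
```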

This experiment compares ease of 
attainment of concepts as a function 
of the rank position of the concepts in 
the associative hierarchy of the con- 
cept instances. For reasons developed 
below it is predicted that first ranking
concepts will be attained in fewer 
trials and with fewer errors than will 
second ranking concepts. 


METHOD 


Lists.—The words used were concrete
nouns selected from a list of 213 nouns for
which Underwood and Richardson (1956a)
have ascertained the dominance level of
various responses. As can be seen in Table 1,
four groups of instances were assembled:
associative Rank 1 (AR 1) WHITE, AR 1
ROUND, associative Rank 2 (AR 2) WHITE,
and AR 2 ROUND. This was necessary because
of the possibility that the concepts
might differ in difficulty or that concept
difficulty might interact with associative
rank. List 1 consisted of AR 1 ROUND and
AR 2 WHITE while List 2 contained the other
two concepts. A buffer concept LONG
included in both lists (EEL, BEAK, [illegible],
CUCUMBER) was used to make it more difficult
for Ss to attain the concepts by elimination
procedures. As is shown in Table 1, the
mean dominance levels of concepts were kept
nearly constant. In constructing the AR 2
concepts care was taken to avoid having the
concept instances elicit a common first
ranking response.

The instances were presented to S in three
random orders. The same three orders were
used for both lists with positions occupied by


TABLE 1

CONCEPT LISTS WITH ASSOCIATIVE RANK, RESPONSE DOMINANCE, AND MEAN
DOMINANCE LEVEL INDICATED

Rank in Associative Hierarchy | List 1 Noun | Response Dominance | Concept and Mean Dominance Level | List 2 Noun | Response Dominance | Concept and Mean Dominance Level
1 | POT      | 29% | Round, 27% | HOSPITAL | 32% | White, 31%
1 | EYE      | 32% |            | ENAMEL   | 28% |
1 | DIME     | 30% |            | GOAT     | 29% |
1 | GRAPE    | 18% |            | BREAD    | 35% |
2 | FROST    | 34% | White, 31% | BADGE    | 21% | Round, 24%
2 | GARDENIA | 28% |            | PILL     | 28% |
2 | LARD     | 27% |            | WAIST    | 24% |
2 | BONE     | 34% |            | CAPSULE  | 22% |
AR 1 instances in List 1 being occupied by
AR 2 instances in List 2. The positions of the
buffer terms were not changed between lists,
although they were randomized in the three
orders that were used.

Subjects.—The Ss were 30 undergraduate
paid volunteers. The 15 men and 15 women
were divided as equally as possible between
the two lists.

Procedure.—The lists were presented at a
4-sec. rate on a Gerbrand's type memory
drum. A 12-sec. interval occurred between
presentations of the lists.

The Ss were informed that the list contained
12 words that could be placed in three
groups of 4 words each, and that all of the 4
words in each of these groups could be
described by the same adjective. The Ss
were required to respond to each word. The
task was continued to a criterion of one
perfect trial, or terminated at 20 trials. A
more complete discussion of the materials
and procedure may be found elsewhere
(Freedman & Mednick, 1958).


RESULTS AND DISCUSSION


The data which were subjected to
analysis were the number of trials to
one perfect trial on a concept (this
meant giving the correct concept
response to all four instances of a
concept on the same trial) and the
number of errors made on each concept
in the entire course of the experiment.
As noted above, the buffer
concept LONG was omitted from this
analysis. AR 1 concepts (List 1,
WHITE, List 2, ROUND) were compared
with AR 2 concepts (List 1, ROUND,
List 2, WHITE). Two List 2 Ss failed
to solve any concept and were dropped
from further analysis.

The AR 1 concepts were attained 
earlier. The mean number of trials 
taken to solve each concept was 5.90 
for the AR 1 concepts and 8.09 for the 
AR 2 concepts, a significant difference 
(t = 2.96, df = 27, P < .01). The
mean number of errors on the AR 1
and AR 2 concepts were 12.17 and
18.14, respectively. This difference
was significant (t = 2.09, df = 27,
P < .05).
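
The form of the t test is not stated, but 27 df with 28 remaining Ss is consistent with a comparison of each S's AR 1 and AR 2 scores. On that assumption, a minimal sketch of a paired t follows; the function name is hypothetical, the individual scores are not available, and the critical values are the usual two-tailed table entries for 27 df.

```python
import math
from statistics import mean, stdev


def paired_t(first, second):
    """Paired t statistic and df for two lists of scores from the same Ss."""
    diffs = [b - a for a, b in zip(first, second)]
    n = len(diffs)
    t = mean(diffs) / (stdev(diffs) / math.sqrt(n))
    return t, n - 1


# Two-tailed critical values of t for df = 27 (standard table values).
T_CRIT_DF27 = {0.05: 2.052, 0.01: 2.771}

trials_significant_at_01 = 2.96 > T_CRIT_DF27[0.01]   # reported t for trials
errors_significant_at_05 = 2.09 > T_CRIT_DF27[0.05]   # reported t for errors
```

Against these table values, the reported t for trials exceeds the .01 criterion and the reported t for errors exceeds the .05 criterion only.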


The results are intuitively satisfying 
but detailed analysis of their interpreta- 
tion is somewhat intricate. We have 
found that AR 1 concepts are attained 
more easily than AR 2 concepts despite 
the fact that their dominance levels are 
equal. Above, we have referred to these 
ranks as indicating a position in an 
associative hierarchy. However, this 
hierarchy is, in a sense, a figment. The 
norms which provide us with dominance 
levels and ranks are based on Ss giving 
single sensory associates to each of 213 
nouns. Actually, while WHITE is a 
second ranking response to GARDENIA 
and a first ranking response to ENAMEL,


28% of the norm group gave WHITE as 
their first and only response to these 
nouns, and in both cases, 72% gave some 
other response. Thus, when we refer 
to these concept responses as occupying 
positions in an S’s associative hierarchy 
we are making the implicit assumption 
that the associative hierarchy produced 
by collating the group’s single responses 
is reflected, to a large extent, in each
individual. In other words, we are 
assuming that everyone has just about 
the same basic associative hierarchy; the 
fact that we get variation in single 
response norms we would then attribute 
to momentary fluctuations in associative 
strength. Thus, if Underwood and 
Richardson (1956a) had asked their Ss
to give more than one response to each
noun, a large proportion of the 72% that
did not give WHITE as their first response
to ENAMEL would have given it as their 
second or third response. If this 
situation were applied to the present 
experiment then the superiority of the 
AR 1 concepts would be understandable. 
The AR 2 concept responses occupy an 
inferior position (relative to the AR 1 
concept responses) in almost everyone's 
associative hierarchy. This means the 
AR 1 concept responses would be elicited 
earlier in the course of the experiment.
This experiment may then be seen as 
supporting this assumption of homo- 
geneity of hierarchies. Research on 
word associations (Cofer, 1958; Rosen & 
Russell, 1957) has contributed considerable
support to this same assumption in
another context. 


SUMMARY 


Thirty Ss were presented with lists of 12 
nouns and instructed to discover into what 
three groups the nouns could be divided and 
what adjective could describe each group. 
The lists consisted of concepts of equal levels 
of dominance; the position of the concept 
responses in the associative hierarchy was 
manipulated. The concepts having higher 
rank position in the associative hierarchy 
were attained more quickly and with fewer 
errors. 


REFERENCES 


COFER, C. N. Comparison of word associations
obtained by the methods of discrete
single word and continued association.
Psychol. Rep., 1958, 4, 507-510.

FREEDMAN, J. L., & MEDNICK, S. A. Ease
of attainment of concepts as a function of
response dominance variance. J. exp.
Psychol., 1958, 55, 463-466.

ROSEN, E., & RUSSELL, W. A. Frequency
characteristics of successive word association.
Amer. J. Psychol., 1957, 70, 120-122.

UNDERWOOD, B. J. An orientation for research
on thinking. Psychol. Rev., 1952,
59, 209-220.

UNDERWOOD, B. J., & RICHARDSON, J. Some
verbal materials for the study of concept
formation. Psychol. Bull., 1956, 53,
84-95. (a)

UNDERWOOD, B. J., & RICHARDSON, J. Verbal
concept learning as a function of instructions
and dominance level. J. exp. Psychol.,
1956, 51, 229-238. (b)


(Received December 18, 1961) 


Journal of Experimental Psychology 
1962, Vol. 64, No. 6, 631-635


CONCEPT IDENTIFICATION UNDER MISINFORMATIVE 
AND SUBSEQUENT INFORMATIVE 
FEEDBACK CONDITIONS 1

WALTER J. JOHANNSEN 


Veterans Administration Center, Wood, Wisconsin 


Recent research on human concept 
identification has aimed at delineating 
the effects of feedback class on attain- 
ment rate. In particular a recent 
study by Pishkin (1960), using mis- 
informative feedback (MF), reveals 
striking decrement when even small 
percentages of erroneous, task rele- 
vant feedback information are in- 
serted into schedules of informative or 
correct feedback (IF). The present 
study seeks to extend Pishkin’s results 
by assessing the effect of MF on 
subsequent concept identification un- 
der conditions of 100% IF. 

Appropriate design makes possible 
the concurrent examination of an- 
other, associated question. Pishkin 
found probability matching behavior 
in his concept identification study, as 
did Goodnow and Postman (1955) in 
a study using MF by implication. On
the other hand, Morin (1955), who 
made use of MF in a simpler learning 
situation, was unable to demonstrate 
an adequate match in his data. He 
suggested that the failure of the ob- 
tained curves to approach an asymp- 
tote was a factor in his results. The 
present experiment is designed to 
circumvent this problem by carrying 
performance under MF/IF to a point 


1 The statistical analysis of this paper was 
carried out in part under contract with the 
Wisconsin Alumni Research Foundation. 
The author wishes to express his appreciation 
to the University of Wisconsin Numerical
Analysis Laboratory and to E. James Archer 
for assistance with computations of the trend 
test; and to Conrad Nuthmann, Samuel H. 
Friedman, H. Allen Page, and Richard M. 
Lundy for their critical comments. 


where asymptote is more nearly
approximated.

A final interest of this paper is the
description of acquisition curves under
MF/IF. Pishkin's analysis, related
to the Restle (1955) discrimination
learning model, is unconcerned with
the nature of the attainment process.
Yet Morin's trend analysis of his
data suggests a more complex process
than that typical of the probability
matching studies. It is of interest to
determine whether these findings can
be replicated in data derived from a
more difficult task.
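
Probability matching, referred to in the introduction above, can be illustrated with a simplified two-choice sketch. It assumes, for illustration only, that on each trial S gives the concept-consistent response with a fixed probability and that the feedback confirms that response on IF trials and the opposite response on MF trials. The function name and the 75% IF example are hypothetical.

```python
def agreement_with_feedback(p_if, response_rate):
    """Expected proportion of trials on which the feedback signals 'correct'.

    p_if          -- proportion of informative-feedback (IF) trials
    response_rate -- probability that S gives the concept-consistent response
    """
    return response_rate * p_if + (1 - response_rate) * (1 - p_if)


p_if = 0.75                                         # e.g., the 25:75 MF/IF schedule
matching = agreement_with_feedback(p_if, p_if)      # S matches the IF percentage
maximizing = agreement_with_feedback(p_if, 1.0)     # S always responds consistently
```

Under these assumptions a matching S responds consistently on 75% of trials and is confirmed on 62.5% of them, whereas a consistently responding S is confirmed on 75%.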


METHOD 


Experimental conditions.—The Ss were randomly
assigned to one of four MF/IF condi-
tions and one of three task complexity con- 
ditions. The percentages of MF/IF em- 
ployed were 0:100, 12.5:87.5, 25:75, and 
37.5:62.5. Task complexity was simul- 
taneously manipulated by varying the 
number of dimensions irrelevant to problem 
solution while holding constant the number of 
relevant dimensions. A single dimension was 
relevant for all conditions, and either 1, 3, or 6 
dimensions were irrelevant. The design 
therefore describes a 3 × 4 orthogonal plot.
Ten Ss were tested in each of the 12 resulting 
cells. 

Subjects.—The Ss were 124 sophomore
psychology students attending the University
of Wisconsin. All had volunteered for the
experiment in order to gain credit applicable 
to their class grades. Four Ss were eliminated 
for failure to comply with instructions. 

Apparatus and procedure.—Stimuli consisted
of geometric figures drawn on 3 × 5 in.
cards. Each card was inscribed with a single 
figure. Figures varied according to the 
following dimensions and values within 
dimensions: form (rectangle-triangle), size 
(large-small), location (center-right of center),
position (vertical-horizontal), figure color
(black-blue), ground color (red-white), dot
within figure (presence-absence).

TABLE 1

DUNCAN RANGE ANALYSIS: MEAN PERFORMANCE SCORES DURING MF/IF TRIALS

Condition | Mean | SD
0:1 | 198.8 | .83
0:3 | 194.4 | 5.02
0:6 | 186.6 | 9.29
12.5:1 | 173.8 | 22.26
25:1 | 142.5 | 33.89
12.5:3 | 142.0 | 28.99
12.5:6 | 141.4 | 32.79
25:3 | 119.6 | 15.20
37.5:1 | 105.3 | 10.39
37.5:3 | 102.7 | 10.60
25:6 | 100.9 | 5.26
37.5:6 | 96.3 | 13.41

Note.—Means joined by vertical line do not differ
significantly; means not so joined are significantly
different (P < .05).

The apparatus consisted of a 20 × 36 in.
flat-black panel mounted vertically on a table
of normal height, which served to separate S
from E. A 3 × 5 in. aperture was cut into
the center of the panel slightly below eye
level. Stimuli were manually inserted into
this aperture from the rear.

Two 7.5-w. bulbs, one red and one white, 
were mounted side by side in sockets set 6 in. 
apart, 8 in. above and to either side of the 
presentation aperture. These lights served 
as feedback signals and were controlled by 
E, using two Western Union telegraph keys, 

Instructions were read to S, informing him 
that he was to take part in a concept identi- 
fication experiment, and that his task would 
involve the classification of cards placed 
before him. Specifically, S was told to label 
each card either A or B, with each A card 
having something in common and each B 
card having something in common. The 
flashing of a white light would indicate a 
correct response and a red light an incorrect 
response. No references were made to the 
presence of MF. Groups serving under the 
different dimensions-irrelevant (DI) condi- 
tions were read supplementary instructions 
in accordance with Hovland's (1952) procedure,
in which Ss are informed of the range
of values and dimensions available to them.
For all groups Category A constituted a 
vertical figure and B a horizontal figure. 
Following these instructions questions were 




answered with a paraphrase of the original 
instructions.

A schedule of MF/IF was developed for 
each condition. The occurrence of MF was 
randomized within each block of 10 trials, 
but with each block receiving approximately 
the same number of MF trials. All Ss within 
a feedback group performed under the same 
schedule.
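The blocked randomization described above can be sketched as follows. This is only an illustration under stated assumptions, not the schedule actually used: the function name `mf_schedule`, the seed, and the rule of carrying fractional quotas forward from block to block (so that 12.5% MF yields one or two MF trials per block of 10) are all hypothetical.

```python
import math
import random


def mf_schedule(n_trials=200, mf_fraction=0.125, block_size=10, seed=0):
    """Return a list of booleans, True marking a misinformative (MF) trial.

    Assumption: fractional per-block quotas are handled by carrying the
    remainder forward, so block counts differ by at most one.
    """
    rng = random.Random(seed)
    schedule = []
    owed = 0.0    # MF trials called for so far by the nominal rate
    placed = 0    # MF trials actually scheduled so far
    for _ in range(n_trials // block_size):
        owed += mf_fraction * block_size
        k = math.floor(owed + 0.5) - placed   # MF trials for this block
        mf_positions = set(rng.sample(range(block_size), k))
        schedule.extend(i in mf_positions for i in range(block_size))
        placed += k
    return schedule
```

Because one fixed seed gives one fixed list, all Ss within a feedback group can be run under the same schedule, as the procedure requires.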

All experimental Ss received 200 trials 
under MF/IF conditions and were then 
shifted to a 100% IF schedule until a criterion 
of 10 successive correct responses was 
achieved. Control Ss who usually made long 
runs of correct responses in less than 100 trials 
were terminated after 20 successive correct 
responses and, for purposes of analysis, were 
credited with an additional number of correct 
responses equal to the difference between 200 
and the number of the terminal trial. 
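The crediting rule for control Ss is simple arithmetic and can be written out as a short sketch. The function name and the example figures are hypothetical and serve only to show the computation.

```python
def credited_correct(observed_correct, terminal_trial, total_trials=200):
    """Correct responses credited to a control S who was stopped early.

    The S is credited with one correct response for every trial between
    the terminal trial and the nominal 200-trial block.
    """
    return observed_correct + (total_trials - terminal_trial)


# Hypothetical S: 38 correct responses, terminated on Trial 46.
example_score = credited_correct(38, 46)
```

For this hypothetical S the credited score is 38 + (200 - 46) = 192.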

On a given trial E randomly selected a
stimulus card from the shuffled pack before 
him and placed it in the presentation aperture 
in front of S. After S responded verbally, E 
recorded the classification of the card (A or B) 
and whether S had responded correctly or 
incorrectly. After reference to the MF/IF 
schedule, E determined whether MF or IF
was to be administered on that trial and 
pressed one of the two keys to signal feedback. 
Average time per trial was approximately 
0 sec, 


RESULTS AND DISCUSSION 


Performance under MF/IF condi- 
tions.—Prior to examination of the 
acquisition process, a preliminary 
analysis of variance was performed on 
the mean number of correct responses 


Fig. 1. Mean number correct responses
per 20-trial block under MF/IF and subsequent
IF conditions. (Parameter is MF
percentage. One irrelevant dimension.)


CONCEPT IDENTIFICATION 


Fig. 2. Mean number correct responses
per 20-trial block under MF/IF and subsequent
IF conditions. (Parameter is MF
percentage. Three irrelevant dimensions.)


during the MF/IF block, but also 
including control conditions. The 
results demonstrate significant differences
as a function of DI (F = 14.60,
df = 2/108, P < .01), of MF%
(F = 668.20, df = 3/108, P < .01),
and of the interaction between the two
variables (F = 6.98, df = 6/108,
P < .01). A supplementary Duncan
range analysis, reported in Table 1, 
indicates the ordering of means and 
the position of significant differences 
dividing the cells. 

As anticipated, the increasing num- 
ber of DI is related to poorer perform- 
ance, in agreement with earlier re- 
search (Archer, Bourne, & Brown, 
1955). The extremely large F ratio 
ascribed to MF% is partially a func- 
tion of bias introduced by inclusion of 
control conditions where optimal per- 
formance was reached early in train- 
ing. Examination of the Duncan 
range results indicates that the num- 
ber of correct responses diminishes 
regularly with increasing stimulus 
difficulty and MF%, although a few 
inversions of order exist. 

In order to analyze the acquisition 
process under MF/IF conditions, a 
trend test (Grant, 1956) was per- 
formed on the group scores, with the 
number of correct responses per 20-trial
block providing the raw data.
Performance curves obtained from 
each of the 12 conditions appear in 
Fig. 1-3. Because the small cell fre- 
quencies and the limitations on the 
range of possible cell scores yielded 
truncated distributions, an arc-sine 
transformation was performed (Sne- 
decor, 1946, p. 445) and analysis 
conducted on the transformed data. 
Significant differences between group 
means occur as a function of MF% 
(F = 38.53, df = 2/81, P < .01) and
DI (F = 10.33, df = 2/81, P < .01) 
when control data are omitted. Elim- 
ination of the control cells reduces the 
MF X DI interaction to the extent 
that it is no longer significant. The 
overall acquisition curve appears complex,
consisting of significant linear
(F = 73.13, df = 1/81, P < .01),
quadratic (F = 31.56, df = 1/81,
P < .01), and cubic (F = 10.33,
df = 1/81, P < .01) components.
Group differences occur only in the 
slope of the different MF curves 
(F = 91.25, df = 2/81, P < .01). 
These results are consistent with 
Morin’s observation of curve com- 
ponents of a higher order than 
quadratic. 
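The two computational steps in this analysis, the arc-sine transformation of block scores and the separation of the acquisition curve into linear, quadratic, and cubic components, can be sketched as follows. This is an illustration under assumptions: the angle is expressed in degrees, the contrast coefficients are obtained by Gram-Schmidt orthogonalization rather than taken from published tables, and the function names are hypothetical. It shows only how the contrasts are formed, not the full analysis of variance described by Grant (1956).

```python
import math


def arcsine_transform(n_correct, n_trials=20):
    """Angular transformation of a proportion correct, in degrees."""
    p = n_correct / n_trials
    return math.degrees(math.asin(math.sqrt(p)))


def orthogonal_polynomials(n_points, max_degree=3):
    """Orthogonal polynomial contrasts for equally spaced blocks.

    Returns coefficient lists for degrees 1..max_degree, built by
    Gram-Schmidt orthogonalization of the powers of the block index.
    """
    xs = [i - (n_points - 1) / 2 for i in range(n_points)]
    basis = []
    for degree in range(max_degree + 1):
        v = [x ** degree for x in xs]
        for b in basis:
            proj = sum(vi * bi for vi, bi in zip(v, b)) / sum(bi * bi for bi in b)
            v = [vi - proj * bi for vi, bi in zip(v, b)]
        basis.append(v)
    return basis[1:]   # drop the constant term


def trend_component(block_means, coeffs):
    """Value of one trend contrast applied to a curve of block means."""
    return sum(m * c for m, c in zip(block_means, coeffs))
```

With 10 blocks of 20 trials, a perfectly straight acquisition curve yields a nonzero linear contrast and zero quadratic and cubic contrasts; departures from zero in the higher contrasts are what the trend test evaluates.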

Fig. 3. Mean number correct responses
per 20-trial block under MF/IF and subsequent
IF conditions. (Parameter is MF
percentage. Six irrelevant dimensions.)

Performance under subsequent 100%
IF conditions.—Analysis of subsequent
learning under 100% IF conditions
involved several procedural
problems. A few Ss serving in the 
12.5% MF cells achieved levels of 
errorless performance during the last 
block of MF/IF trials. To drop these 
Ss from the succeeding IF series would 
have resulted in a sampling bias, in 
the sense that the “better learners” 
would be eliminated from the simpler 
conditions and retained in the more 
difficult conditions. The alternative, 
which was adopted, was to retain 
these Ss and require achievement of 
the same criterion as the other Ss. 
The net effect would be slightly to 
enhance the probability of a spuri- 
ously significant difference between 
MF groups on learning under 100% 
IF conditions. 

Control of differential group attainment
under MF/IF was also needed.
It was reasoned that performance
differences under 100% IF were a
joint function of DI, prior learning,
and the residual effects of the termi- 
nated MF. Thus, analysis of variance 
dealing with trials to criterion under 
100% IF would yield an apportioning 
of variance attributable to the effect 
of experimental variables combined 
with the effect of previous learning. 
An analysis of covariance, partialing 
out the effect of earlier learning, 
would allow evaluation of the experi- 
mental variables alone. 

Since a Pearson r revealed high 
correlation between within-group vari- 
ances and means for the 100% IF 
cells, a square-root transformation 
was performed on all trial-to-criterion 
scores in order to reduce the effect. 
An analysis of variance was then per- 
formed comparing the transformed 
trial-to-criterion scores of the experi- 
mental groups with those obtained by 
the control groups in order to deter- 
mine whether the 200 MF/IF trials 
had acted to increase the number of 
trials to achieve a level of 10 successive
correct responses. The results showed
that only DI (F = 17.58, df = 2/108,
P < .01) and the MF X DI interaction
(F = 2.92, df = 6/108, P < .05)
reached significance. Trials needed to 
achieve criterion were not increased 
as a function of different percentages 
of MF administered during the 
MF/IF block. 
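The check that motivated the square-root transformation, a correlation between cell means and cell variances, can be sketched as below. The helper names are hypothetical, and any scores passed to them here are illustrative rather than data from the experiment.

```python
import math


def mean(xs):
    return sum(xs) / len(xs)


def variance(xs):
    """Unbiased sample variance."""
    m = mean(xs)
    return sum((x - m) ** 2 for x in xs) / (len(xs) - 1)


def pearson_r(xs, ys):
    """Product-moment correlation between two equal-length lists."""
    mx, my = mean(xs), mean(ys)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)


def sqrt_transform(scores):
    """Square-root transformation of trials-to-criterion scores."""
    return [math.sqrt(s) for s in scores]


def mean_variance_correlation(cells):
    """Correlation between cell means and cell variances."""
    return pearson_r([mean(c) for c in cells], [variance(c) for c in cells])
```

A high value of `mean_variance_correlation` on the raw scores, and a lower one after `sqrt_transform` is applied to every cell, is the pattern that would justify analyzing the transformed scores.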

Disregarding the control data, anal- 
ysis of variance performed on the 
transformed trials-to-criterion scores 
of the experimental cells provides a 
similar picture. Here only the differ- 
ences between DI groups achieve 
significance (F = 10.31, df = 2/80, 
P <.01). However, if the effect of 
prior learning is partialed out (in 
terms of terminal level of performance 
during the last two blocks of MF/IF), 
the MF variable becomes significant 
(F = 6.78, df = 2/80, P < .01) and 
the effect of DI is sharply diminished. 
Thus performance under subsequent 
IF conditions is affected by the 
terminated MF trials, but the effect is
complex and requires further explication.

Probability matching.—Probability 
matching behavior was evaluated by 
subtracting each S's attained number 
of correct responses on the terminal 
40 MF/IF scores from the theo- 
retically expected scores. These dif- 
ference scores were evaluated by 
separate t tests for each cell. The
results present a complex picture. 
Adequate matching was noted on 
five of the nine experimental cells: 
the three 12.5% MF groups, the 25% 
MF-1 DI, and the 25% MF-3 DI 
conditions. Significant negative de- 
viations from probability matching 
were noted on all three 37.5% MF 
cells and a significant positive deviation
on the 25% MF-1 DI cell. The
breakdown of probability matching in
the 37.5% MF conditions is noteworthy
but not unique, since a
similar phenomenon occurs in Pishkin's
most difficult MF conditions.
One possible explanation resides in 
the fact that Ss’ use of response 
patterns which yield positive rein- 
forcement on 67.5% of the trials are 
not distinguishably more effective 
than use of patterns which are 
successful on 50%, a level which could 
be reached by randomly responding. 
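The probability-matching test can be sketched as follows. The text does not state how the theoretically expected scores were derived; the sketch assumes that under matching the expected number of correct responses on the terminal 40 trials is 40 times the proportion of informative feedback. The function names are hypothetical.

```python
import math


def expected_correct(mf_percent, n_trials=40):
    """Expected correct responses under matching (an assumed definition)."""
    return n_trials * (1 - mf_percent / 100)


def difference_scores(attained, mf_percent, n_trials=40):
    """Expected score minus each S's attained score, as in the text."""
    expected = expected_correct(mf_percent, n_trials)
    return [expected - a for a in attained]


def one_sample_t(differences):
    """t statistic for the hypothesis that the mean difference is zero."""
    n = len(differences)
    m = sum(differences) / n
    var = sum((d - m) ** 2 for d in differences) / (n - 1)
    return m / math.sqrt(var / n)
```

The resulting t would be referred to a table with n - 1 df for each cell; a nonsignificant t is what the text calls adequate matching.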


SUMMARY 


The effect of percentage of misinformative 
feedback (MF: 0, 12.5, 25, 37.5%) and the 
number of dimensions irrelevant to solution 
(DI: 1, 3, 6) on acquisition in concept 
identification and on subsequent performance 
under 100% informative feedback (IF) were 
investigated. A total of 120 Ss served, with 
90 experimental Ss being administered 200 
MF/IF trials, then shifted to 100% IF until 
criterion was reached. 

The results were: (a) Under MF/IF conditions
significant differences occurred as a
function of MF, DI, and MF X DI, with
increasing MF and DI leading to poorer
performance. (b) Trend analysis on blocks of
trials under MF/IF revealed a curve composed
of significant linear, quadratic, and
cubic components; the linear component was
significantly affected by MF%. (c) Subsequent
100% IF learning was significantly
affected by DI; inclusion of control Ss in the
analysis led to MF X DI achieving significance.
(d) Analysis of covariance on the
100% IF data, partialing out the effect of
prior learning, revealed only a significant MF
effect. (e) Probability matching appeared in
five of nine MF/IF cells.


REFERENCES 


Archer, E. J., Bourne, L. E., Jr., & Brown,
F. G. Concept identification as a function
of irrelevant information and instructions.
J. exp. Psychol., 1955, 49, 153-164.

Goodnow, J. J., & Postman, L. Probability
learning in a problem solving situation.
J. exp. Psychol., 1955, 49, 16-22.

Grant, D. A. Analysis of variance tests in
the analysis and comparison of curves.
Psychol. Bull., 1956, 53, 141-154.

Hovland, C. I. A communication analysis
of concept learning. Psychol. Rev., 1952,
59, 461-472.

Morin, R. E. Factors influencing rate and
extent of learning in the presence of misinformative
feedback. J. exp. Psychol.,
1955, 49, 343-351.

Pishkin, V. Effects of probability of misinformation
and number of irrelevant
dimensions upon concept identification.
J. exp. Psychol., 1960, 59, 371-378.

Restle, F. A theory of discrimination learning.
Psychol. Rev., 1955, 62, 11-19.

Snedecor, G. W. Statistical methods. (4th
ed.) Ames, Iowa: State Coll. Press, 1946.


(Received December 23, 1961) 




Journal of Experimental Psychology
1962, Vol. 64, No. 6, 636-639 


RESISTANCE TO EXTINCTION AS A JOINT FUNCTION
OF REWARD MAGNITUDE AND THE SPACING
OF EXTINCTION TRIALS ¹

WINFRED F. HILL AND NORMAN E. SPEAR


Northwestern University 


The effect of reward magnitude on 
resistance to extinction is an unsettled 
question, even for that subset of 
studies in which the independent 
variable is the weight of food on 
a continuous reinforcement schedule 
and the dependent variable is the 
running speed of rats. Metzger, 
Cotton, and Lewis (1957) and Zeaman 
(1949) found that a larger reward gave 
faster running early in extinction, 
with the group curves tending to con- 
verge as extinction proceeded. This is 
what would be expected if in extinc- 
tion K (Hull, 1951; Spence, 1956) 
adjusts to the absence of reward from 
different levels. On the other hand, 
Armus (1959) and Hulse (1958) 
found faster running throughout ex- 
tinction after a smaller reward. This 
might reflect a contrast or depression 
effect for the large reward group 

The most prominent difference in 
procedure between these two sets of 
studies was in the distribution of the 
extinction trials. Metzger, Cotton, 
and Lewis and Zeaman gave massed 
extinction, whereas Armus and Hulse 
gave spaced extinction. The present 
experiment is a test of the hypothesis 
that reward magnitude and spacing of
extinction trials will interact within a 
single experiment. If confirmed, this 
relationship would be of considerable 
significance for the interpretation of 
extinction. 


1 This research was supported by Grant 
G-8706 from the National Science Foun- 
dation. 


METHOD 


The Ss were 64 experimentally naive
female albino rats of the Sprague-Dawley
strain, 74 days old at the beginning of training.
Training and extinction took place in an
enclosed runway previously described by
Lewis (1956).

Each S received six daily 3-min. sessions
of prehandling, the last session 48 hr. prior
to the beginning of experimental training.
During each session S was allowed to explore
a large unpainted wooden box, presented with
four of the pellets later to serve as reward,
and picked up and replaced at least five times
by E. A once-daily feeding schedule began
on the first day of prehandling and was
maintained throughout experimental training.
The ration was 10 gm. of finely ground Purina
lab chow and was presented 50 to 60 min.
after the start of prehandling or experimental
training.

All Ss received 25 trials of acquisition,
5 per day, and 20 trials of extinction beginning
on the sixth day. During both
acquisition and extinction, S was confined in
the goal box for a minimum of 15 sec. or
until all pellets were consumed (maximum
4 min.). Between trials on the same day S
was confined in its home cage, with water
available, for 20 sec. During extinction
the food cup was removed from the goal box.

Differential training was introduced by
way of a 2 X 2 factorial design, varying the
number of .045-gm. Noyes pellets given as
reward during acquisition (four pellets or one)
and the intertrial interval during extinction
(20 sec. or 24 hr.). Thus the four experimental
groups of 16 Ss each may be designated
according to extinction spacing or
massing and according to acquisition magnitude
as Sp-4, Sp-1, M-4, and M-1.


RESULTS AND DISCUSSION


Acquisition.—Curves of acquisition
speed are shown in Fig. 1. Double
classification analysis of variance of
the mean speeds for the last five trials
confirms the superiority of the four-pellet
condition (F = 18.90, df = 1/60,
P < .001). The dummy distribution
variable and the interaction are both
nonsignificant (Fs = 2.91 and 3.44,
respectively), indicating that the two
spacing groups were roughly equivalent
before the spacing variable was
introduced.

Fig. 1. Mean speeds in acquisition
(five trials a day) for two magnitudes
of reward and for Ss subsequently receiving massed or spaced extinction.
(Ordinate: running speed in ft/sec; abscissa: trials;
curves: Sp-4, Sp-1, M-4, M-1.)

The curves show a tendency for the 
greatest increases in speed to come 
between the end of 1 day and the 
beginning of the next. This reminis- 
cence was more marked in the later 
stages of learning and in the one- 
pellet groups, combined under these 
conditions with a marked within-days 
decrement in speed. To quantify this
reminiscence effect, a score was computed
for each S on each day by subtracting
the mean gain in speed between
each trial and the next from the
gain in speed between the last trial of
the previous day and the first trial
of the day in question. When this
score is averaged over the 4 days of
acquisition (excluding the first day,
for which it cannot be computed), the
mean is significantly positive at the
.001 level for both the four-pellet
(Sp-4 plus M-4) and the one-pellet
(Sp-1 plus M-1) groups (ts = 4.85
and 6.75, respectively, for the difference
from zero). This indicates that
the 1-day interval than over the 20- 
sec. interval. A trend analysis showed 
the overall mean to be significantly 
higher at the .05 level for the one- 
pellet than for the four-pellet group 
(F = 3.98, df = 1/62). The increase 
over trials yielded a significant F of 
3.98 (df = 3/186, P = .01) but one 
which is not quite significant for the 
1 and 62 df recommended as con- 
servative by Geisser and Greenhouse 
(1958). The F for Group X Trend 
interaction was less than 1. 
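The reminiscence score defined above can be sketched as follows. The text leaves open whether the within-day mean gain is taken over the day in question or over all days; the sketch assumes the day in question. The function name and any speeds passed to it are hypothetical.

```python
def reminiscence_scores(speeds, trials_per_day=5):
    """Daily reminiscence scores for one S.

    speeds: running speeds in trial order. For each day after the first,
    the score is the overnight gain (first trial of the day minus last
    trial of the previous day) minus the mean trial-to-trial gain within
    that day (assumption: within the day in question).
    """
    days = [speeds[i:i + trials_per_day]
            for i in range(0, len(speeds), trials_per_day)]
    scores = []
    for previous, current in zip(days, days[1:]):
        overnight_gain = current[0] - previous[-1]
        within_gains = [b - a for a, b in zip(current, current[1:])]
        mean_within_gain = sum(within_gains) / len(within_gains)
        scores.append(overnight_gain - mean_within_gain)
    return scores
```

With 25 acquisition trials at 5 per day the function returns four scores per S, one for each day after the first; their mean is the quantity tested against zero in the text.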




Extinction.—The course of extinction
is shown in Fig. 2. It is evident
that larger reward and spaced practice 
resulted in greater resistance to ex- 
tinction, the latter in spite of the 
(nonsignificant) superiority of the 
to-be-massed group in acquisition. 
The statistical reliability of these 
findings is confirmed by analysis of 
variance of mean speeds on Trials 2-6 
and Trials 16-20, with 1 and 60 df 
for all F ratios. In the analysis of 
early extinction, magnitude and spac- 
ing were both significant at the .05 
level (Fs = 4.71 and 5.18, respect- 
ively), with an F for interaction less 
than 1. In the analysis of late ex- 
tinction, magnitude was significant at 
the .05 level (F = 6.68), distribution
at the .001 level (F = 26.28), and
interaction at the .01 level (F = 7.45).
The interaction reflects the conver- 
gence of the two magnitude curves in 
the massed but not in the spaced 
condition. 


Discussion.—The main hypothesis of 
the experiment was that reward magni- 
tude has opposite effects on extinction 
depending on the spacing of trials during 
extinction. This prediction was clearly 
not confirmed. Larger reward gave 
greater resistance to extinction with both 
massed and spaced extinction, and the 
interaction of the two variables late in 
extinction was in the opposite direction 
from what was predicted. The present 
results thus confirm Metzger, Cotton, 
and Lewis (1957) and Zeaman (1949), 
as well as several studies of reward
magnitude using concentration of sucrose
in the Skinner box (e.g., Collier & Willis,
1961; Guttman, 1953). They do not,
however, explain the contradictory results
of Armus (1959) and Hulse (1958).

Fig. 2. Extinction speeds after two reward magnitudes with massed and spaced extinction.
(Ordinate: running speed in ft/sec; abscissa: trials;
curves: Sp-4, Sp-1, M-4, M-1.)

The reminiscence effect in acquisition 
was unexpected. It is possible that this
effect and the greater resistance to ex- 
tinction in the distributed group may 
both be due to the same mechanism. 
This mechanism might be either reactive 
inhibition (Hull, 1951) built up during 
massed practice or, alternatively, activity 
drive (Hill, 1956) built up during rest in 
small cages and satiated by massed 
practice. 


SUMMARY 


Rats received 25 trials of acquisition and 
20 trials of extinction in a straight alley, with 
reward magnitude (four pellets or one) and 
intertrial interval in extinction (20 sec. or 
24 hr.) varied factorially. Resistance to 
extinction was greater for large reward and 
for spaced extinction, without the interaction 
predicted from a comparison of earlier 
studies. Marked reminiscence was observed 
from day to day in acquisition. 


REFERENCES 


Armus, H. L. Effect of magnitude of reinforcement
on acquisition and extinction
of a running response. J. exp. Psychol.,
1959, 58, 61-63.

Collier, G., & Willis, F. N. Deprivation
and reinforcement. J. exp. Psychol., 1961,
62, 377-384.

Geisser, S., & Greenhouse, S. W. An
extension of Box's results on the use of the
F distribution in multivariate analysis.
Ann. math. Statist., 1958, 29, 885-891.

Guttman, N. Operant conditioning, extinction,
and periodic reinforcement in relation
to concentration of sucrose used as reinforcing
agent. J. exp. Psychol., 1953, 46,
213-224.

Hill, W. F. Activity as an autonomous
drive. J. comp. physiol. Psychol., 1956, 49,
15-19.

Hull, C. L. Essentials of behavior. New
Haven: Yale Univer. Press, 1951.

Hulse, S. H. Amount and percentage of
reinforcement and duration of goal confinement
in conditioning and extinction.
J. exp. Psychol., 1958, 56, 48-57.

Lewis, D. J. Acquisition, extinction, and
spontaneous recovery as a function of
percentage of reinforcement and intertrial
intervals. J. exp. Psychol., 1956, 51, 45-53.

Metzger, R., Cotton, J. W., & Lewis, D. J.
Effect of reinforcement magnitude and of
order of presentation of different magnitudes
on runway behavior. J. comp.
physiol. Psychol., 1957, 50, 184-188.

Spence, K. W. Behavior theory and conditioning.
New Haven: Yale Univer. Press, 1956.

Zeaman, D. Response latency as a function
of the amount of reinforcement. J. exp.
Psychol., 1949, 39, 466-483.


(Received January 4, 1962) 


Journal of Experimental Psychology 
1962, Vol. 64, No. 6, 640-645 


HIERARCHIES IN CONCEPT ATTAINMENT 


ULRIC NEISSER AND PAUL WEENE ¹


Brandeis University 


In the laboratory or out of it, new 
ideas are always built on old ones. 
To attain a typical experimental con- 
cept, say “three borders,” S must
already be able to identify borders, 
to count, to distinguish between E’s 
positive and negative statements, and 
so on. Much cognitive activity is 
hierarchically organized, in that the 
abstractions at one level form the 
basis of new abstractions at the next. 
The present experiment is an attempt 
to study hierarchical concepts ex- 
plicitly. Although only binary con- 
cepts were used (more than two 
features were never relevant), the 
range of possibilities included three 
degrees of hierarchical depth. 

There are many ways in which two 
or more features of a stimulus pattern 
may be combined into an attribute of 
higher order. For example, they may 
be conjoined: the attribute is defined 
by the joint presence of several 
features. A certain object is “of good 
quality” if it has been made skillfully 
(A), and of first-class materials (B), 
Neither feature alone is sufficient;
both together are decisive. Bruner, 
Goodnow, and Austin (1956) worked 
extensively with conjunctive prop- 
erties, but studied disjunctive attri- 
butes as well. In a disjunction, the 
presence of either property (or of
both) is sufficient to define the 
concept. A patient may have an 
allergic reaction to either strawberries 
(A) or tomatoes (B). In conjunctive 


1 The experiment was performed while
both authors were staff members of Lincoln 
Laboratory, Massachusetts Institute of Tech- 
nology, operated with support from the 
United States Army, Navy, and Air Force. 


concepts, the criterial attribute may 
be written symbolically as “A·B,”
while the corresponding notation for 
disjunctive attributes is “AvB.” 
These two cases do not exhaust the 
possibilities, even if nothing matters 
but the presence or absence of two 
distinguishing features. There are 10 
types of criterial attributes which can 
be based on one or on two features. 
They are listed in Table 1. Attributes 
based on more complex relations (“A 
followed by B,” “A within B,” and so 
on) will not be considered here. 

The 10 types of bivariate attributes 
fall naturally into three levels, as 
indicated in Table 1. The univariate 
attributes are evidently the simplest. 
Next are a group of six bivariate 
attributes, made up directly from the 
univariate ones by negating, con- 
joining, or disjoining them. Finally, 
the two most complex attributes are 
formed by disjoining certain con- 
junctive pairs. Successive levels rep- 
resent increasing complexity, not only 
in terms of the number of symbols 
needed to define the attributes, but in 
terms of a hierarchical structure. 
That is, attributes of Level II are 
combinations of those at Level I, and
are components of those at Level III. 
It must be understood that this 
ordering arises only because we have 
taken negation, conjunction, and dis- 
junction (rather than, say, double 
implication) as the basic operations, 
to be represented by elementary
symbols. The hierarchy is merely a
tautology until it is related to empirical
findings like those presented
here. In a sense, the findings of the
present experiment support the selection
of these three operations as
primitive.

TABLE 1

TYPES OF ATTRIBUTES WHICH CAN BE DEFINED BY PRESENCE
OR ABSENCE OF TWO FEATURES

Name and Symbolic             Description of Positive Instance       Example
Designation

Level I
Presence (A)                  A must be present                      Vertebrate: must have a backbone
Absence (-A)                  A must not be present (complement      Invertebrate: must not have a
                              of presence)                           backbone

Level II
Conjunction (A·B)             Both A and B must be present           Good quality: both material and
                                                                     workmanship must be first class
Disjunction (AvB)             Either A or B or both must be          Allergenic: a food which contains
                              present                                either tomatoes or strawberries
                                                                     (for example)
Exclusion (A·-B)              A must be present and B not            Eligible for driver's license: must
                              present                                have passed test and not have
                                                                     committed felony
Disjunctive absence           Either A or B, or both, must be        Poor quality: either material or
(-Av-B)                       absent (complement of conjunction)     workmanship is not first class
Conjunctive absence           A and B must both be absent            Nonallergenic: a food which contains
(-A·-B)                       (complement of disjunction)            neither tomatoes nor
                                                                     strawberries (for example)
Implication (-AvB)            A may be absent, but if A is present   Ineligible for driver's license: must
                              then B must be also; thus A            either have not passed test or
                              implies B (complement of exclusion)    have committed felony

Level III
Either/or                     Either A or B must be present, but     Negative product: either factor
(A·-B)v(-A·B)                 not both together                      negative, but not both
Both/neither                  Both A and B must be present, unless   Positive product: both factors may
(A·B)v(-A·-B)                 neither is (complement of              be negative, or neither, but not
                              either/or)                             just one

The 10 types of attributes fall into 
five complementary pairs. Anything 
that is a positive instance of one 
member of such a pair (i.e., which has 
its attribute) is a negative instance of 
the other. For example, A·B is the
complement of -Av-B because all
and only those objects which are
described by the former expression
are not covered by the latter. Symbolically,
one may find the complement
of an expression by changing
every “·” to a “v” (and vice versa)
and also every plus to a minus (and
vice versa).
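The pairing of the 10 types into complements can be checked mechanically from their truth tables. The sketch below is illustrative only; the dictionary of predicates is a direct transcription of the descriptions in Table 1, and the function names are hypothetical.

```python
from itertools import product

# Each attribute type as a predicate on the presence of features A and B.
ATTRIBUTES = {
    "presence": lambda a, b: a,
    "absence": lambda a, b: not a,
    "conjunction": lambda a, b: a and b,
    "disjunction": lambda a, b: a or b,
    "exclusion": lambda a, b: a and not b,
    "disjunctive absence": lambda a, b: (not a) or (not b),
    "conjunctive absence": lambda a, b: (not a) and (not b),
    "implication": lambda a, b: (not a) or b,
    "either/or": lambda a, b: (a and not b) or ((not a) and b),
    "both/neither": lambda a, b: (a and b) or ((not a) and (not b)),
}


def truth_table(rule):
    """Outcome of a rule for the four combinations of A and B."""
    return tuple(bool(rule(a, b)) for a, b in product([False, True], repeat=2))


def complementary_pairs():
    """Pairs of types whose truth tables disagree on every combination."""
    names = list(ATTRIBUTES)
    tables = {name: truth_table(ATTRIBUTES[name]) for name in names}
    pairs = []
    for i, first in enumerate(names):
        for second in names[i + 1:]:
            if all(x != y for x, y in zip(tables[first], tables[second])):
                pairs.append((first, second))
    return pairs
```

Calling `complementary_pairs()` yields exactly five pairs, in agreement with the complements named in Table 1.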

The experiment reported here is a
study of the relative difficulty of
attaining concepts at these several 
levels. The underlying hypothesis 
was that concepts at hierarchically 
higher levels would be more difficult 
to attain than those of lower levels. 


METHOD 


Experimental materials.—The stimulus objects
were strings of four consonants, each
string printed on a 4 X 6 in. filing card. Only
J, Q, V, X, and Z were used. Thus there were
625 distinguishable stimuli altogether (JJJJ,
JJJQ, JJJV, ..., QJQZ, ..., VQZX, ...,
ZZZZ). The concepts were defined in terms
of the presence or absence of one or of two
of these letters. The order and frequency of
the letters in the string was never relevant,
so that (for example) QQVZ was always
equivalent to VZZQ, ZVQV, and to any other
string which contained Q, Z, and V but did
not contain either J or X. Each of the types
of criterial attribute represented in Table 1
could be realized in a number of specific ways.
For example, JvX, QvZ, etc., are all disjunctions.
It can easily be verified that
altogether 110 different univariate and bivariate
attributes can be defined on these
stimuli. Any given stimulus is a positive
instance of 55 of these and negative instance
of the other 55. For example, QQVZ is a
positive instance of Q, of -X, of VvJ, of
(Q·V)v(-Q·-V), etc., and a negative
instance of -Q, of X, of -V·-J, of
(Q·-V)v(-Q·V), etc.
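The counts given above (625 stimuli, 110 attributes, 55 positive and 55 negative for any stimulus) can be verified by enumeration. The sketch below is an illustration, not the authors' procedure; the function names and the label format are hypothetical. The six symmetric types are counted once per unordered pair of letters, and exclusion and implication once per ordered pair.

```python
from itertools import combinations, permutations, product

LETTERS = "JQVXZ"


def all_stimuli():
    """All ordered strings of four letters drawn from the five consonants."""
    return ["".join(s) for s in product(LETTERS, repeat=4)]


def all_attributes():
    """Map each attribute label to a predicate on the set of letters present."""
    attrs = {}
    for a in LETTERS:                          # Level I: 5 + 5
        attrs[a] = lambda s, a=a: a in s
        attrs["-" + a] = lambda s, a=a: a not in s
    for a, b in combinations(LETTERS, 2):      # symmetric types: 6 x 10
        attrs[f"{a}·{b}"] = lambda s, a=a, b=b: a in s and b in s
        attrs[f"{a}v{b}"] = lambda s, a=a, b=b: a in s or b in s
        attrs[f"-{a}v-{b}"] = lambda s, a=a, b=b: a not in s or b not in s
        attrs[f"-{a}·-{b}"] = lambda s, a=a, b=b: a not in s and b not in s
        attrs[f"({a}·-{b})v(-{a}·{b})"] = lambda s, a=a, b=b: (a in s) != (b in s)
        attrs[f"({a}·{b})v(-{a}·-{b})"] = lambda s, a=a, b=b: (a in s) == (b in s)
    for a, b in permutations(LETTERS, 2):      # asymmetric types: 2 x 20
        attrs[f"{a}·-{b}"] = lambda s, a=a, b=b: a in s and b not in s
        attrs[f"-{a}v{b}"] = lambda s, a=a, b=b: a not in s or b in s
    return attrs


def positive_count(stimulus, attrs):
    """Number of attributes of which the stimulus is a positive instance."""
    present = set(stimulus)   # order and repetition are irrelevant
    return sum(1 for rule in attrs.values() if rule(present))
```

Since the attributes come in complementary pairs, every stimulus is a positive instance of exactly half of them, which is why the count is 55 regardless of the string chosen.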

Subjects.—The Ss were 20 students of
college age. They worked for about 3 hr. 
every morning, in groups of 5. A group of 
practiced Ss could complete about four 
problems in such a session.

Apparatus.—The sequence of stimuli for a 
given concept was arranged as a deck of cards 
and set in a wooden frame, with the front card 
concealed by a spring-loaded shutter. To 
present a stimulus, E released the shutter, 
Between trials, he closed the shutter and 
removed the front card. 

Procedure.—When the stimulus appeared,
each S, working independently, responded 
“plus” if he thought it was a positive instance 
of the attribute he was to discover, and
“minus” if he thought not. Responses were 
made by means of toggle switches which con- 
trolled appropriate indicators on a panel 
visible only to E. When all Ss had responded, 
E noted the response, informed the Ss of 
the correct answer, and then presented the 
next stimulus. No attempt to time the pres- 
entations was made, but an S who hesitated 
more than about 15 sec. was asked to guess
rather than delay further. 

All sequences of stimuli were arranged to 
make positive and negative instances equally 
probable, and successive stimuli independent.
(Appropriate sequences were prepared with an 
IBM 709 computer.) Since pure guessing 
would yield 50% correct responses, S was 
judged to have attained a concept when he 
had made 25 consecutive responses with only 
a single error. (The possibility of carelessness
made a 100% criterion inadvisable.) Ordi- 
narily, a single problem was continued for 100 
stimuli or until all Ss had reached criterion. 
The situation was kept as noncompetitive as 
possible. The group was not informed about 
the performance of any individual, and each S
responded on every trial whether or not he 
had reached criterion. 
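The criterion rule can be sketched as a scan over an S's response record. This is an illustration under assumptions: "only a single error" is read as at most one error within a run of 25 responses, the score is the number of trials preceding the earliest such run (the criterion trials themselves not being counted), and `None` stands for failure to attain criterion. The function name is hypothetical.

```python
def trials_to_criterion(correct, window=25, max_errors=1):
    """Trials preceding the first run of `window` responses that
    contains at most `max_errors` errors, or None if no such run occurs.

    correct: list of booleans in trial order (True = correct response).
    """
    for start in range(len(correct) - window + 1):
        errors = sum(1 for c in correct[start:start + window] if not c)
        if errors <= max_errors:
            return start
    return None
```

An S who is correct from the first trial scores 0, and an S who never produces a qualifying run within the trials given receives no finite score, so a group summary would have to rest on medians and quartiles rather than means.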




Before the experiment, Ss were told about 
the kinds of attributes that would be criterial. 
They were instructed that only the presence 
or absence of particular letters mattered, and 
that not more than two letters would be 
relevant. It was stressed that sequence and 
possible reduplication of letters on the cards 
was irrelevant. It was made clear that the 
absence of a letter, or of two letters, could be 
as important as its presence, and that the 
absence of one could be systematically con- 
nected with the presence of another. 

Experimental design.—The first three problems
(V, XvJ, Q·-Z) were the same for each
group, and were considered practice. Explanation
by E of potentially relevant and
irrelevant attributes continued during these
problems. Thereafter, each group of 5 Ss
was given two consecutive cycles through
the 10 types of problems described in Table 1.
The order of problems within each cycle was
varied from group to group, as were the
letters which exemplified each type of concept;
conjunction, for example, might be represented
by J·X, Q·V, Z·J, etc. Thus each S
was presented with 23 concept attainment
problems.

For two of the groups, a “nonresponding”
cycle through the 10 types was interpolated 
between the three practice trials and the first 
of the cycles mentioned above. The Ss were
shown 100 positive instances of each concept, 
and then asked to write a description of it. 
Since these groups did not differ appreciably 
from the others in their performance on the 
concept-formation cycles, the data have been 
combined for this paper. The results of the 
nonresponding cycle (and of other such cycles 
carried out at the conclusion of the main 
experiment) were too ambiguous to merit 
description here. 


RESULTS


Table 2 exhibits the median trials 
needed to reach criterion on each type 
of problem, considering the two cycles 
separately. (Means cannot be given,
because some Ss failed to attain the
criterion on some problems.) The
results support the hypothesis that
three distinct levels of difficulty are
represented. Problems of Level II
are systematically harder than those 
of Level I and easier than those of 
Level III. There is also a substantial 
practice effect: in 8 of 10 cases the 


HIERARCHIES IN CONCEPT ATTAINMENT 643 


TABLE 2


TRIALS TO CRITERION FOR DIFFERENT TYPES OF PROBLEMS


                                        Cycle 1              Cycle 2
Type of Concept                     Median   Q1/Q3       Median   Q1/Q3

Level I
  Presence (A)                       11.0    3.0/22.0      4.0    1.0/12.0
  Absence (-A)                        7.0    2.0/21.5      1.5    0.0/ 3.0
Level II
  Conjunction (A·B)                  13.0    6.0/43.5     18.0    4.5/50.5
  Disjunction (AvB)                  21.0    8.0/46.0     24.0    7.5/29.5
  Exclusion (A·-B)                   28.0   14.0/51.0     17.0    2.5/30.5
  Disjunctive absence (-Av-B)        50.0   25.0/∞        23.0    9.5/37.5
  Conjunctive absence (-A·-B)        29.0   17.0/61.0      8.0    3.0/18.0
  Implication (-AvB)                  ∞     57.5/∞        19.5    9.0/59.0
Level III
  Either/or (A·-B)v(-A·B)            68.0   47.5/∞        41.5   22.5/∞
  Both/neither (A·B)v(-A·-B)          ∞     54.5/∞        53.5   38.0/∞


Note.—Q1/Q3 indicates the first and third quartiles; N = 20 throughout. “∞” indicates that the median or
quartile S did not attain criterion. The 25 criterion trials are not included in these totals.


TABLE 3


PROPORTIONS OF Ss FOR WHOM ONE CONCEPT WAS EASIER
THAN ANOTHER: ALL CONCEPT PAIRS


Column order, left to right:
Level III: (A·B)v(-A·-B), (A·-B)v(-A·B)
Level II: -AvB, -A·-B, -Av-B, A·-B, AvB, A·B
Level I: -A

Level I
  A                Cycle 1: 16/18*  17/18*  17/19*  13/20  16/20*  14/20  14/19  11/19  7/19
                   Cycle 2: 19/19*  18/19*  17/20*  14/20  15/20*  12/18  15/19*  15/19*  3/15*
  -A               Cycle 1: 18/20*  18/19*  18/19*  16/18*  17/20*  14/20  14/17*  12/18
                   Cycle 2: 19/20*  19/20*  19/20*  15/18*  17/18*  14/17*  16/19*  17/19*
Level II
  A·B              Cycle 1: 16/17*  14/18*  12/19  11/20
                   Cycle 2: 14/18*  8/18  9/18  10/20
  AvB              Cycle 1: 18/20*  15/19*  13/20
                   Cycle 2: 17/20*  9/18  9/19
  A·-B             Cycle 1: 15/18*  15/19*
                   Cycle 2: 15/19*  12/19
  -Av-B            Cycle 1: 9/14
                   Cycle 2: 15/19*
  -A·-B            Cycle 1: 14/17*
                   Cycle 2: 19/20*
  -AvB             Cycle 1: 7/13
                   Cycle 2: 14/19
Level III
  (A·-B)v(-A·B)    Cycle 1: 10/16
                   Cycle 2: 10/17


Note.—Each numerator is the number of Ss who attained the concept of that row more quickly than the
concept of that column; each denominator is the number available for the comparison. Upper fractions are
for Cycle 1, lower fractions for Cycle 2. Comparisons involving two different levels are above and to the
left of the heavy line.
* P < .05; two-tailed binomial test.




median for the second cycle is below 
that for the first. 

In Table 3, every type of concept 
is explicitly compared with every 
other type. The comparisons, made 
separately for the two cycles, are in 
terms of the proportion of Ss who 
found one type easier than the other. 
Most proportions are based on slightly 
fewer than 20 Ss, since those who 
found the two problems equally 
difficult, or solved neither, are not 
counted. For each comparison, the 
null hypothesis is that the two con- 
cepts are equally difficult, and that 
the tabulated proportion differs from 
½ only by chance. In all those cases
where the comparison is between 
concepts of different levels, we have 
the counterhypothesis that an S is 
more likely to find the lower-level 
hypothesis easier. Table 3 is so 
arranged that the counterhypothesis 
is supported by proportions above ½,
and not by those below ½. It is also
arranged so that all cases to which the 
counterhypothesis applies (i.e., com- 
parisons between concepts at different 
levels) fall above and to the left of the 
heavy line. It appears that all but 
1 of the 56 interlevel comparisons 
are in the predicted direction. More- 
over, 39 of these proportions are 
significantly different from ½ when
considered individually. It is evident 
that levels of complexity play an 
important role in determining the 
difficulty of concept attainment. 
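
The significance markers in Table 3 can be checked with a short computation. What follows is a minimal sketch in Python, not the authors' own procedure. It assumes that the two-tailed probability is obtained by doubling the more extreme tail of a binomial distribution with p = ½; the report does not say how the two tails were combined. The function name binomial_two_tailed is hypothetical.

```python
from math import comb


def binomial_two_tailed(k, n):
    """Two-tailed exact binomial probability for k of n under p = 1/2.

    Assumption: the two-tailed value is twice the more extreme tail,
    capped at 1.0. The null distribution is symmetric, so either tail
    may be used.
    """
    extreme = max(k, n - k)
    tail = sum(comb(n, i) for i in range(extreme, n + 1)) / 2 ** n
    return min(1.0, 2 * tail)


# Proportions taken from the rows for concept A in Table 3.
for k, n in [(15, 20), (14, 20), (3, 15)]:
    print(f"{k}/{n}: P = {binomial_two_tailed(k, n):.3f}")
```

Under these assumptions 15/20 and 3/15 fall below the .05 level while 14/20 does not, which agrees with the asterisks shown for those entries.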

No prediction was made about the 
relative difficulty of concepts within 
a single level. Indeed, Table 3 shows 
proportions near ½ for most such comparisons.
But implication (-AvB)
and disjunctive absence (-Av-B)
are significantly more difficult than 
the other second-level concepts on the 
first cycle of problems. The probable 
explanation is that Ss did not fully 
understand the definition of these 


ULRIC NEISSER AND PAUL WEENE 


concepts at first. On the second cycle 
this obstacle had been overcome by 
familiarity, and these concepts lost 
their special status. There is one 
other anomalous finding: -A was
easier than A. This result is difficult
to understand, since these types differ
only in which half of the universe of
stimuli is called “plus.”


DISCUSSION


Why are higher-level concepts more 
difficult to attain? It might be supposed 
that, for complex combinatorial reasons, 
an unusually large number of stimuli is 
needed for logical elimination of competing
hypotheses when a high-level
attribute is the criterial one. We ex- 
plored this possibility by writing a com- 
puter program (for the IBM 709) which 
solves our problems by rote. It has a 
list of the 110 possible concepts, and 
checks off those which are eliminated by 
each stimulus as it appears until only one 
concept remains. On the average, this 
program needs from 8 to 12 instances to 
pinpoint the defining attribute, although 
it may occasionally take much longer 
(if the string of stimuli happens to be 
unusually redundant). Paradoxically,
the program takes slightly longer to 
identify the simple attributes (A and 
-A) than those of Level II, while the
concepts of Level III take the fewest 
trials of all! The reason seems to be that 
when a series of stimuli are all com- 
patible with a simple attribute such as 
“Z,” there is a relatively high probability 
that they will all be compatible with 
certain high-level disjunctions, such as 
ZvQ, as well. 
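
The elimination procedure described above can be sketched in a few lines of Python. This is an illustrative reconstruction under stated assumptions, not the original IBM 709 program. It assumes five letters (J, Q, V, X, Z, the letters used in the examples), because five letters yield exactly 110 concepts of the ten types; it models a stimulus as the set of letters present on a card; and the names build_concepts, all_stimuli, and eliminate are hypothetical.

```python
from itertools import combinations, permutations

LETTERS = "JQVXZ"  # assumed letter set; five letters yield 110 concepts


def build_concepts(letters=LETTERS):
    """Return {name: predicate}; a stimulus is a set of letters."""
    concepts = {}
    for a in letters:  # Level I: presence and absence
        concepts[a] = lambda s, a=a: a in s
        concepts[f"-{a}"] = lambda s, a=a: a not in s
    for a, b in combinations(letters, 2):  # types symmetric in A and B
        concepts[f"{a}.{b}"] = lambda s, a=a, b=b: a in s and b in s
        concepts[f"{a}v{b}"] = lambda s, a=a, b=b: a in s or b in s
        concepts[f"-{a}v-{b}"] = lambda s, a=a, b=b: a not in s or b not in s
        concepts[f"-{a}.-{b}"] = lambda s, a=a, b=b: a not in s and b not in s
        concepts[f"({a}.-{b})v(-{a}.{b})"] = lambda s, a=a, b=b: (a in s) != (b in s)
        concepts[f"({a}.{b})v(-{a}.-{b})"] = lambda s, a=a, b=b: (a in s) == (b in s)
    for a, b in permutations(letters, 2):  # types where order matters
        concepts[f"{a}.-{b}"] = lambda s, a=a, b=b: a in s and b not in s
        concepts[f"-{a}v{b}"] = lambda s, a=a, b=b: a not in s or b in s
    return concepts


def all_stimuli(letters=LETTERS):
    """Every subset of the letters, as frozensets (32 for five letters)."""
    return [frozenset(c) for r in range(len(letters) + 1)
            for c in combinations(letters, r)]


def eliminate(target, stimuli, concepts):
    """Check off concepts contradicted by each labeled stimulus.

    Returns (n, survivors), where n is the number of stimuli seen when
    only one concept remained, or None if more than one survived.
    """
    alive = set(concepts)
    for n, s in enumerate(stimuli, start=1):
        label = concepts[target](s)  # "plus" or "minus" under the target
        alive = {c for c in alive if concepts[c](s) == label}
        if len(alive) == 1:
            return n, alive
    return None, alive
```

Feeding eliminate a randomly drawn series of stimuli, rather than the exhaustive list, would allow the number of instances needed for each type of concept to be compared in the manner the text describes.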

We wish to emphasize that the computer
program was not written to
simulate the behavior of human Ss, but
simply to establish the rates at which the
different concepts could be attained by
logical elimination. The discovery that
human Ss do not attain concepts in this 
way is hardly surprising. 

A second explanation of the difficulty 
of attaining high-level concepts might 
appeal to the difficulty of formulating 




them verbally. Perhaps Ss find them 
unfamiliar, or cannot easily keep them in 
mind. The unexpected results with im- 
plication and disjunctive absence suggest 
that there is some validity to this 
interpretation. It is not fully adequate, 
however. The Ss seemed to have a 
better verbal understanding of either/or 
than of most of the concepts at Level II 
which were more quickly attained. 

In our opinion, higher-level concepts 
are more difficult because of their hier- 
archical organization. To identify an 
instance of (Z·Q)v(-Z·-Q) one must
have Z·Q and -Z·-Q available as
components. After all, any individual
instance of the first concept is also an
instance of one of the latter two. Moreover,
to work with Z·Q, S must know a
Z and a Q when he sees one. Thus the 
levels into which we have divided the 
possible binary concepts may correspond 
to actual levels of input analysis by Ss. 
To attain a complex concept, they must
use, and therefore must have attained,
preliminary concepts at lower levels. 


SUMMARY


Twenty Ss were employed in a study of the 
relative difficulty of attaining 10 different 
types of concepts. All types involved only 
the presence or absence of two properties, but 
some were hierarchically more complex than 
others. For example, “Both A and B” is
more complex than “A” but less complex than 
“Both A and B or neither.” The results 
indicate that the difficulty of a concept varies 
directly with its complexity. This order of 
difficulty does not appear when a computer 
program is used to attain the concepts by 
simple elimination. It seems to reflect a 
hierarchical organization of conceptual proc- 
esses in the Ss themselves. 


REFERENCE 


Bruner, J. S., Goodnow, J. J., & Austin,
G. A. A study of thinking. New York:
Wiley, 1956.


(Received January 19, 1962) 


Journal of Experimental Psychology 
1962, Vol. 64, No. 6, 646 


REPLICATION REPORT: LATENT LEARNING IN A T MAZE 
AFTER SHOCK IN ONE END BOX 


HENRY GLEITMAN AND MAGDALENA M. HERMAN
Swarthmore College 


Tolman and Gleitman (1949) have re- 
ported latent learning in a T maze with highly 
differentiated end boxes. They found ap- 
propriate choice behavior after rats were 
shocked in one or the other of the two end 
boxes, following an equal number of reinforce- 
ments on both sides. 

Method—The original experiment was 
replicated in all respects but the following: 
(a) Guillotine doors were used instead of the 
one-way doors used in the original experi- 
ment, (b) all Ss were run under 24 hr. of 
food deprivation, and (c) the location of the 
two differentiated end boxes was systemat- 
ically varied, the dark one being on the right 
side for half the Ss, and on the left for the 
other half. Finally, since there was some 
possibility that the positive results of the 
first experiment might be due to the distribu- 
tion of trials (only two trials per day, one free 
and the other forced), two conditions of dis- 
tribution were employed. 

The Ss were 38 experimentally naive 
female rats of white Angora strain, approxi- 
mately 100 days old at the beginning of the 
experiment. All Ss were reduced to 90% of 
their original body weight and kept at that 
level throughout the experiment. After 16 
trials of pretraining on a straight runway, 
they were divided into two equal groups 
roughly matched on running times during pre- 
training. Group I received two trials per day 
on the apparatus over 10 days, the first trial 
free and the second forced. Group II received 
four trials per day over 5 days, the first and 
third being free and the others forced. One 
day following their last training trial, Ss were 
placed in one of the two end boxes to find 
food, then into the other to receive two 
periods of intermittent shock. As in the
original experiment, the spatial location of the
end boxes was markedly different during this 
phase of the experiment as compared to 
training. Again as in the original experiment, 
half of the Ss were shocked in the preferred, 
the other half in the nonpreferred end box. 
The Ss were tested in the original apparatus, 
about 1 hr. after they had been shocked. 
Results.—Fourteen out of 19 Ss in Group I, 
and 13 out of 19 Ss in Group II, chose the 
side away from that on which they had been 
shocked. It is thus apparent that at least 
for this limited range of values there was no 
effect of distribution of practice on the final 
choice. Since the two groups were virtually
identical in their choice behavior, their 
results were combined and tested for sta- 
tistical significance. Chance selection of the 
harmless side could be ruled out at the 1% 
level of significance (CR = 2.60, P < .01). 
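
The reported critical ratio can be reproduced from the counts given above. The sketch below is an assumption about the computation, since the report does not state a formula: it uses the normal approximation to the binomial with a chance probability of .5 and no continuity correction. The function name critical_ratio is hypothetical.

```python
from math import sqrt


def critical_ratio(successes, n, p0=0.5):
    """Normal-approximation z for a binomial count (no continuity correction)."""
    expected = n * p0
    sd = sqrt(n * p0 * (1 - p0))
    return (successes - expected) / sd


# 14 of 19 Ss in Group I and 13 of 19 Ss in Group II chose the harmless side.
chose_harmless = 14 + 13
total = 19 + 19
cr = critical_ratio(chose_harmless, total)
print(f"CR = {cr:.2f}; proportion = {chose_harmless / total:.0%}")
```

With the combined counts of 27 out of 38 this gives a value of about 2.60 and a proportion of 71%, matching the figures reported in the text.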
While the major finding of the original 
study was substantiated in the present experi- 
ment, there is some difference in the magni- 
tude of the effects. In the present study, the 
harmless side was chosen by 71% of the Ss, 
in the original experiment by 88%. This 
difference may be due to rather strong 
turning or place preferences developed in the 
course of the present experiment, which some- 
times were strong enough to override other 
factors. Of the 11 Ss who did not choose the
harmless side on the final test, 9 were Ss
who had been shocked in the preferred end box.


REFERENCE 


Tolman, E. C., & Gleitman, H. Studies in learning
and motivation: I. Equal reinforcements in both end-boxes
followed by shock in one end-box. J. exp.
Psychol., 1949, 39, 810-819.


(Received October 7, 1961) 


SUPPLEMENTARY REPORT: THE WEINSTOCK PARTIAL
REINFORCEMENT EFFECT AND HABIT REVERSAL


LEON M. WISE
Heidelberg College


Sheffield (1949) found the partial reinforcement
effect (PRE) for massed acquisition
(15-sec. intertrial interval) but not for
distributed acquisition (15-min. intertrial
interval). Weinstock (1954, 1958) found the
PRE under widely spaced trials (24-hr.
intertrial interval). Both of the investigators
used a simple running response. Wike (1953)
and Grosslight and Radlow (1954) found the
PRE for massed acquisition in a habit
reversal discrimination problem. The purpose
of the present experiment was to determine
whether or not the PRE would be
present in a habit reversal discrimination
problem with a 24-hr. intertrial interval.

Method.—A 2 × 3 factorial design was
used incorporating 100%, 70%, and 40%
reinforcement, and 20-sec. and 24-hr. intertrial
intervals. The Ss were 60 experimentally
naive male albino rats. A Y alley discrimination
apparatus was employed. Stimuli and
trial procedures were essentially the same as
in Grosslight and Radlow's experiment. All
Ss were given 40 acquisition trials and 40
habit reversal trials.

Results and discussion.—Figure 1 shows
the mean number of correct responses for all
groups for both acquisition and reversal. An
analysis of covariance for the first five trials
in massed habit reversal shows statistically
significant differences among the 100%, 70%,
and 40% groups (F = 19.08; P < .01) with
the 100% group showing the least resistance
to extinction (fastest reversal). Additional
analyses of covariance at successive five-trial
intervals continue to show statistically
significant differences in the same direction.
This is in agreement with Sheffield's findings
for massed acquisition. An analysis of
covariance for the first five trials of distributed
habit reversal shows no statistically
significant differences among the three
groups. However, a similar analysis conducted
on the second five trials shows
statistically significant differences (F = 3.93;
P < .05) with the 100% group showing the
least resistance to extinction. Subsequent
analyses conducted at successive five-trial
intervals showed even greater statistical
significance. This finding does not agree with
Sheffield’s results for distributed acquisition.
It does, however, substantiate the findings of
Weinstock.


[Figure 1. Ordinate: MEAN NUMBER CORRECT RESPONSES. Abscissa: BLOCKS OF FIVE TRIALS. Surviving panel labels: DISTRIBUTED, HABIT REVERSAL.]

FIG. 1. Mean number of correct responses for all groups throughout acquisition
and habit reversal in blocks of five trials.

The present experiment can be added to a
growing body of studies denying the Sheffield
aftereffects hypothesis. There seems to
be little doubt now but that PREs can be
obtained under both massed and distributed
conditions and must be accounted for by any
theory attempting to explain PREs. Whether
or not Weinstock’s habituation hypothesis is
the correct interpretation the writer cannot
say, but the present data are in agreement
with it.


REFERENCES


Grosslight, J. H., & Radlow, R. Reinforcement
schedules in habit reversal: A confirmation. J. exp.
Psychol., 1954, 48, 173-174.

Sheffield, V. F. Extinction as a function of partial
reinforcement and distributed practice. J. exp.
Psychol., 1949, 39, 511-525.

Weinstock, S. Resistance to extinction of a running
response following partial reinforcement under widely
spaced trials. J. comp. physiol. Psychol., 1954, 47,
310-322.

Weinstock, S. Acquisition and extinction of a partially
reinforced running response at a 24-hour intertrial
interval. J. exp. Psychol., 1958, 56, 151-158.

Wike, E. L. Extinction of a partially and continuously
reinforced response with and without a rewarded
alternative. J. exp. Psychol., 1953, 46, 255-260.


(Received November 2, 1961)


Journal of Experimental Psychology
1962, Vol. 64, No. 6, 648-649


SUPPLEMENTARY REPORT: THE UTILITY OF CORRECTLY
PREDICTING INFREQUENT EVENTS


YVONNE BRACKBILL
University of Colorado Medical School

AND

ANTHONY BRAVOS
Johns Hopkins University


Brackbill, Kappy, and Starr (1962) found
that maximum gain responding increased
with increasing amounts of reward for correct
prediction. The authors’ expectation that a
first-order sequence analysis of their data
would show previous actual occurrence, rather
than previous prediction, to be the only
reliable predictor from Trial n - 1 to Trial n,
was not confirmed for n - 1 trials on which
the less frequent event actually occurred.
Maximum gain responding more often followed
success in predicting the less frequent
event than lack of success in predicting it.
This effect suggested a second, independent
source of reinforcement: the utility to S of
correctly predicting the occurrence of the less
frequent event. Whatever the interpretation,
the sequence analysis findings are not directly
predictable from reinforcement theory nor
from current theories of probability learning
(cf. Suppes & Atkinson, 1960). It seemed
advisable, therefore, to find out whether these
results were reproducible and whether their
occurrence was limited to the particular
values of the experimental parameters of the
original study.

Method.—First-order sequence analyses
were performed on 12 independent sets of


noncontingent probability learning data previously
collected. These data were obtained
under the same experimental conditions as
those of the Brackbill, Kappy, and Starr (1962)
study except for variation of the following
parameters: amount of tangible reward given
for a correct prediction; number of stimulus
events; relative frequency of occurrence of
the stimulus events; number of Ss; S’s age
and grade in school; and number and series
position of the asymptotic trials within each
sequence analysis. Table 1 shows the value
used for each of these parameters for each
of the 12 groups of the present study as well
as the four groups of the original experiment
(Rows 2-5). In Table 1, the letters M and L
stand for the more (or most) and less (or
least) frequent events. Under “tangible
reward,” 1 M or L shows that one unit of
reward was given for a correct prediction of
either event, and 1 M: 4 L shows that one
unit of reward was given for a correct prediction
of the more frequent event, four units
for a correct prediction of the less frequent
event. A unit of reward was 1 marble for the
younger Ss and 1 point for the older Ss;
100 marbles were exchanged for one toy, and
100 points for $1.00. In the last five rows




TABLE 1


SUMMARY OF SUCCESSIVE TRIAL CONTINGENCIES FOR 16 SETS OF PROBABILITY LEARNING DATA
OBTAINED UNDER VARYING EXPERIMENTAL CONDITIONS


Column headings: Tangible Reward; No. and Relative Frequency of Occurrence of
Stimulus Events; No. and Position of Trials; Mean Probability of Predicting Event M
on Trial n, Given Prediction (p) and Occurrence (o) on Trial n - 1.

Tangible       Relative      Position
Reward         Frequency     of Trials     MpMo    LpMo    MpLo

               75:25         321-400       .77     J       .56
               225           101-200       .74     .82     .38
1 M or L                     101-200       .80     .83     6S
3 M or L                     101-200       .82     .87     .67
5 M or L                     101-200       .89     .87     OF
1 M : 4 L                    121-200       .76     .79     .50
1 M : 3 L                    121-200       .83     .70     .68
2 M : 3 L                    121-200       .80     .84     .68
1 M : 4 L                    121-200       .74     .57     .71
1 M : 3 L                    121-200       .72     .62     .63
2 M : 3 L                    121-200       .84     .38     .70
None                         301-400       .47     .48     .27
None                         201-400       .60     .63     .22
None                         201-400       .76     .68     .29
None                         301-400       .71     .77     .36
None                         301-400       .86     .72     .67


Column 2, the frequencies in parentheses 
indicate that Table 1 does not include the 
sequence analysis results for the stimulus 
events of intermediate frequency under the 
three-stimulus conditions. 

Results and discussion.—The last four
columns of Table 1 show the mean probabilities
of predicting Event M on Trial n,
given the prediction (p) and actual occurrence
(o) on Trial n - 1. Thus, for example, the
entry in the upper right-hand cell indicates
that, for those instances in which Ss had
predicted the less frequent event (Lp) on
Trial n - 1, and the less frequent event had
actually occurred (Lo) on Trial n - 1, the
mean probability of predicting the more
frequent event (M) on Trial n was .68.

The question under investigation is
whether S's prediction on Trial n is determined
by the nature of his previous prediction
as well as by the previous actual occurrence
or reinforcement. Therefore, it is appropriate
to compare the MpMo to the LpMo probabilities
and the LpLo to the MpLo probabilities.
For the present data, shown in
Rows 1 and 6-16 of Table 1, the mean value
of MpMo exceeds that of LpMo in 6 cases out of
12, while the mean value of LpLo exceeds that
of MpLo in 10 cases out of 12 (P = .04, by
binomial expansion). For all 16 sets of data,
the mean value of LpLo exceeds that of MpLo
in 14 cases (P = .004).
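
The two reported probabilities can be checked by hand. The worked computation below assumes that “binomial expansion” refers to an exact binomial test with a chance probability of ½ and that the reported values are two-tailed, obtained by doubling the upper tail; the report states neither point, but these assumptions reproduce both figures.

```latex
\begin{align*}
P(X \ge 10 \mid n = 12) &= \frac{\binom{12}{10} + \binom{12}{11} + \binom{12}{12}}{2^{12}}
  = \frac{66 + 12 + 1}{4096} \approx .019,
  \qquad 2 \times .019 \approx .04 \\[4pt]
P(X \ge 14 \mid n = 16) &= \frac{\binom{16}{14} + \binom{16}{15} + \binom{16}{16}}{2^{16}}
  = \frac{120 + 16 + 1}{65536} \approx .002,
  \qquad 2 \times .002 \approx .004
\end{align*}
```

By the same reasoning, 6 cases out of 12 is exactly the number expected by chance, so the comparison of MpMo with LpMo shows no reliable difference.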

In spite of wide variations within several 
experimental parameters, the same result 
has emerged as before. In order to maximize 
prediction to Trial n from preceding trials on
which the less or least frequent event occurred, 
it is necessary to consider S's previous pre- 
diction in addition to the previous actual 
occurrence. Also, the direction of the effect 
in the present results supports the original 
interpretation: that there is a relatively 
greater utility to S of correctly predicting the 
occurrence of the less (or least) frequent 
event. It would be interesting to see if the 
same phenomenon might occur generally in 
any type of learning situation in which S, 
finding E's “game” tedious and uninteresting,
can and does invent one of his own. 


REFERENCES 


Brackbill, Y., Kappy, M. S., & Starr, R. H. Magnitude
of reward and probability learning. J. exp.
Psychol., 1962, 63, 32-35.

Suppes, P., & Atkinson, R. C. Markov learning models
for multiperson interactions. Stanford: Stanford
Univer. Press, 1960.


(Received November 16, 1961) 


Journal of Experimental Psychology
1962, Vol. 64, No. 6, 650


SUPPLEMENTARY REPORT: FREQUENCY OF STIMULUS PRESENTATION
AND SHORT-TERM DECREMENT IN RECALL¹


S. HELLYER


Defence Research Medical Laboratories, Toronto, Canada


Peterson and Peterson (1959) report clear- 
cut evidence of a progressive improvement in 
recall scores with an increase in the number 
of repetitions of the material by S before the 
delay of recall began. Certain anomalies in 
their data suggested the obtained differences 
might have resulted from E unintentionally
interfering with S's response pattern. The 
present experiment repeated the Petersons’ 
work using visually presented material to 
reduce the likelihood of inadvertent in- 
terference. 

Method—The verbal items were three- 
consonant units with a Witmer association 
value no greater than 33%. The material
used to keep Ss active during the recall delay 
interval consisted of groups of three randomly 
selected digits. 

One three-consonant unit and a series of 
digit groups were typed as a list on a memory 
drum tape. There were eight such lists on a 
tape and a pair of tapes constituted a set of 
all 16 experimental conditions in random 
order, i.e., one, two, four, and eight repetitions 
of the three-consonant unit and recall delay 
intervals of 3, 9, 18, and 27 sec. The stimuli 
were displayed at a rate of 1/sec. 
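
The arrangement of conditions on a pair of tapes can be illustrated with a short Python sketch. It is only an illustration of the design as described: the report does not say how the random order was produced, so the shuffling method, the seed argument, and the name build_tape_pair are assumptions.

```python
import random
from itertools import product

REPETITIONS = (1, 2, 4, 8)    # presentations of the three-consonant unit
DELAYS_SEC = (3, 9, 18, 27)   # recall delay intervals


def build_tape_pair(seed=None):
    """Return two tapes of eight (repetitions, delay) conditions each.

    The 16 conditions are the full crossing of repetitions and delays,
    placed in a random order and split across a pair of tapes.
    """
    conditions = list(product(REPETITIONS, DELAYS_SEC))
    random.Random(seed).shuffle(conditions)
    return conditions[:8], conditions[8:]


tape_a, tape_b = build_tape_pair(seed=1)
for number, tape in enumerate((tape_a, tape_b), start=1):
    print(f"Tape {number}: {tape}")
```

Each call yields one set of all 16 experimental conditions, eight lists to a tape, so that every combination of repetitions and delay occurs once per pair.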

Five seconds after E started the memory
drum, a green star appeared in the window 
as a warning that the consonant unit was 
about to appear. The S was instructed to 
read aloud what appeared in the window and 
not to anticipate what might appear next.
This was done to decrease the possibility 
that S was preparing for another rehearsal as 
the three digits of the intervening activity 
appeared. When the entire list had been 
presented, a red star appeared as a signal to 
recall the consonants presented at the start of 
the list. The intervening activity consisted 
in reading groups of three digits. After recall 
of the consonants was completed, S was
required to make two judgments about these 
numbers, estimates as to which digit had 
appeared least frequently and which most 
frequently. This was done to make the 
number task a more meaningful part of the 
experiment. On each of the 5 days, a 


¹ Defence Research Medical Laboratories Project No.
246, DRML Report No. 246-18, PCC No. DIT
le - NO. o 4 


TABLE 1

PROPORTIONS OF ITEMS CORRECTLY RECALLED


Number of              Recall Delay Interval (Sec.)
Presentations          3       9       18      27

      8               .99     .89     .74     .66
      4               .94     .73     .56     .46
      2               .92     .54     .31     .22
      1               .89     .38     .21     .14


different pair of lists was presented in a 
random order to each S. The 25 paid Ss
were housewives.

Results and discussion —Following Peter- 
son and Peterson (1959), an item was con- 
sidered to be correctly recalled only if every 
consonant was correct and in its proper 
position. Table 1 records the mean proportion
of items recalled correctly for 25 Ss on
5 days.

These data confirm the Petersons’ conclusions
that there is better recall with an
increase in the number of stimulus repetitions 
and with shorter periods of delay before 
recall. 


An analysis of variance showed that number
of presentations and recall delay interval are
both significant (P < .01). The only significant
interaction was recall delay with number
of presentations. This arises because the
effect of an increase in recall delay was more
pronounced on the trials where the consonant
groups were presented once or twice than
when they were presented four or eight times.

The Ss in the present experiment obtained 
markedly higher recall scores than those 
reported by Peterson and Peterson, perhaps 
because in the present study the stimuli were 
presented both visually and aurally. Inspection
of the present data, confirmed by an
analysis of variance, yields no evidence for 
learning over blocks of trials. 


REFERENCE 
Peterson, L. R., & Peterson, M. J. Short-term
retention of individual verbal items. J. exp. Psychol.,
1959, 58, 193-198.


(Received November 25, 1961) 


